• 제목/요약/키워드: non-linear activation

검색결과 52건 처리시간 0.03초

뉴런 활성화 경사 최적화를 이용한 개선된 플라즈마 모델 (An improved plasma model by optimizing neuron activation gradient)

  • 김병환;박성진
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2000년도 제15차 학술회의논문집
    • /
    • pp.20-20
    • /
    • 2000
  • Back-propagation neural network (BPNN) is the most prevalently used paradigm in modeling semiconductor manufacturing processes, which as a neuron activation function typically employs a bipolar or unipolar sigmoid function in either hidden and output layers. In this study, applicability of another linear function as a neuron activation function is investigated. The linear function was operated in combination with other sigmoid functions. Comparison revealed that a particular combination, the bipolar sigmoid function in hidden layer and the linear function in output layer, is found to be the best combination that yields the highest prediction accuracy. For BPNN with this combination, predictive performance once again optimized by incrementally adjusting the gradients respective to each function. A total of 121 combinations of gradients were examined and out of them one optimal set was determined. Predictive performance of the corresponding model were compared to non-optimized, revealing that optimized models are more accurate over non-optimized counterparts by an improvement of more than 30%. This demonstrates that the proposed gradient-optimized teaming for BPNN with a linear function in output layer is an effective means to construct plasma models. The plasma modeled is a hemispherical inductively coupled plasma, which was characterized by a 24 full factorial design. To validate models, another eight experiments were conducted. process variables that were varied in the design include source polver, pressure, position of chuck holder and chroline flow rate. Plasma attributes measured using Langmuir probe are electron density, electron temperature, and plasma potential.

  • PDF

비선형, 비정상 시계열 예측을 위한 RBF(Radial Basis Function) 회로망 구조 (RBF Network Structure for Prediction of Non-linear, Non-stationary Time Series)

  • 김상환;이종호
    • 대한전기학회논문지:전력기술부문A
    • /
    • 제48권2호
    • /
    • pp.168-175
    • /
    • 1999
  • In this paper, a modified RBF(Radial Basis Function) network structure is suggested for the prediction of a time-series with non-linear, non-stationary characteristics. Coventional RBF network predicting time series by using past outputs sense the trajectory of the time series and react when there exists strong relation between input and hidden activation function's RBF center. But this response is highly sensitive to level and trend of time serieses. In order to overcome such dependencies, hidden activation functions are modified to react to the increments of input variable and multiplied by increment(or dectement) for prediction. When the suggested structure is applied to prediction of Macyey-Glass chaotic time series, Lorenz equation, and Rossler equation, improved performances are obtained.

  • PDF

활성화 함수에 따른 유출량 산정 인공신경망 모형의 성능 비교 (Comparison of Artificial Neural Network Model Capability for Runoff Estimation about Activation Functions)

  • 김마가;최진용;방재홍;윤푸른;김귀훈
    • 한국농공학회논문집
    • /
    • 제63권1호
    • /
    • pp.103-116
    • /
    • 2021
  • Analysis of runoff is substantial for effective water management in the watershed. Runoff occurs by reaction of a watershed to the rainfall and has non-linearity and uncertainty due to the complex relation of weather and watershed factors. ANN (Artificial Neural Network), which learns from the data, is one of the machine learning technique known as a proper model to interpret non-linear data. The performance of ANN is affected by the ANN's structure, the number of hidden layer nodes, learning rate, and activation function. Especially, the activation function has a role to deliver the information entered and decides the way of making output. Therefore, It is important to apply appropriate activation functions according to the problem to solve. In this paper, ANN models were constructed to estimate runoff with different activation functions and each model was compared and evaluated. Sigmoid, Hyperbolic tangent, ReLU (Rectified Linear Unit), ELU (Exponential Linear Unit) functions were applied to the hidden layer, and Identity, ReLU, Softplus functions applied to the output layer. The statistical parameters including coefficient of determination, NSE (Nash and Sutcliffe Efficiency), NSEln (modified NSE), and PBIAS (Percent BIAS) were utilized to evaluate the ANN models. From the result, applications of Hyperbolic tangent function and ELU function to the hidden layer and Identity function to the output layer show competent performance rather than other functions which demonstrated the function selection in the ANN structure can affect the performance of ANN.

Kinetic Modeling for Quality Prediction During Kimchi Fermentation

  • Chung, Hae-Kyung;Yeo, Kyung-Mok;Kim, Nyung-Hwan
    • Preventive Nutrition and Food Science
    • /
    • 제1권1호
    • /
    • pp.41-45
    • /
    • 1996
  • This study was conducted to develop the fermentation kinetic model for the prediction of acidity and pH changes in Kimchi as a function of fermentation temperatures. The fitness of the model was evaluated using traditional two-step method and an alternative non-linear regression method. The changes in acidity and pH during fermentation followed the pattern of the first order reaction of a two-step method. As the fermentation temperature increased from 4$^{\circ}C$ to 28, the reaction rates of acidity and pH were increased 8.4 and 7.6 times, respectively. The activation energies of acidity and pH were 16.125 and 16.003kcal/mole. The average activation energies of acidity and pH using a non-linear method were 16.006 by the first order and 15.813 kcal/mole by the zero order, respectively. The non-linear procedure had better fitting 개 experimental data of the acidity and pH than two-step method. The shelf-lives based on the time to reach the 1.0% of acidity were 33.1day at 4$^{\circ}C$ and 2.8 day 28$^{\circ}C$.

  • PDF

파라메트릭 활성함수를 이용한 심층신경망의 성능향상 방법 (Performance Improvement Method of Deep Neural Network Using Parametric Activation Functions)

  • 공나영;고선우
    • 한국콘텐츠학회논문지
    • /
    • 제21권3호
    • /
    • pp.616-625
    • /
    • 2021
  • 심층신경망은 임의의 함수를 근사화하는 방법으로 선형모델로 근사화한 후에 비선형 활성함수를 이용하여 추가적 근사화를 반복하는 근사화 방법이다. 이 과정에서 근사화의 성능 평가 방법은 손실함수를 이용한다. 기존 심층학습방법에서는 선형근사화 과정에서 손실함수를 고려한 근사화를 실행하고 있지만 활성함수를 사용하는 비선형 근사화 단계에서는 손실함수의 감소와 관계가 없는 비선형변환을 사용하고 있다. 본 연구에서는 기존의 활성함수에 활성함수의 크기를 변화시킬 수 있는 크기 파라메터와 활성함수의 위치를 변화시킬 수 있는 위치 파라미터를 도입한 파라메트릭 활성함수를 제안한다. 파라메트릭 활성함수를 도입함으로써 활성함수를 이용한 비선형 근사화의 성능을 개선시킬 수 있다. 각 은닉층에서 크기와 위치 파라미터들은 역전파 과정에서 파라미터들에 대한 손실함수의 1차 미분계수를 이용한 학습과정을 통해 손실함수 값을 최소화시키는 파라미터를 결정함으로써 심층신경망의 성능을 향상시킬 수 있다. MNIST 분류 문제와 XOR 문제를 통하여 파라메트릭 활성함수가 기존의 활성함수에 비해 우월한 성능을 가짐을 확인하였다.

콘크리트 강도예측을 위한 적산온도 함수의 활성화에너지에 관한 연구 (A Study on the Activation Energy of Maturity Function for Prediction of Concrete Strength)

  • 장종호;강용식;김용로;길배수;남재현;김무한
    • 한국콘크리트학회:학술대회논문집
    • /
    • 한국콘크리트학회 2002년도 가을 학술발표회 논문집
    • /
    • pp.81-84
    • /
    • 2002
  • Activation energy value is different according to cement, admixture and water-cement ratio also the relation of age-temperature is as non-linear as activation energy value is large. So to make accurate explanation for the effect of temperature on concrete strength development property, it is necessary to investigation for activation energy value. This study compares activation energy value recommended by Freiesleben and ASTM with activation energy value obtained by consequence of mortar examination according to ASTM C 1074-93. As the result of this study, activation energy value obtained by the study is 37.19KJ/mol, and in case of activation energy value obtained by the study explain temperature's influence about concrete strength development more accurate than activation energy value recommend by Freiesleben and ASTM.

  • PDF

Function Approximation Based on a Network with Kernel Functions of Bounds and Locality : an Approach of Non-Parametric Estimation

  • Kil, Rhee-M.
    • ETRI Journal
    • /
    • 제15권2호
    • /
    • pp.35-51
    • /
    • 1993
  • This paper presents function approximation based on nonparametric estimation. As an estimation model of function approximation, a three layered network composed of input, hidden and output layers is considered. The input and output layers have linear activation units while the hidden layer has nonlinear activation units or kernel functions which have the characteristics of bounds and locality. Using this type of network, a many-to-one function is synthesized over the domain of the input space by a number of kernel functions. In this network, we have to estimate the necessary number of kernel functions as well as the parameters associated with kernel functions. For this purpose, a new method of parameter estimation in which linear learning rule is applied between hidden and output layers while nonlinear (piecewise-linear) learning rule is applied between input and hidden layers, is considered. The linear learning rule updates the output weights between hidden and output layers based on the Linear Minimization of Mean Square Error (LMMSE) sense in the space of kernel functions while the nonlinear learning rule updates the parameters of kernel functions based on the gradient of the actual output of network with respect to the parameters (especially, the shape) of kernel functions. This approach of parameter adaptation provides near optimal values of the parameters associated with kernel functions in the sense of minimizing mean square error. As a result, the suggested nonparametric estimation provides an efficient way of function approximation from the view point of the number of kernel functions as well as learning speed.

  • PDF

뇌졸중 환자의 햅틱 로봇 기반 상지 재활 시 근육 동시활성도 분석 (Muscle Coactivation Analysis during Upper-Limb Rehabilitation using Haptic Robotics in Stroke Survivors)

  • 오건영
    • 대한의용생체공학회:의공학회지
    • /
    • 제45권2호
    • /
    • pp.66-74
    • /
    • 2024
  • This study analyzed the occurrence of abnormal muscle coactivations based on the assistance of upper limb weight during reaching task in stroke patients. Nine chronic stroke survivors with hemiplegia performed reaching tasks using a programmable haptic robot. Electromyography (EMG) coactivation levels in the upper limb muscles were analyzed using a linear model describing the activation levels of two muscles when the patient's upper limb weight was assisted at 0%, 25%, and 50%. As the upper limb weight assistance of the haptic robot decreased, the magnitude of the EMG signal in both the deltoid and biceps muscles increased simultaneously on both the paretic and non-paretic sides. However, no difference was found between the paretic and non-paretic sides when comparing the slope of the linear model describing the activation relationship between the deltoid and biceps. The aforementioned results suggest that in some stroke survivors, the deltoids, triceps, and biceps on the paretic side may not be abnormally coupled when supporting the upper limbs against gravity. Furthermore, these results suggest that the combination of haptic robots and EMG analysis might be utilized for evaluating abnormal coactivations in stroke patients.

Evaluation of Structure Development of Xanthan and Carob Bean Gum Mixture Using Non-Isothermal Kinetic Model

  • Yoon, Won-Byong;Gunasekaran, Sundaram
    • Food Science and Biotechnology
    • /
    • 제16권6호
    • /
    • pp.954-957
    • /
    • 2007
  • Gelation mechanism of xanthan-carob mixture (X/C) was investigated based on thermorheological behavior. Three X/C ratios (1:3, 1:1, and 3:1) were studied. Small amplitude oscillatory shear tests were performed to measure linear viscoelastic behavior during gelation. Temperature sweep ($-1^{\circ}C/min$) experiments were conducted. Using a non-isothermal kinetic model, activation energy (Ea) during gelation was calculated. At 1% total concentration, the Ea for xanthan fraction (${\phi}_x$)=0.25, 0.5, and 0.75 were 178, 159, and 123 kJ/mol, respectively. However, a discontinuity was observed in the activation energy plots. Based on this, two gelation mechanisms were presumed-association of xanthan and carob molecules and aggregation of polymer strands. The association process is the primary mechanism to form 3-D networks in the initial stage of gelation and the aggregation of polymer strands played a major role in the later stage.

자기연상 다층퍼셉트론의 이상 탐지 성질 분석 (Analysis of Novelty Detection Properties of Autoassociative MLP)

  • 이형주;황병호;조성준
    • 대한산업공학회지
    • /
    • 제28권2호
    • /
    • pp.147-161
    • /
    • 2002
  • In novelty detection, one attempts to discriminate abnormal patterns from normal ones. Novelty detection is quite difficult since, unlike usual two class classification problems, only normal patterns are available for training. Auto-Associative Multi-Layer Perceptron (AAMLP) has been shown to provide a good performance based upon the property that novel patterns usually have larger auto-associative errors. In this paper, we give a mathematical analysis of 2-layer AAMLP's output characteristics and empirical results of 2-layer and 4-layer AAMLPs. Various activation functions such as linear, saturated linear and sigmoid are compared. The 2-layer AAMLPs cannot identify non-linear boundaries while the 4-layer ones can. When the data distribution is multi-modal, then an ensemble of AAMLPs, each of which is trained with pre-clustered data is required. This paper contributes to understanding of AAMLP networks and leads to practical recommendations regarding its use.