• 제목/요약/키워드: hidden layer

검색결과 511건 처리시간 0.027초

Artificial Neural Networks for Flood Forecasting Using Partial Mutual Information-Based Input Selection

  • Jae Gyeong Lee;Li Li;Kyung Soo Jun
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2023년도 학술발표회
    • /
    • pp.363-363
    • /
    • 2023
  • Artificial Neural Networks (ANN) is a powerful tool for addressing various practical problems and it has been extensively applied in areas of water resources. In this study, Artificial Neural Networks (ANNs) were developed for flood forecasting at specific locations on the Han River. The Partial Mutual Information (PMI) technique was used to select input variables for ANNs that are neither over-specified nor under-specified while adequately describing the underlying input-output relationships. Historical observations including discharges at the Paldang Dam, flows from tributaries, water levels at the Paldang Bridge, Banpo Bridge, Hangang Bridge, and Junryu gauge station, and time derivatives of the observed water levels were considered as input candidates. Lagged variables from current time t to the previous five hours were assumed to be sufficient in this study. A three-layer neural network with one hidden layer was used and the neural network was optimized by selecting the optimal number of hidden neurons given the selected inputs. Given an ANN architecture, the weights and biases of the network were determined in the model training. The use of PMI-based input variable selection and optimized ANNs for different sites were proven to successfully predict water levels during flood periods.

  • PDF

MAGRU: Multi-layer Attention with GRU for Logistics Warehousing Demand Prediction

  • Ran Tian;Bo Wang;Chu Wang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제18권3호
    • /
    • pp.528-550
    • /
    • 2024
  • Warehousing demand prediction is an essential part of the supply chain, providing a fundamental basis for product manufacturing, replenishment, warehouse planning, etc. Existing forecasting methods cannot produce accurate forecasts since warehouse demand is affected by external factors such as holidays and seasons. Some aspects, such as consumer psychology and producer reputation, are challenging to quantify. The data can fluctuate widely or do not show obvious trend cycles. We introduce a new model for warehouse demand prediction called MAGRU, which stands for Multi-layer Attention with GRU. In the model, firstly, we perform the embedding operation on the input sequence to quantify the external influences; after that, we implement an encoder using GRU and the attention mechanism. The hidden state of GRU captures essential time series. In the decoder, we use attention again to select the key hidden states among all-time slices as the data to be fed into the GRU network. Experimental results show that this model has higher accuracy than RNN, LSTM, GRU, Prophet, XGboost, and DARNN. Using mean absolute error (MAE) and symmetric mean absolute percentage error(SMAPE) to evaluate the experimental results, MAGRU's MAE, RMSE, and SMAPE decreased by 7.65%, 10.03%, and 8.87% over GRU-LSTM, the current best model for solving this type of problem.

티타늄 합금의 변형률속도 및 온도를 고려한 인공신경망 기반 경화모델 성능평가 (Evaluation of Performance of Artificial Neural Network based Hardening Model for Titanium Alloy Considering Strain Rate and Temperature)

  • 김민기;임성식;김용배
    • 소성∙가공
    • /
    • 제33권2호
    • /
    • pp.96-102
    • /
    • 2024
  • This study addresses evaluation of performance of hardening model for a titanium alloy (Ti6Al4V) based on the artificial neural network (ANN) regarding the strain rate and the temperature. Uniaxial compression tests were carried out at different strain rates from 0.001 /s to 10 /s and temperatures from 575 ℃ To 975 ℃. Using the experimental data, ANN models were trained and tested with different hyperparameters, such as size of hidden layer and optimizer. The input features were determined with the equivalent plastic strain, strain rate, and temperature while the output value was set to the equivalent stress. When the number of data is sufficient with a smooth tendency, both the Bayesian regulation (BR) and the Levenberg-Marquardt (LM) show good performance to predict the flow behavior. However, only BR algorithm shows a predictability when the number of data is insufficient. Furthermore, a proper size of the hidden layer must be confirmed to describe the behavior with the limited number of the data.

은닉노드의 특징 값을 기반으로 한 최적신경망 구조의 BPN성능분석 (Performance Analysis of Optimal Neural Network structural BPN based on character value of Hidden node)

  • 강경아;이기준;정채영
    • 한국컴퓨터정보학회논문지
    • /
    • 제5권2호
    • /
    • pp.30-36
    • /
    • 2000
  • 은닉노드는 주어진 문제에서 입력패턴(input pattern)들의 특징을 구분해주는 중요한 역할을 한다. 이 때문에 최적의 은닉노드 수로 구성된 신경망 구조가 성능에 가장 큰 영향을 주는 요인으로 중요성이 대두되고 있다. 그러나 역전파(back-propagation) 학습 알고리즘을 기반으로 하여 은닉노드 수를 결정하는데는 문제점이 있다. 은닉노드 수가 너무 적게 지정되면 주어진 입력패턴을 충분히 구분할 수 없게 되어 완전한 학습이 이루어지지 않는 반면, 너무 많이 지정하면 불필요한 연산의 실행과 기억장소의 낭비로 과적응(overfitting)이 일어나 일반성이 떨어져 인식률이 낮아지기 때문이다. 따라서 본 논문에서는 백 프로퍼게이션 알고리즘을 이용하여 학습을 수행하는 다층 신경망의 학습오차 감소와 수렴율 개선을 위하여 신경망을 구성하는 매개변수를 가지고 은닉노드의 특징 값을 구하고, 그 값은 은닉노드를 제거(pruning)하기 위한 평가치로 사용된다. 구해진 특징 값 중 최대 값과 최소 값을 갖는 노드를 감소(pruning)대상에서 제외하고 나머지 은닉노드 특징 값의 평균과 각 은닉노드의 특징 값을 비교하여 평균보다 작은 특징 값을 갖는 은닉노드를 pruning시키므로서 다층 신경망의 최적 구조를 결정하여 신경망의 학습 속도를 개선하고자 한다.

  • PDF

Acoustic Diagnosis of a Pump by Using Neural Network

  • Lee, Sin-Young
    • Journal of Mechanical Science and Technology
    • /
    • 제20권12호
    • /
    • pp.2079-2086
    • /
    • 2006
  • A fundamental study for developing a fault diagnosis system of a pump is performed by using neural network. Acoustic signals were obtained and converted to frequency domain for normal products and artificially deformed products. The neural network model used in this study was 3-layer type composed of input, hidden, and output layer. The normalized amplitudes at the multiples of real driving frequency were chosen as units of input layer. And the codes of pump malfunctions were selected as units of output layer. Various sets of teach signals made from original data by eliminating some random cases were used in the training. The average errors were approximately proportional to the number of untaught data. Neural network trained by acoustic signals can detect malfunction or diagnose fault of a given machine from the results.

다층구조 퍼셉트론을 이용한 분류 영상압축 및 코딩 (Classified Image Compression and Coding using Multi-Layer Percetpron)

  • 조광보;박철훈;이수영
    • 한국통신학회논문지
    • /
    • 제19권11호
    • /
    • pp.2264-2275
    • /
    • 1994
  • 본 논문에서는 블록 분류와 코딩과 함께 신경회로망을 이용한 영상압축을 보였다. 오차 역전파 알고리즘으로 학습되는 다층구조 신경회로망은 정규화된 영상데이타를 감소된 공간 중복성을 가지는 은닉층의 값으로 변환하는데 사용된다. 기본적으로 영상압축은 입력층과 출력층의 뉴런보다 적은 수의 은닉층 뉴런에 의해 얻어진다. 여기에 시각체계의 민감도에 따른 영상블럭 복잡성에 따라 적응적으로 압축되므로 블록을 분류한다. 또한 은닉뉴런의 양자화된 값은 효과적인 전송을 위해 entropy coding을 이용한 경우 화질의 큰 저하없이 약 25:1의 압축률을 얻었다.

  • PDF

Random generator-controlled backpropagation neural network to predicting plasma process data

  • Kim, Sungmo;Kim, Sebum;Kim, Byungwhan
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2003년도 ISIS 2003
    • /
    • pp.599-602
    • /
    • 2003
  • A new technique is presented to construct predictive models of plasma etch processes. This was accomplished by combining a backpropagation neural network (BPNN) and a random generator (RC). The RG played a critical role to control neuron gradients in the hidden layer, The predictive model constructed in this way is referred to as a randomized BPNN (RG-BPNN). The proposed scheme was evaluated with a set of experimental plasma etch process data. The etch process was characterized by a 2$^3$ full factorial experiment. The etch responses modeled are 4, including aluminum (Al) etch rate, profile angle, Al selectivity, and do bias. Additional test data were prepared to evaluate model appropriateness. The performance of RC-BPNN was evaluated as a function of the number of hidden neurons and the range of gradient. for given range and hidden neurons, 100 sets of random neuron gradients were generated and among them one best set was selected for evaluation. Compared to the conventional BPNN, the proposed RC-BPNN demonstrated about 50% improvements in all comparisons. This illustrates that the RG-BPNN of multi-valued gradients is an effective way to considerably improve the predictive ability of current BPNN of single-valued gradient.

  • PDF

Unsupervised Incremental Learning of Associative Cubes with Orthogonal Kernels

  • Kang, Hoon;Ha, Joonsoo;Shin, Jangbeom;Lee, Hong Gi;Wang, Yang
    • 한국지능시스템학회논문지
    • /
    • 제25권1호
    • /
    • pp.97-104
    • /
    • 2015
  • An 'associative cube', a class of auto-associative memories, is revisited here, in which training data and hidden orthogonal basis functions such as wavelet packets or Fourier kernels, are combined in the weight cube. This weight cube has hidden units in its depth, represented by a three dimensional cubic structure. We develop an unsupervised incremental learning mechanism based upon the adaptive least squares method. Training data are mapped into orthogonal basis vectors in a least-squares sense by updating the weights which minimize an energy function. Therefore, a prescribed orthogonal kernel is incrementally assigned to an incoming data. Next, we show how a decoding procedure finds the closest one with a competitive network in the hidden layer. As noisy test data are applied to an associative cube, the nearest one among the original training data are restored in an optimal sense. The simulation results confirm robustness of associative cubes even if test data are heavily distorted by various types of noise.

An Energy Efficient Multichannel MAC Protocol for QoS Provisioning in MANETs

  • Kamruzzaman, S.M.;Hamid, Md. Abdul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제5권4호
    • /
    • pp.684-702
    • /
    • 2011
  • This paper proposes a TDMA-based multichannel medium access control (MAC) protocol for QoS provisioning in mobile ad hoc networks (MANETs) that enables nodes to transmit their packets in distributed channels. The IEEE 802.11 standard supports multichannel operation at the physical (PHY) layer but its MAC protocol is designed only for a single channel. The single channel MAC protocol does not work well in multichannel environment because of the multichannel hidden terminal problem. Our proposed protocol enables nodes to utilize multiple channels by switching channels dynamically, thus increasing network throughput. Although each node of this protocol is equipped with only a single transceiver but it solves the multichannel hidden terminal problem using temporal synchronization. The proposed energy efficient multichannel MAC (EM-MAC) protocol takes the advantage of both multiple channels and TDMA, and achieves aggressive power savings by allowing nodes that are not involved in communications to go into power saving "sleep mode". We consider the problem of providing QoS guarantee to nodes as well as to maintain the most efficient use of scarce bandwidth resources. Our scheme improves network throughput and lifetime significantly, especially when the network is highly congested. The simulation results show that our proposed scheme successfully exploits multiple channels and significantly improves network performance by providing QoS guarantee in MANETs.

지식기반신경망에서 은닉노드삽입을 이용한 영역이론정련화 (Theory Refinements in Knowledge-based Artificial Neural Networks by Adding Hidden Nodes)

  • 심동희
    • 한국정보처리학회논문지
    • /
    • 제3권7호
    • /
    • pp.1773-1780
    • /
    • 1996
  • 인공지능의 기호적 방법과 수치적 방법을 결합한 지식기반신경망은 다른 기계 학 습모델보다 우수한 성능을 나타내고 있다. 그러나 지식기반신경망은 신경망으로 형성 된 후 동적으로 그 구조를 변경할 수 없어서 영역이론정련화 기능을 갖추지 못하였다. 지식기반신경망의 이러한 단점을 보완하기 위하여 TopGen 알고리즘이 제안되었으나 삽입된 은닉노드를 모두 입력 노드에 연결한 점, 빔탐색을 이용한 등의 문제를 안고 있다. 본 논문에서는 TopGen의 문제점을 해소하기 위하여 은닉 노드를 다음 하위계층 의 노드에 링크 시켰으며, 역추적을 허용한 언덕 오르기를 이용하는 알고리즘을 설계 하였다.

  • PDF