• Title/Summary/Keyword: RNN(Recurrent Neural Network)

Search Result 231, Processing Time 0.02 seconds

A Study on Development of Embedded System for Speech Recognition using Multi-layer Recurrent Neural Prediction Models & HMM (다층회귀신경예측 모델 및 HMM 를 이용한 임베디드 음성인식 시스템 개발에 관한 연구)

  • Kim, Jung hoon;Jang, Won il;Kim, Young tak;Lee, Sang bae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.3
    • /
    • pp.273-278
    • /
    • 2004
  • In this paper, the recurrent neural networks (RNN) is applied to compensate for HMM recognition algorithm, which is commonly used as main recognizer. Among these recurrent neural networks, the multi-layer recurrent neural prediction model (MRNPM), which allows operating in real-time, is used to implement learning and recognition, and HMM and MRNPM are used to design a hybrid-type main recognizer. After testing the designed speech recognition algorithm with Korean number pronunciations (13 words), which are hardly distinct, for its speech-independent recognition ratio, about 5% improvement was obtained comparing with existing HMM recognizers. Based on this result, only optimal (recognition) codes were extracted in the actual DSP (TMS320C6711) environment, and the embedded speech recognition system was implemented. Similarly, the implementation result of the embedded system showed more improved recognition system implementation than existing solid HMM recognition systems.

Development of Dolphin Click Signal Classification Algorithm Based on Recurrent Neural Network for Marine Environment Monitoring (해양환경 모니터링을 위한 순환 신경망 기반의 돌고래 클릭 신호 분류 알고리즘 개발)

  • Seoje Jeong;Wookeen Chung;Sungryul Shin;Donghyeon Kim;Jeasoo Kim;Gihoon Byun;Dawoon Lee
    • Geophysics and Geophysical Exploration
    • /
    • v.26 no.3
    • /
    • pp.126-137
    • /
    • 2023
  • In this study, a recurrent neural network (RNN) was employed as a methodological approach to classify dolphin click signals derived from ocean monitoring data. To improve the accuracy of click signal classification, the single time series data were transformed into fractional domains using fractional Fourier transform to expand its features. Transformed data were used as input for three RNN models: long short-term memory (LSTM), gated recurrent unit (GRU), and bidirectional LSTM (BiLSTM), which were compared to determine the optimal network for the classification of signals. Because the fractional Fourier transform displayed different characteristics depending on the chosen angle parameter, the optimal angle range for each RNN was first determined. To evaluate network performance, metrics such as accuracy, precision, recall, and F1-score were employed. Numerical experiments demonstrated that all three networks performed well, however, the BiLSTM network outperformed LSTM and GRU in terms of learning results. Furthermore, the BiLSTM network provided lower misclassification than the other networks and was deemed the most practically appliable to field data.

Synthesis of Expressive Talking Heads from Speech with Recurrent Neural Network (RNN을 이용한 Expressive Talking Head from Speech의 합성)

  • Sakurai, Ryuhei;Shimba, Taiki;Yamazoe, Hirotake;Lee, Joo-Ho
    • The Journal of Korea Robotics Society
    • /
    • v.13 no.1
    • /
    • pp.16-25
    • /
    • 2018
  • The talking head (TH) indicates an utterance face animation generated based on text and voice input. In this paper, we propose the generation method of TH with facial expression and intonation by speech input only. The problem of generating TH from speech can be regarded as a regression problem from the acoustic feature sequence to the facial code sequence which is a low dimensional vector representation that can efficiently encode and decode a face image. This regression was modeled by bidirectional RNN and trained by using SAVEE database of the front utterance face animation database as training data. The proposed method is able to generate TH with facial expression and intonation TH by using acoustic features such as MFCC, dynamic elements of MFCC, energy, and F0. According to the experiments, the configuration of the BLSTM layer of the first and second layers of bidirectional RNN was able to predict the face code best. For the evaluation, a questionnaire survey was conducted for 62 persons who watched TH animations, generated by the proposed method and the previous method. As a result, 77% of the respondents answered that the proposed method generated TH, which matches well with the speech.

The Study of Service Event Relation Analysis Using Recurrent Neural Network (Recurrent Neural Network를 활용한 서비스 이벤트 관계 분석에 관한 연구)

  • Jeon, Woosung;Park, Youngsuk;Choi, Jeongil
    • Journal of Information Technology Services
    • /
    • v.17 no.4
    • /
    • pp.75-83
    • /
    • 2018
  • Enterprises need to monitor systems for reliable IT service operations to quickly detect and respond to events affecting the service, thereby preventing failures. Events in non-critical systems can be seen as a precursor to critical system incidents. Therefore, event relationship analysis in the operation of IT services can proactively recognize and prevent faults by identifying non-critical events and their relationships with incidents. This study used the Recurrent Neural Network and Long Short Term Memory techniques to create a model to analyze event relationships in a system and to verify which models are suitable for analyzing event relationships. Verification has shown that both models are capable of analyzing event relationships and that RNN models are more suitable than LSTM models. Based on the pattern of events occurring, this model is expected to support the prediction of the next occurrence of events and help identify the root cause of incidents to help prevent failures and improve the quality of IT services.

System Identification Using Gamma Multilayer Neural Network (감마 다층 신경망을 이용한 시스템 식별)

  • Go, Il-Whan;Won, Sang-Chul;Choi, Han-Go
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.9 no.3
    • /
    • pp.238-244
    • /
    • 2008
  • Dynamic neural networks have been applied to diverse fields requiring temporal signal processing. This paper presents gamma neural network(GAM) to improve the dynamics of multilayer network. The GAM network uses the gamma memory kernel in the hidden layer of feedforword multilayer network. The GAM network is evaluated in linear and nonlinear system identification, and compared with feedforword(FNN) and recurrent neural networks(RNN) for the relative comparison of its performance. Experimental results show that the GAM network performs better with respect to the convergence and accuracy, indicating that it can be a more effective network than conventional multilayer networks in system identification.

  • PDF

A SE Approach for Real-Time NPP Response Prediction under CEA Withdrawal Accident Conditions

  • Felix Isuwa, Wapachi;Aya, Diab
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.18 no.2
    • /
    • pp.75-93
    • /
    • 2022
  • Machine learning (ML) data-driven meta-model is proposed as a surrogate model to reduce the excessive computational cost of the physics-based model and facilitate the real-time prediction of a nuclear power plant's transient response. To forecast the transient response three machine learning (ML) meta-models based on recurrent neural networks (RNNs); specifically, Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU), and a sequence combination of Convolutional Neural Network (CNN) and LSTM are developed. The chosen accident scenario is a control element assembly withdrawal at power concurrent with the Loss Of Offsite Power (LOOP). The transient response was obtained using the best estimate thermal hydraulics code, MARS-KS, and cross-validated against the Design and control document (DCD). DAKOTA software is loosely coupled with MARS-KS code via a python interface to perform the Best Estimate Plus Uncertainty Quantification (BEPU) analysis and generate a time series database of the system response to train, test and validate the ML meta-models. Key uncertain parameters identified as required by the CASU methodology were propagated using the non-parametric Monte-Carlo (MC) random propagation and Latin Hypercube Sampling technique until a statistically significant database (181 samples) as required by Wilk's fifth order is achieved with 95% probability and 95% confidence level. The three ML RNN models were built and optimized with the help of the Talos tool and demonstrated excellent performance in forecasting the most probable NPP transient response. This research was guided by the Systems Engineering (SE) approach for the systematic and efficient planning and execution of the research.

Chord-based stepwise Korean Trot music generation technique using RNN-GAN (RNN-GAN을 이용한 코드 기반의 단계적 트로트 음악 생성 기법)

  • Hwang, Seo-Rim;Park, Young-Cheol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.6
    • /
    • pp.622-628
    • /
    • 2020
  • This paper proposes a music generation technique that automatically generates trot music using a Generative Adversarial Network (GAN) model composed of a Recurrent Neural Network (RNN). The proposed method uses a method of creating a chord as a skeleton of the music, creating a melody and bass in stages based on the chord progression made, and attaching it to the corresponding chord to complete the structured piece. Also, a new chorus chord progression is created from the verse chord progression by applying the characteristics of a trot song that repeats the structure divided into an individual section, such as intro, verse, and chorus. And it extends the length of the created trot. The quality of the generated music was specified using subjective evaluation and objective evaluation methods. It was confirmed that the generated music has similar characteristics to the existing trot.

Real-time Approximation of a Hydraulic Servo System Using a Recurrent Neural Network with 2-D Learning Algorithm (2차원 학습 회귀적 신경망을 이용한 전기.유압 서보시스템의 실시간 추종)

  • 정봉호;곽동훈;이춘태;이진걸
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.20 no.8
    • /
    • pp.94-100
    • /
    • 2003
  • This paper presents the experiments on the approximation of a hydraulic servo system using a real time recurrent neural networks (RTRN) with time varying weights. In order to verify the effectiveness of the RTRN algorithm in hydraulic servo system, we design the experimental hydraulic system and implemented the real time approximation of system output. Experimental results show that approximated output of the RTRN well follows the position trajectory of the electro-hydraulic servo system. And also it is verified that the 2-D RNN can be implemented in sampling time even though high sampling frequency experimentally.

Sound Event Detection based on Deep Neural Networks (딥 뉴럴네트워크 기반의 소리 이벤트 검출)

  • Chung, Suk-Hwan;Chung, Yong-Joo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.14 no.2
    • /
    • pp.389-396
    • /
    • 2019
  • In this paper, various architectures of deep neural networks were applied for sound event detection and their performances were compared using a common audio database. The FNN, CNN, RNN and CRNN were implemented using hyper-parameters optimized for the database as well as the architecture of each neural network. Among the implemented deep neural networks, CRNN performed best at all testing conditions and CNN followed CRNN in performance. Although RNN has a merit in tracking the time-correlations in audio signals, it showed poor performance compared with CNN and CRNN.

Nonlinear Prediction using Gamma Multilayered Neural Network (Gamma 다층 신경망을 이용한 비선형 적응예측)

  • Kim Jong-In;Go Il-Hwan;Choi Han-Go
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.7 no.2
    • /
    • pp.53-59
    • /
    • 2006
  • Dynamic neural networks have been applied to diverse fields requiring temporal signal processing such as system identification and signal prediction. This paper proposes the gamma neural network(GAM), which uses gamma memory kernel in the hidden layer of feedforward multilayered network, to improve dynamics of networks and then describes nonlinear adaptive prediction using the proposed network as an adaptive filter. The proposed network is evaluated in nonlinear signal prediction and compared with feedforword(FNN) and recurrent neural networks(RNN) for the relative comparison of prediction performance. Simulation results show that the GAM network performs better with respect to the convergence speed and prediction accuracy, indicating that it can be a more effective prediction model than conventional multilayered networks in nonlinear prediction for nonstationary signals.

  • PDF