• Title/Summary/Keyword: Fully Recurrent Neural Networks

Nonlinear Prediction of Nonstationary Signals using Neural Networks (신경망을 이용한 비정적 신호의 비선형 예측)

  • Choi, Han-Go;Lee, Ho-Sub;Kim, Sang-Hee
    • Journal of the Korean Institute of Telematics and Electronics S, v.35S no.10, pp.166-174, 1998
  • Neural networks, which have highly nonlinear dynamics by virtue of their distributed nonlinearities and learning ability, have the potential for adaptive prediction of nonstationary signals. This paper describes nonlinear prediction of these signals in two ways: using a nonlinear module alone, and using a cascade combination of nonlinear and linear modules. Fully connected recurrent neural networks (RNNs) and a conventional tapped-delay-line (TDL) filter are used as the nonlinear and linear modules, respectively. The dynamic behavior of the proposed predictors is demonstrated on chaotic time series and speech signals. For relative comparison of prediction performance, the proposed predictors are compared with a conventional ARMA linear prediction model. Experimental results show that the neural-network-based adaptive predictor significantly outperforms the traditional linear scheme. We also find that the cascade combination predictor is well suited to predicting time series that contain large variations in signal amplitude.

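As a rough illustration of the linear module named in the abstract above, the following sketch runs a tapped-delay-line (FIR) filter as a one-step-ahead predictor. The normalized LMS weight update, the tap count, the step size, and the sinusoidal test signal are all assumptions made for this example; the abstract does not say how the TDL filter is trained, and the recurrent (nonlinear) module of the cascade is not modeled here.

```python
import math


def tdl_nlms_predict(signal, taps=4, mu=0.5, eps=1e-8):
    """One-step-ahead prediction with a tapped-delay-line (FIR) filter.

    The weights are adapted with normalized LMS. That choice is an
    assumption for illustration, not the method reported in the paper.
    """
    weights = [0.0] * taps
    predictions, errors = [], []
    for n in range(taps, len(signal)):
        # Delay line: the most recent `taps` samples, newest first.
        window = [signal[n - 1 - k] for k in range(taps)]
        y_hat = sum(w * x for w, x in zip(weights, window))
        err = signal[n] - y_hat
        # Normalize the step by the energy currently in the delay line.
        energy = sum(x * x for x in window) + eps
        weights = [w + mu * err * x / energy for w, x in zip(weights, window)]
        predictions.append(y_hat)
        errors.append(err)
    return predictions, errors, weights


# Hypothetical test signal: a noise-free sinusoid.
signal = [math.sin(0.2 * n) for n in range(400)]
predictions, errors, weights = tdl_nlms_predict(signal)
early_mse = sum(e * e for e in errors[:20]) / 20
late_mse = sum(e * e for e in errors[-20:]) / 20
```

Comparing `early_mse` with `late_mse` shows the prediction error shrinking as the filter adapts. In a cascade such as the one the abstract describes, a filter of this kind would be combined with the nonlinear module rather than applied to the raw signal on its own.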

Performance comparison of various deep neural network architectures using Merlin toolkit for a Korean TTS system (Merlin 툴킷을 이용한 한국어 TTS 시스템의 심층 신경망 구조 성능 비교)

  • Hong, Junyoung;Kwon, Chulhong
    • Phonetics and Speech Sciences, v.11 no.2, pp.57-64, 2019
  • In this paper, we construct a Korean text-to-speech system using the Merlin toolkit, an open-source system for speech synthesis. HMM-based statistical parametric speech synthesis is widely used in text-to-speech systems, but the quality of its synthesized speech is known to be degraded by limitations of the acoustic modeling scheme that includes context factors. We propose an acoustic modeling architecture based on deep neural networks, which have shown excellent performance in various fields. The architectures compared are a fully connected deep feedforward neural network (DNN), a recurrent neural network (RNN), a gated recurrent unit (GRU), a long short-term memory (LSTM) network, and a bidirectional LSTM (BLSTM). Experimental results show that performance improves when sequence modeling is included in the architecture, and that the architectures with LSTM or BLSTM perform best. Including delta and delta-delta components in the acoustic feature parameters was also found to improve performance.
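
The delta and delta-delta components mentioned in the abstract above are time-derivative estimates appended to each static acoustic feature frame. A minimal sketch follows, assuming the three-tap windows commonly used in statistical parametric synthesis (`-0.5, 0, 0.5` for delta and `1, -2, 1` for delta-delta) and edge-frame repetition at the boundaries. The abstract does not state the windows or boundary handling used in the paper, and the function names here are hypothetical.

```python
# Assumed three-tap windows; the abstract does not specify them.
DELTA_WINDOW = (-0.5, 0.0, 0.5)
DELTA_DELTA_WINDOW = (1.0, -2.0, 1.0)


def apply_window(frames, window):
    """Apply a 3-tap window along time to a list of feature vectors.

    Edge frames are repeated so the output has as many frames as the input.
    """
    last = len(frames) - 1
    out = []
    for t in range(len(frames)):
        prev_f = frames[max(t - 1, 0)]
        cur_f = frames[t]
        next_f = frames[min(t + 1, last)]
        out.append([
            window[0] * p + window[1] * c + window[2] * n
            for p, c, n in zip(prev_f, cur_f, next_f)
        ])
    return out


def add_dynamic_features(frames):
    """Return frames laid out as [static, delta, delta-delta]."""
    delta = apply_window(frames, DELTA_WINDOW)
    delta2 = apply_window(frames, DELTA_DELTA_WINDOW)
    return [s + d + a for s, d, a in zip(frames, delta, delta2)]


# Four frames of a one-dimensional feature following t**2.
frames = [[0.0], [1.0], [4.0], [9.0]]
features = add_dynamic_features(frames)
# features[1] == [1.0, 2.0, 2.0]
# features[2] == [4.0, 4.0, 2.0]
```

For the interior frames of this quadratic trajectory, the delta equals the slope `2t` and the delta-delta equals the constant second difference `2`, which is the kind of local dynamics these components expose to the acoustic model.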