Research Trends in Speech Signal Processing Using Machine Learning

  • Published: 2016.05.25

References

  1. 김기응 (Kee-Eung Kim), What should we learn from the AlphaGo shock?, JoongAng Ilbo opinion column, March 15, 2016, http://news.joins.com/article/19723142
  2. D. Silver et al., Mastering the game of Go with deep neural networks and tree search, Nature 529, 2016
  3. G. Hinton, Y. Bengio, Y. LeCun, NIPS tutorial: Deep Learning, 2015
  4. A. Mohamed, D. Yu, L. Deng, Investigation of full-sequence training of deep belief networks for speech recognition, Interspeech, 2010
  5. G. Dahl, D. Yu, L. Deng, A. Acero, Large vocabulary continuous speech recognition with context-dependent DBN-HMMs, ICASSP, 2011
  6. H. Sak, A. Senior, K. Rao, A. Graves, F. Beaufays, J. Schalkwyk, Learning acoustic frame labeling for speech recognition with recurrent neural networks, ICASSP, 2015
  7. D. Amodei et al., Deep Speech 2: End-to-end speech recognition in English and Mandarin, arXiv:1512.02595, 2015
  8. G. Saon, H.-K. Kuo, S. Rennie, M. Picheny, The IBM 2015 English conversational telephone speech recognition system, ICASSP, 2015
  9. T. Sainath, R. J. Weiss, K. W. Wilson, A. Narayanan, M. Bacchiani, Factored spatial and spectral multichannel raw waveform CLDNNs, ICASSP, 2016
  10. J. F. Cardoso, Blind signal separation: statistical principles, Proceedings of the IEEE, 1998
  11. A. Hyvärinen, E. Oja, Independent component analysis: algorithms and applications, Neural Networks, 2000
  12. D. Lee, H. Seung, Learning the parts of objects by non-negative matrix factorization, Nature, 1999
  13. P. Smaragdis, Probabilistic decompositions of spectra for sound separation, Blind Speech Separation, 2007
  14. X. Lu, Y. Tsao, S. Matsuda, C. Hori, Speech enhancement based on deep denoising autoencoder, Interspeech, 2013
  15. G. Hu, D. Wang, Monaural speech segregation based on pitch tracking and amplitude modulation, IEEE Transactions on Neural Networks, 2004
  16. Y. Wang, A. Narayanan, D. Wang, On training targets for supervised speech separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014
  17. D. Williamson, Y. Wang, D. Wang, Complex ratio masking for monaural speech separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016