참고문헌
- S. Furui, "Speaker-Independent Isolated Word Recognition Using Dynamic Features of Speech Spectrum," IEEE Trans. Acoust., Speech Signal Process., vol. 34, no. 1, Feb. 1986, pp. 52-59. https://doi.org/10.1109/TASSP.1986.1164788
- S. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoust. Speech Signal Process., vol. 28, no. 4, Aug. 1980, pp. 357-366. https://doi.org/10.1109/TASSP.1980.1163420
- H. Hermansky, "Perceptual Linear Prediction (PLP) Analysis of Speech," J. Acoust. Soc. America, vol. 87, no. 4, Apr. 1990, pp. 1738-1752. https://doi.org/10.1121/1.399423
- W.H. Abdulla, "Auditory Based Feature Vectors for Speech Recognition Systems," Advances in Communications And Software Technologies, WSEAS ed., Athens, Greece: WSEAS Press, 2002, pp. 231-236.
- D.-S. Kim, S.-Y. Lee, and R.M. Kil, "Auditory Processing of Speech Signals for Robust Speech Recognition in Real-World Noisy Environments," IEEE Trans. Speech Audio Process., vol. 7, no. 1, Jan. 1999, pp. 55-69. https://doi.org/10.1109/89.736331
- S. Young et al., The HTK Book (for HTK version 3.4), Cambridge, England: Cambridge University Engineering Department, 2006.
- B. Milner, "A Comparison of Front-End Configurations for Robust Speech Recognition," Proc. ICASSP, Orlando, FL, USA, vol. 1, May 13-17, 2002, pp. 797-800.
- S.J. Lee et al., "Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition," ETRI J., vol. 32, no. 5, Oct. 2010, pp. 801-809. https://doi.org/10.4218/etrij.10.1510.0024
피인용 문헌
- Multilingual speech-to-speech translation system for mobile consumer devices vol.60, pp.3, 2014, https://doi.org/10.1109/tce.2014.6937337
- Weighted Finite State Transducer-Based Endpoint Detection Using Probabilistic Decision Logic vol.36, pp.5, 2014, https://doi.org/10.4218/etrij.14.2214.0030
- Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain vol.36, pp.5, 2014, https://doi.org/10.4218/etrij.14.2214.0039
- Online Blind Channel Normalization Using BPF-Based Modulation Frequency Filtering vol.38, pp.6, 2014, https://doi.org/10.4218/etrij.16.0115.0994
- 원어민 및 외국인 화자의 음성인식을 위한 심층 신경망 기반 음향모델링 vol.9, pp.2, 2017, https://doi.org/10.13064/ksss.2017.9.2.095
- Multimodal Unsupervised Speech Translation for Recognizing and Evaluating Second Language Speech vol.11, pp.6, 2014, https://doi.org/10.3390/app11062642