1 |
J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, issue 1, pp. 1-3, Jan. 1999. DOI: https://www.doi.org/10.1109/97.736233
DOI
|
2 |
M. Vondrasek and P. Pollak, "Methods for Speech SNR estimation: Evaluation Tool and Analysis of VAD Dependency," Radioengineering 14(1), April 2005 DOI: https://doaj.org/article/a53fe518a9634318b417fb15a8c37fa8
|
3 |
C.H. Taal, R.C. Hendrilks, R. Heusdens, and J. Jensen, "An algorithm for intelligibility prediction of time frequency weighted noisy speech," IEEE Transactions on Audio, Speech, and Language Processing, vol.19, no.7, pp.2125-2136, 2011. DOI: https://www.doi.org/10.1109/TASL.2011.2114881
DOI
|
4 |
D. K. Yun, H. N. Lee, and S. H. Choi, "A Deep Learning-Based Approach to Non-Intrusive Speech Intelligibility Estimation," IEICE Trans. Information and Systems, pp. 1207-1208, Apr. 2018. DOI: https://www.doi.org/10.1587/transinf.2017EDL8225
DOI
|
5 |
Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 6, Dec. 1984 DOI: https://www.doi.org/10.1109/TASSP.1984.1164453
|
6 |
R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics", IEEE Transactions on Speech and Audio Processing, vol. 9, no. 5, July 2001 DOI: https://www.doi.org/10.1109/89.928915
|
7 |
S. Molau, M. Pitz, R. Schluter, and H. Ney, "Computing mel-frequency cepstral coefficients on the power spectrum", IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 73-76, May 2001 DOI: https://www.doi.org/10.1109/ICASSP.2001.940770
|
8 |
V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines", Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010. DOI: https://dl.acm.org/citation.cfm?id=3104425
|
9 |
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization", arXiv preprint arXiv: 1412.6980, 2014. DOI: https://arxiv.org/abs/1412.6980
|
10 |
Multi-lingual speech database for telephonometry (1994). [Online]. Available: http://www.ntt-at.com/product/speech/. NTT Adv. Technol. Corp. Accessed 18 April 2016.
|