Adaptive Noise Canceller for Speech Enhancement Using 2-D Binary Mask |
Lee, Gihyoun
(Dept. of Medical & Biological Eng., Graduate School, Kyungpook National University)
Lee, Jyung Hyun (Dept. of Biomedical Eng., School of Medicine, Kyungpook National University) Cho, Jin-Ho (School of Electronics Eng., College of IT Engineering, Kyungpook National University) Kim, Myoung Nam (Dept. of Biomedical Eng., School of Medicine, Kyungpook National University) |
1 | P.C. Loizou, Speech Enhancement: Theory and Practice, 2nd ed., CRC Press, Boca Raton, Florida, 2013. |
2 | M. Grimm and K. Kroschel, Robust Speech Recognition and Understanding, I-Tech Education and Publishing, Vienna Austria, 2007. |
3 | J. Proakis and D. Manolakis, Digital Signal Processing, 3rd ed., Prentice Hall, Upper Saddle Rive, NJ, 1996. |
4 | S.F. Boll, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction,” IEEE Transaction on Acoustics Speech Signal Processing, Vol. 27, No. 2, pp. 113-120, 1979. DOI |
5 | N. Wiener, Extrapolation, Interpolation, and Smoothing of Stationary Time Series Vol. 2, MIT Press, Cambridge, 1949. |
6 | J. Beh and H. Ko. "A Novel Spectral Subtraction Scheme for Robust Speech Recognition: Spectral Subtraction Using Spectral Harmonics of Speech," Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Vol. 1. pp. I-64, 2003. |
7 | Y. Hu and P. Loizou, “A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise,” IEEE Transaction on Speech and Audio Processing, Vol. 11, No. 4, pp. 334-341, 2003. DOI |
8 | J.F. Zhu and Y.D. Huang, “Improved Threshold Function of Wavelet Domain Signal DeNoising,” Proceeding of Internetional Conference on Wavelet Analysis and Pattern Recognition, pp. 14-17, 2013. |
9 | ITU, Perceptual Evaluation of Speech Quality (PESQ), and Objective Method for End-to-End Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs, ITU-T Recommendation P.862, 2000. |
10 | Y. Hu and P. Loizou, “Speech Enhancement Based on Wavelet Thresholding the Multitaper Spectrum,” IEEE Transaction on Speech and Audio Processing, Vol. 12, No. 1, pp. 59-67, 2004. DOI |
11 | Y. Li and D. Wang, “On the Optimality of Ideal Binary Time–Frequency Masks,” Speech Communication, Vol. 51, No. 3, pp. 230-239, 2009. DOI |
12 | G.H. Lee, Y.J. Lee, J.H. Cho, M.N. Kim, “Voice Activity Detection Algorithm Using Fuzzy Membership Shifted C-means Clustering in Low SNR Environment,“ Journal of the Korea Multimedia Society, Vol. 17, No. 3, pp. 312-323, 2014. DOI |
13 | J.J. Godfrey, C.E. Holliman, and J. McDaniel, "SWITCHBOARD: Telephone Speech Corpus for Research and Development," Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 517-520, 1992. |
14 | A. Varga and J.M.S. Herman, "Assessment for Automatic Speech Recognition: II. NOISEX-92: A Database and an Experiment to Study the Effect of Additive Noise on Speech Recognition Systems," Speech Communication, Vol. 12, No. 3, pp. 247-251, 1993. DOI |