Browse > Article
http://dx.doi.org/10.9717/kmms.2013.16.8.905

Recognition Performance Improvement for Noisy-speech by Parallel Model Compensation Adaptation Using Frequency-variant added with ML  

Choi, Sook-Nam (영남대학교 정보통신공학과)
Chung, Hyun-Yeol (영남대학교 정보통신공학과)
Publication Information
Abstract
The Parallel Model Compensation Using Frequency-variant: FV-PMC for noise-robust speech recognition is a method to classify the noises, which are expected to be intermixed with input speech when recognized, into several groups of noises by setting average frequency variant as a threshold value; and to recognize the noises depending on the classified groups. This demonstrates the excellent performance considering noisy speech categorized as good using the standard threshold value. However, it also holds a problem to decrease the average speech recognition rate with regard to unclassified noisy speech, for it conducts the process of speech recognition, combined with noiseless model as in the existing PMC. To solve this problem, this paper suggests a enhanced method of recognition to prevent the unclassified through improving the extent of rating scales with use of maximum likelihood so that the noise groups, including input noisy speech, can be classified into more specific groups, which leads to improvement of the recognition rate. The findings from recognition experiments using Aurora 2.0 database showed the improved results compared with those from the method of the previous FV-PMC.
Keywords
PMC; MLE; Frequency-variant; Speech Recognition;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 최숙남, 정현열, "주파수 변이를 이용한 환경 인식 기반의 GMM 적응에 관한 연구," 한국 신호처리.시스템 학회 추계학술대회 논문집, pp. 181-185, 2012.
2 Richard O. Duda, Peter E. Hart, and David G. Stock, Pattern Classification 2nd Edition, Wiley-Interscience, New York, 2001.
3 Philipos C .Loizou, Speech Enhancement -Theory and Practice, CRC Press, Florida, 2007.
4 ITU-T PSQM, Objective Quality Measurement of Telephone-band(300-3400Hz) Speech Codecs, 2001.
5 J.G. Beerends, A.P. Heckstra, A.W. Rix, and M.P. Hollier, "Perceptual Evaluation of Speech Quality (PESQ) the New Itu Standard for End-to-end Speech Quality Assessment Part II - Psychoacoustic Model," Journal of the Audio Engineering Society, Vol. 50, No. 10, pp. 765-778, 2002.
6 손영호, 최재훈, 장준혁, "환경인식 기반의 향상된 Minimum Statistics 잡음전력 추정기법," 한국음향학회지, 제30권, 제3호, pp. 123-128, 2011.   과학기술학회마을   DOI   ScienceOn
7 H.-G Hirsch and D. Pearce, "The AURORA Experimental Framework for the Performance Evaluation of Speech Recognition Systems Under Noisy Conditions," ISCA ITRW ASR 2000, 2000.
8 김상만, 서광석, 김종교, "이산 웨이브렛과 비균일 필터뱅크를 적용한 음성특징 추출," 정보통신산업진흥원, 2000.
9 Gong Y., "Speech Recognition in Noisy Environments: A Survey," Speech Communication, Vol. 16, Issue 3, pp. 261-292, 1995.   DOI   ScienceOn
10 최숙남, 신광호, 정현열, "켑스트럼 정규화와 켑스트럼 거리기반 묵음특징정규화 방법을 이용한 잡음음성 인식" 멀티미디어학회논문지, 제14권, 제10호, pp.1221-1228, 2011
11 J.C. Junqua and J.P. Haton, "Robustness in Automatic Speech Recognition: Fundamentals and Applications," Kluwer Academic Publishers, Netherlands, 1996.
12 J.S. Lim, "Speech Enhancement," Prentice Hall, New Jersey, 1983.
13 D.H. Klatt, "A Digital Filterbank for Spectral Matching," Proc. ICASSP , pp. 573-576, 1979.
14 M.J.F. Gales and S. Young, "An improved Approach to the Hidden Markov Model Decomposition of Speech and Noise," Proc. ICASSP-92, Vol. 1, pp. 233-236, 1992.
15 A.P. Varga and R.K. Moore, "Hidden Markov Model Decomposition of Speech and Noise," Proc. ICASSP, pp. 845-848, 1990.
16 M.J.F. Gales and S. Young, Model Based Techniques for Noise Robust Speech Recognition, Dissertation at the University of Cambridge, 1995.