환경 변이에 강인한 화자 인식 기술

;;

Review of KIISC (정보보호학회지)

Volume 12 Issue 2
/
Pages.41-49
/
2002
/
1598-3978(pISSN)

Korea Institute of Information Security and Cryptology (한국정보보호학회)

환경 변이에 강인한 화자 인식 기술

김유진 (인하대학교 전자전기공학부 DSP연구실) ;
정재호 (인하대학교 전자전기공학부 DSP연구실)

Published : 2002.04.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

음성 인식 기술과 뿌리를 공유하는 화자 인식 기술은 지난 수십 년간의 연구결과로 괄목할 만한 진보가 이루어졌으며 최근에는 일반화될 수 있으리라는 기대를 가지도록 하기에 충분했다. 하지만 이러한 기술이 실제 환경에 적용되었을 때, 발성 환경을 제어할 수 없으며 그 결과 훈련 환경과는 다른 환경에서 발성된 음성을 인식 해야하는 이른바 '불일치 조건(mismatch condition)' 현상이 발생하게된다. 초기에는 이 현상을 극복하기 위해 잡음 자체를 모델링하고 제거함으로써 훈련과 인식 환경의 차이를 일정하게 정규화(normalization)해주는 연구가 진행되었다. 하지만 최근에는 잡음에 의한 왜곡의 모델이 복잡하고 실제 인식 성능에 직접적으로 나타나지 않는 문제점을 추가로 극복하기 위해, 훈련과 인식 환경의 차이를 보상해주는(compensation) 연구가 활발히 진행되고 있다. 본 논문에서는 기본적인 화자인식기술과 함께 성능저하를 일으키는 불일치 요인들 및 그것들을 극복하기 위한 기술들을 소개하고자 한다.

Keywords

References

Proceeding of the IEEE v.64 no.4 Automatic Recognition of speakers from Their Voices Bishnu,S.Atal. https://doi.org/10.1109/PROC.1976.10155
IEEE ASSP Magazine Speaker Recognition IEEE ASSP Magazine D.O'Shaughnessy
IEEE Signal Processing Magazine Text-Independent Speaker Identification Herbert Gish;Michael Schmidt
Proceeding of the IEEE v.64 no.4 Automatic Speaker Verification : A Review AARON E. Rosenberg https://doi.org/10.1109/PROC.1976.10156
IEEE Communications Magazine speaker Verification : A Tutorial Jayant M. Naik
Computer Science Research at Ubilab CAVE-Speaker Verification in Banking and Telecommunications K.U.mazel;H.P.Frei
Digital Processing of Speech Signals L.R.Rabiner;R.W.Schafer
Fundamentals of Speech Recognition L.R.Rabiner;B.H.Juang
Automatic Speech and Speaker Recognition:Advanced Topics Chin-Hui Lee;Frank K.Soong;Kuldip K. Paliwal
IEEE Trans. on Speech and Audio Processing v.3 no.1 Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models Douglas A. Reynolds;Richard C. Rose https://doi.org/10.1109/89.365379
Proc. of ICASSP comparison of Whole Word and Subword Modeling Technique for Speaker Verification With Limited Training Data s. Enler;R.Langlitz;J.Zinke
Proc. of ICASSP Concatenated Phoneme Models for Text-variable Speaker Recognition Tomoko Matsui;Sadaoki Furui
Proc. of ICASSP A Hybrid Score Measurement For HMMBased Speaker Verification Yong Gu;Trevor Thomas
Proc. of ICASSP A Comparison of A proiri Threshold Setting Procedures For Speaker Verification in the CAVE Project J.B.Pierrot;J.Lindberg;J.Koolwaaij;H.P.Hutter;D.Genoud;M.Blomberg;F.Bimbot
Digital Signal Processing v.10 Speaker verification using adapted gaussian mixture models Digitaql Signal Processing Douglas Reynolds;Thomas Quatieri;Robert Dunn https://doi.org/10.1006/dspr.1999.0361
IEEE Signal Processing Magazine Robust Speaker Recognition R.J.Mammone;X.Zhang;R.P.Ramachandran
Speech Communication v.25 On Stochastic Feature and Model compensation Approaches to Robust Speech Recognition Chin-Hui Lee https://doi.org/10.1016/S0167-6393(98)00028-4
Acoustical and Evnironmental Robustness in Automatic Speech Recognition A.Acero
Proc. of Eurospeech-2001 v.4 Formant-Broadened CMS Using Peak-Picking in Log Spectrum Yu-Jin Kim;Hea-Kyoung Jung;Jae-Ho Chung
IEEE Trans. on Acoustics, Speech, And Signal Processing v.29 Suppression of Acoustic Noise in Speech using Spectral Subtraction Boll
IEEE Trans. on Speech and Audio Processing v.2 no.4 RASTA Processing of Speech Hynek Hermansky;Nelson Morgan
IEEE Trans. on Speech and Audio Processing v.2 no.4 New LP-Derived Features for Speaker-Identification K.T.Assaleh;R.J.Mammone
Journal of Acoustical Society of America v.87 no.4 Perceptual linear predictive(PLP) analysis of speech Hynek Hermansky
한국음향학회지 v.18 no.5 전화선 채널이 화자확인 시스템 성능에 미치는 영향 조태현;김유진;이재영;정재호
proc. of ICASSP Cepstral Analysis Technique for Automatic Speaker Verification Sadaoki Furi
Computer Speech and Language 9 Robust speech recognition in additive and convolutional noise using parallel model combination M.J.F.Gales;S.J.Young
IEEE Trans. on Speech and Audio Processing v.2 no.2 Integrated models of speech and backgroud with application to speaker identifiation in noise R.C.Rose;E.M.Hofstetter;D.A.Reynolds https://doi.org/10.1109/89.279273
IEEE Trans. on Speech and Audio Processing v.4 no.1 Signal Bias Removal by Maximum Likelihood Estimation for Robust Telephone Speech Recognition Mazin G. Rahim;Biing-Hwang Juang
IEEE Trans. on Speech and Audio Processing v.8 no.5 Estimation of Handest Nonlinearity with Application to Speaker Recognition T.F.Quatieri;D.A.Reynolds https://doi.org/10.1109/89.861376
Proc. of ICASSP. Student-Forum Signal Bias Removal Based GMM for Robust Speaker Recognition Yu-Jin Kim;Jae-Ho Chung

Review of KIISC (정보보호학회지)

환경 변이에 강인한 화자 인식 기술

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)