Browse > Article
http://dx.doi.org/10.5762/KAIS.2011.12.7.3267

A Study on the Multilingual Speech Recognition using International Phonetic Language  

Kim, Suk-Dong (Dept. of Computer Science, Hoseo University)
Kim, Woo-Sung (Dept. of Computer Science, Hoseo University)
Woo, In-Sung (Dept. of Computer Science, Hoseo University)
Publication Information
Journal of the Korea Academia-Industrial cooperation Society / v.12, no.7, 2011 , pp. 3267-3274 More about this Journal
Abstract
Recently, speech recognition technology has dramatically developed, with the increase in the user environment of various mobile devices and influence of a variety of speech recognition software. However, for speech recognition for multi-language, lack of understanding of multi-language lexical model and limited capacity of systems interfere with the improvement of the recognition rate. It is not easy to embody speech expressed with multi-language into a single acoustic model and systems using several acoustic models lower speech recognition rate. In this regard, it is necessary to research and develop a multi-language speech recognition system in order to embody speech comprised of various languages into a single acoustic model. This paper studied a system that can recognize Korean and English as International Phonetic Language (IPA), based on the research for using a multi-language acoustic model in mobile devices. Focusing on finding an IPA model which satisfies both Korean and English phonemes, we get 94.8% of the voice recognition rate in Korean and 95.36% in English.
Keywords
Multilingual Speech Recognition; IPA;
Citations & Related Records
연도 인용수 순위
  • Reference
1 H.-M. Park and R. M. Stern, "Missing-feature speech recognition using dereverberation and echo suppression in reverberant environments," IEEE International Conference on Acoustics, Speech, and Signal Processing, April 2007, Honolulu, Hawaii.   DOI
2 Thomas K. Harris, Arthur Toth, James Sanders, Alexander Rudnicky. "Towards Efficient Human Machine Speech Communication". ACM Transactions on Speech and Language Processing, February 2005..
3 Jahanzeb Sherwani et el " Towards Speech-based Access by Semi-literate Users". In Proc. Speech in Mobile and Pervasive Environments, Singapore, September 2007.
4 John S. Garofolo, Jonathan G. Fiscus,William M. Fisher "Design and prtparation of the 1996 HUB-4 Broadcast News Benchmark Test Corpora." DARPA Speech Recognition Workshop, Feb. 1997, pp. 15 - 21.
5 A. G. Hauptmann, et el. "Multi-Lingual Broadcast News Retrieval", TRECVID'06 TREC, NIST Gaithersburg, MD., November 2006.
6 Z. Al Bawab, B, Raj, and R. M. Stern, "Analysis-by-synthesis features for speech recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, April 2008, Las Vegas, Nevada.   DOI
7 Stefanie Tomko, and Roni Rosenfeld. " A Speechand Language-based Information Management Environment". In Proc. IEEE Int.l Conference on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006.