[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5762/KAIS.2011.12.7.3267

A Study on the Multilingual Speech Recognition using International Phonetic Language

Kim, Suk-Dong (Dept. of Computer Science, Hoseo University)
Kim, Woo-Sung (Dept. of Computer Science, Hoseo University)
Woo, In-Sung (Dept. of Computer Science, Hoseo University)

Publication Information

Journal of the Korea Academia-Industrial cooperation Society / v.12, no.7, 2011, pp. 3267-3274

Abstract

Speech recognition technology has advanced rapidly in recent years, driven by the spread of mobile devices and the growing range of speech recognition software. Multilingual speech recognition, however, remains difficult: limited understanding of multilingual lexical models and limited system capacity hold back improvements in the recognition rate. Speech that mixes several languages is hard to represent in a single acoustic model, while systems that use several acoustic models suffer from lower recognition rates. A multilingual speech recognition system that represents speech from several languages in a single acoustic model is therefore needed. Building on research into multilingual acoustic models for mobile devices, this paper studies a system that recognizes Korean and English using the International Phonetic Alphabet (IPA). By focusing on finding an IPA model that covers both Korean and English phonemes, we obtained a speech recognition rate of 94.8% for Korean and 95.36% for English.
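The idea of a single phoneme set shared by both languages can be illustrated with a minimal sketch. The mapping tables below are small hypothetical examples chosen for illustration (English phones in ARPAbet-style labels, Korean phones as jamo); they are not the phoneme inventory used in the paper, and the function names are assumptions.

```python
# Illustrative only: these tiny mapping tables are assumptions for the sketch,
# not the phoneme set used in the paper.
ENGLISH_TO_IPA = {"K": "k", "AE": "æ", "T": "t", "S": "s", "IY": "i", "M": "m", "N": "n"}
KOREAN_TO_IPA = {"ㄱ": "k", "ㅏ": "a", "ㅁ": "m", "ㄴ": "n", "ㅅ": "s", "ㅣ": "i"}


def to_ipa(phones, table):
    """Convert a language-specific phone sequence to IPA symbols."""
    return [table[p] for p in phones]


def shared_inventory(*tables):
    """Union of IPA symbols across languages: the unit set for one acoustic model."""
    inventory = set()
    for table in tables:
        inventory.update(table.values())
    return sorted(inventory)


# Words from both languages end up in the same symbol space.
english_word = to_ipa(["K", "AE", "T"], ENGLISH_TO_IPA)
korean_word = to_ipa(["ㄱ", "ㅏ", "ㅁ"], KOREAN_TO_IPA)

# Symbols that both languages map to can share acoustic-model units.
inventory = shared_inventory(ENGLISH_TO_IPA, KOREAN_TO_IPA)
shared = set(ENGLISH_TO_IPA.values()) & set(KOREAN_TO_IPA.values())
```

In this toy setup, phones that map to the same IPA symbol in both languages would be trained as one unit, while language-specific symbols simply extend the common inventory.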

Keywords

Multilingual Speech Recognition; IPA

