[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2013.32.5.438

An Enhancement of Japanese Acoustic Model using Korean Speech Database

Lee, Minkyu (과학기술연합대학원대학교 컴퓨터소프트웨어 및 공학, 한국전자통신연구원 자동통역인공지능연구센터 자동통역연구실)
Kim, Sanghun (과학기술연합대학원대학교 컴퓨터소프트웨어 및 공학, 한국전자통신연구원 자동통역인공지능연구센터 자동통역연구실)

Publication Information

The Journal of the Acoustical Society of Korea / v.32, no.5, 2013 , pp. 438-445 More about this Journal

Abstract

In this paper, we propose an enhancement of Japanese acoustic model which is trained with Korean speech database by using several combination strategies. We describe the strategies for training more than two language combination, which are Cross-Language Transfer, Cross-Language Adaptation, and Data Pooling Approach. We simulated those strategies and found a proper method for our current Japanese database. Existing combination strategies are generally verified for under-resourced Language environments, but when the speech database is not fully under-resourced, those strategies have been confirmed inappropriate. We made tyied-list with only object-language on Data Pooling Approach training process. As the result, we found the ERR of the acoustic model to be 12.8 %.

Keywords

Speech recognition; Japanese speech recognition; Automatic speech translation; Language adaptation; Data pooling; Under-resourced language;

Citations & Related Records

Reference

1	Ulla Uebler, "Multilingual speech recognition in seven languages," Speech. Commun. 35, 53-69 (2001). DOI ScienceOn
2	C. van Heerden, N. Kleynhans, E. Barnard, and M. Davel, "Pooling ASR data for closely reslated languages," in Proc. SLTU, 17-23 (2010).
3	Tanja Schultz and Alex Waibel, "Language Portability in Acoustic Modeling," Speech. Commun. 10, 59-64 (2000).
4	Kenan Çarki, Petra Geutner, and Tanja Schultz, "Turkish LVCSR: toward better speech recognition for agglutinative languages," In Proc. ICASSP, 1563-1566 (2000).
5	J. K. Lee, "Comparative study on korean and japanese formant values by korean speakers and japanese speakers," Ono Yongu Studies in Linguistics. 15, 61-75 (1997)
6	Mecab: Yet Another Part-of-Speech and Morphological Analyzer, http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html, 2013.
7	The SRI Language Modeling Toolkit, http://www.speech.sri.com/projects/srilm, 2013.
8	S. H. Kim, I. Lee, and J. Park, "Developing fast/light korean recognizer through building FST_based search network," in Proc. KSCSP, 25, 1, 21-24 (2008).
9	A. Constantinescu, and G. Chollet, "On cross-language experiments and data-driven units for ALISP," In Proc. ASRU, 606-613 (1997).

KSCI

An Enhancement of Japanese Acoustic Model using Korean Speech Database 한국어 음성데이터를 이용한 일본어 음향모델 성능 개선

An Enhancement of Japanese Acoustic Model using Korean Speech Database