Browse > Article
http://dx.doi.org/10.7776/ASK.2013.32.5.438

An Enhancement of Japanese Acoustic Model using Korean Speech Database  

Lee, Minkyu (과학기술연합대학원대학교 컴퓨터소프트웨어 및 공학, 한국전자통신연구원 자동통역인공지능연구센터 자동통역연구실)
Kim, Sanghun (과학기술연합대학원대학교 컴퓨터소프트웨어 및 공학, 한국전자통신연구원 자동통역인공지능연구센터 자동통역연구실)
Abstract
In this paper, we propose an enhancement of Japanese acoustic model which is trained with Korean speech database by using several combination strategies. We describe the strategies for training more than two language combination, which are Cross-Language Transfer, Cross-Language Adaptation, and Data Pooling Approach. We simulated those strategies and found a proper method for our current Japanese database. Existing combination strategies are generally verified for under-resourced Language environments, but when the speech database is not fully under-resourced, those strategies have been confirmed inappropriate. We made tyied-list with only object-language on Data Pooling Approach training process. As the result, we found the ERR of the acoustic model to be 12.8 %.
Keywords
Speech recognition; Japanese speech recognition; Automatic speech translation; Language adaptation; Data pooling; Under-resourced language;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Ulla Uebler, "Multilingual speech recognition in seven languages," Speech. Commun. 35, 53-69 (2001).   DOI   ScienceOn
2 C. van Heerden, N. Kleynhans, E. Barnard, and M. Davel, "Pooling ASR data for closely reslated languages," in Proc. SLTU, 17-23 (2010).
3 Tanja Schultz and Alex Waibel, "Language Portability in Acoustic Modeling," Speech. Commun. 10, 59-64 (2000).
4 Kenan Çarki, Petra Geutner, and Tanja Schultz, "Turkish LVCSR: toward better speech recognition for agglutinative languages," In Proc. ICASSP, 1563-1566 (2000).
5 J. K. Lee, "Comparative study on korean and japanese formant values by korean speakers and japanese speakers," Ono Yongu Studies in Linguistics. 15, 61-75 (1997)
6 Mecab: Yet Another Part-of-Speech and Morphological Analyzer, http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html, 2013.
7 The SRI Language Modeling Toolkit, http://www.speech.sri.com/projects/srilm, 2013.
8 S. H. Kim, I. Lee, and J. Park, "Developing fast/light korean recognizer through building FST_based search network," in Proc. KSCSP, 25, 1, 21-24 (2008).
9 A. Constantinescu, and G. Chollet, "On cross-language experiments and data-driven units for ALISP," In Proc. ASRU, 606-613 (1997).