[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2012.31.3.170

The Automated Threshold Decision Algorithm for Node Split of Phonetic Decision Tree

Kim, Beom-Seung (코레일 정보기술단)
Kim, Soon-Hyob (광운대학교 컴퓨터공학과)

Publication Information

The Journal of the Acoustical Society of Korea / v.31, no.3, 2012 , pp. 170-178 More about this Journal

Abstract

In the paper, phonetic decision tree of the triphone unit was built for the phoneme-based speech recognition of 640 stations which run by the Korail. The clustering rate was determined by Pearson and Regression analysis to decide threshold used in node splitting. Using the determined the clustering rate, thresholds are automatically decided by the threshold value according to the average clustering rate. In the recognition experiments for verifying the proposed method, the performance improved 1.4~2.3 % absolutely than that of the baseline system.

Keywords

Speech recognition of train station; Phonetic decision tree; PLU; ASR;

Citations & Related Records

Times Cited By KSCI : 2 (Citation Analysis)

Reference
Cited By KSCI

1	L. Gu, K. Rose, "Sub-state tying in tied mixture hidden Markov models," Proc. IEEE, Acoustics, Speech, and Signal Processing, pp. 1062-1065, 2000.
2	R. D. R. Fagundes, J. S. Correa, P. Dumouchel, "A New Phonetic model for continuous speech recognition systems," Proc. ICSP, pp. 572-575, 2002.
3	S. J. Young, J. J. Odell, and P. C. Woodland, "Tree- Based State Tying for Hight Accuracy Accoustic Modelling," in Proceedings of the Workshop on Human Language Technology, Plainsboro, NJ, Mar. 1994.
4	T. O. Ann, "A Study on the Optimization of State Tying Acoustic Models using Mixture Gaussian Clustering", Jounal of Electronics Engineers of Korea, vol. 42, no. 6, pp. 167-176, Nov. 2005.
5	김성호, 최태성, 사회과학을 위한 통계자료분석 (SPSS 11.0활용), 다산출판사, 2004.
6	L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Application in Speech Recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
7	D. Jurafsky and J. H. Martin, Speech and Language Processing, PrenticeHall (2nd), 2008.
8	S. Young, G. Evermmana, M. Gales, T. Hain, et al, "The HTK Book for HTK Version 3.4," 2006.
9	J. J. Odell, "The Use of Context in Large Vocabulary Speech Recognition," PhD's Dissertation, University of Cambridge, 1995.
10	B. S. Kim, S. H. Kim, "A Study on Realization of Speech Recognition System based on VoiceXML for Railroad Reservation Service," Jounal of the Korea Society for Railway, vol. 14, no. 2, pp. 130-136, 2011. DOI ScienceOn
11	B. S. Kim, S. H. Kim, "A Study on the Speech Recognition for Commands of Ticketing Machine using CHMM," Jounal of the Korea Society for Railway, vol. 12, no. 2, pp. 285-290, 2009.
12	B. S. Kim, S. H. Kim, "A Study on Speech Recognition based on Phoneme for Korean Subway Station Names," Jounal of the Korea Society for Railway, vol. 14, no. 3, pp. 285-290, 2011. DOI ScienceOn
13	A. Lazarides, Y. Normandin, and R. Kuhn, "Improving decision trees for acoustic modeling," in Proc. ICSLP, Philadelphia, October. 1996.
14	D. B. Paul, "Extensions to phone-state decision-tree clustering: single tree and tagged clustering," in Proc. ICASSP, vol. 2, pp. 1487-490, 1997.

KSCI

The Automated Threshold Decision Algorithm for Node Split of Phonetic Decision Tree 음소 결정트리의 노드 분할을 위한 임계치 자동 결정 알고리즘

The Automated Threshold Decision Algorithm for Node Split of Phonetic Decision Tree