[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.4218/etrij.16.0115.0499

Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation

Jeon, Hyung-Bae (Department of Bio and Brain Engineering, KAIST, SW & Contents Research Laboratory, ETRI)
Lee, Soo-Young (Department of Electrical Engineering and Department of Bio and Brain Engineering, KAIST)

Publication Information

ETRI Journal / v.38, no.3, 2016, pp. 487-493

Abstract

Two new methods are proposed for the unsupervised adaptation of a language model (LM) from a single sentence in automatic transcription tasks. In the training phase, the training documents are clustered using latent Dirichlet allocation (LDA), and a domain-specific LM is then trained for each cluster. In the test phase, the adapted LM is formed as a linear mixture of the trained domain-specific LMs. Unlike previous adaptation methods, the proposed methods make full use of the trained LDA model to estimate the weights assigned to the domain-specific LMs; as a result, both the clustering and the weight estimation of the trained LDA model are reliable. In continuous speech recognition benchmark tests, the proposed methods outperform other unsupervised LM adaptation methods based on latent semantic analysis, non-negative matrix factorization, and LDA with n-gram counting.
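The linear-mixture step described in the abstract can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything here is an assumption made for illustration: the function name `mix_language_models`, the toy unigram tables `sports_lm` and `finance_lm`, and the `topic_weights` values, which merely stand in for topic probabilities that a trained LDA model might assign to a test sentence. The domain LMs in the paper would be n-gram models; unigrams are used only to keep the example short.

```python
from typing import Dict, List


def mix_language_models(
    weights: List[float], domain_lms: List[Dict[str, float]]
) -> Dict[str, float]:
    """Return the linear mixture P(w) = sum_k weights[k] * P_k(w).

    Hypothetical helper: weights are renormalized to sum to 1, and a word
    missing from a domain LM is treated as having probability 0 there.
    """
    if len(weights) != len(domain_lms):
        raise ValueError("need exactly one weight per domain LM")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    normalized = [w / total for w in weights]

    # The mixture is defined over the union of all domain vocabularies.
    vocab = set().union(*(lm.keys() for lm in domain_lms))
    return {
        word: sum(lam * lm.get(word, 0.0) for lam, lm in zip(normalized, domain_lms))
        for word in vocab
    }


# Toy domain-specific unigram LMs (illustrative numbers, not from the paper).
sports_lm = {"goal": 0.5, "team": 0.4, "market": 0.1}
finance_lm = {"goal": 0.1, "team": 0.1, "market": 0.8}

# Placeholder topic probabilities for one test sentence (assumed values).
topic_weights = [0.75, 0.25]

adapted = mix_language_models(topic_weights, [sports_lm, finance_lm])
for word in sorted(adapted):
    print(f"{word}: {adapted[word]:.3f}")
```

Because each domain LM sums to one and the weights are normalized, the mixture is itself a valid probability distribution; how the weights are actually estimated from the LDA model is the contribution of the paper and is not reproduced here.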

Keywords

Language model adaptation; topic model; latent Dirichlet allocation; weighted mixture model; LDA

Citations & Related Records

Reference

1	J.R. Bellegarda, "Statistical LM Adaptation: Review and Perspectives," Speech Commun., vol. 42, no. 1, Jan. 2004, pp. 93-108.
2	R. Kneser, J. Peters, and D. Klakow, "Language Model Adaptation Using Dynamic Marginals," European Conf. Speech Commun. Technol., Rhodes, Greece, Sept. 1997, pp. 1971-1974.
3	M. Federico, "Efficient Language Model Adaptation through MDI Estimation," European Conf. Speech Commun. Technol., Budapest, Hungary, Sept. 1999, pp. 1583-1586.
4	Y. Si et al., "Block-Based Language Model for Target Domain Adaptation towards Web Corpus," J. Computational Inf. Syst., vol. 9, Nov. 2013, pp. 9139-9146.
5	K. Thadani, F. Biadsy, and D.M. Bikel, "On-the-fly Topic Adaptation for YouTube Video Transcription," Interspeech, Portland, OR, USA, Sept. 2012, pp. 210-213.
6	J.R. Bellegarda, "Exploiting Latent Semantic Information in Statistical Language Modeling," Proc. IEEE, vol. 88, no. 8, 2000, pp. 1279-1296.
7	Y. Akita and T. Kawahara, "Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions," IEICE Trans. Inf. Syst., vol. E88-D, no. 3, Mar. 2005, pp. 439-445.
8	W. Xu, X. Liu, and Y. Gong, "Document Clustering Based On Non-negative Matrix Factorization," Ann. Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, Toronto, Canada, July 2003, pp. 267-273.
9	D.M. Blei, A.Y. Ng, and M.I. Jordan, "Latent Dirichlet Allocation," J. Machine Learning Res., vol. 3, Feb. 2003, pp. 993-1022.
10	D.M. Blei and J.D. Lafferty, "Topic Models," Text Mining: Classification, Clustering, and Applications, vol. 10, no. 71, June 2009, p. 34.
11	Y.C. Tam and T. Schultz, "Unsupervised Language Model Adaptation Using Latent Semantic Marginals," Interspeech, Pittsburgh, PA, USA, Sept. 2006, pp. 2206-2209.
12	M.A. Haidar and D. O'Shaughnessy, "Unsupervised LM Adaptation Using LDA-Based Mixture Models and Latent Semantic Marginals," Comput. Speech Language, vol. 29, no. 1, 2015, pp. 20-31.
13	X. Liu and W.B. Croft, "Cluster-Based Retrieval Using Language Models," Ann. Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, Sheffield, UK, July 2004, pp. 186-193.
14	A. Stolcke, "Entropy-Based Pruning of Backoff Language Models," DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, USA, Feb. 1998, pp. 270-274.

Cited by

(2016) "Graph-Based Collaborative Filtering with MLP," Mathematical Problems in Engineering, vol. 2018, 1.
(2020) "Online Speech Recognition Using Multichannel Parallel Acoustic Score Computation and Deep Neural Network (DNN)-Based Voice-Activity Detector," Applied Sciences, vol. 10, no. 12, 4091.
(2020) "Automatic Proficiency Assessment of Korean Speech Read Aloud by Non-Natives Using Bidirectional LSTM-Based Speech Recognition," ETRI Journal, vol. 42, no. 5, 761.