Browse > Article
http://dx.doi.org/10.7840/kics.2013.38C.2.213

Factored MLLR Adaptation for HMM-Based Speech Synthesis in Naval-IT Fusion Technology  

Sung, June Sig (서울대학교 전기컴퓨터공학부 뉴미디어통신공동연구소)
Hong, Doo Hwa (서울대학교 전기컴퓨터공학부 뉴미디어통신공동연구소)
Jeong, Min A (목포대학교)
Lee, Yeonwoo (목포대학교)
Lee, Seong Ro (목포대학교)
Kim, Nam Soo (서울대학교 전기컴퓨터공학부)
Abstract
One of the most popular approaches to parameter adaptation in hidden Markov model (HMM) based systems is the maximum likelihood linear regression (MLLR) technique. In our previous study, we proposed factored MLLR (FMLLR) where each MLLR parameter is defined as a function of a control vector. We presented a method to train the FMLLR parameters based on a general framework of the expectation-maximization (EM) algorithm. Using the proposed algorithm, supplementary information which cannot be included in the models is effectively reflected in the adaptation process. In this paper, we apply the FMLLR algorithm to a pitch sequence as well as spectrum parameters. In a series of experiments on artificial generation of expressive speech, we evaluate the performance of the FMLLR technique and also compare with other approaches to parameter adaptation in HMM-based speech synthesis.
Keywords
Speech synthesis; adaptation; MLLR; HMM; expressive speech;
Citations & Related Records
연도 인용수 순위
  • Reference
1 C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, Apr. 1995.   DOI   ScienceOn
2 Y. Sung, C. Boulis, and D. Jurafsky, "Maximum conditional likelihood linear regression and maximum a posteriori for hidden conditional random fields speaker adaptation," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 4293-4296
3 J. Yamagishi et al., "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech Lang. Process., vol. 17, no. 1, pp. 66-83, Jan. 2009.   DOI   ScienceOn
4 T. Nose, Y. Kato, and T. Kobayashi, "A speaker adaptation technique for MRHSMM-based style control of synthetic speech," in Proc. ICASSP, Honolulu, HI, 2007, pp. 833-836.
5 J. Sundberg, "The acoustics of the singing voice," Sci. Amer., pp. 82-91, Mar. 1977
6 N. S. Kim, J. S. Sung and D. H. Hong, "Factored MLLR adaptation," IEEE Signal Processing Letters, vol. 18, no. 2, pp. 99-102, Feb. 2011.   DOI   ScienceOn
7 J. S. Sung, D. H. Hong, H. W. Koo and N. S. Kim, "Factored MLLR Adaptation Algorithm for HMM-based Expressive TTS," in Interspeech2012, Portland, Sep. 2012.
8 J. S. Sung, S. J. Kang, J.-H. Chang, and N. S. Kim, "Factored MLLR Adaptation for HMM-based Singing Voice Synthesis", in Proc. KICS Int. Conf. Commun. 2011, Jeju Island, Korea, June, 2011.
9 H. Zen et al., "The HMM-based speech synthesis system version 2.0," in Proc. of ISCA SSW6, Bonn, Germany, Aug. 2007.