참고문헌
- D. Jang, T. Lee, Y. Lee, and J. Yoo, "A Personalized Preset-based Audio System for Interactive Service," 121st AES Convention, 2006.
- Consideration of Interactive Music Service, ISO/IEC JTC1/SC29/WG11 (MPEG), Archamps, Document M15390, 2008.
- J. Herre and S. Disch, "New Concepts in Parametric Coding of Spatial Audio: From SAC to SAOC," 2007 International Conference on Multimedia and Expo, pp. 1894-1897, 2007.
- J. Engdegard, B. Resch, C. Falch, O. Hellmuth, J. Hilpert, A. Hoelzer, L. Terentiev, J. Breebaart, J. Koppens, E. Schuijers, and W. Oomen, "Spatial Audio Object Coding (SAOC) -The Upcoming MPEG Standard on Parametric Object Based Audio Coding," 124th AES Convention, 2008.
- O. Hellmuth, H. Purnhagen, J. Koppens, J. Herre, J. Engdegard, J. Hilpert, L. Villemoes, L. Terentiv, C. Falch, A. Holzer, M.L. Valero, B. Resch, H. Mundt, and H. Oh, "MPEG Spatial Audio Object Coding - the ISO/MPEG Standard for Efficient Coding of Interactive Audio Scenes," 129th AES Convention, 2010.
- L.R. Rabiner, M.J. Cheng, A. Rosenberg, and C.A. McGonegal, "A Comparative Performance Study of Several Pitch Detection Algorithms," IEEE Trans. on ASSP , vol. ASSP-24, No. 5, pp. 399-418, 1976.
- M. Goto, "A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using an EM Algorithm for Adaptive Tone Models," Proc. Int. Conf. on Acoustics, Speech and Signal Processing, Vol. 5, pp. 3365 -3368, 2001.
- A. de Cheveigne and H. Kawahara, "YIN, a Fundamental Frequency Estimator for Speech and Music." The Journal of the Acoust. Soc. Am., Vol. 111, No. 4, pp. 1917-1930, 2002. https://doi.org/10.1121/1.1458024
- M. Wu, D. Wang, and G.J. Brown, "A Multipitch Tracking Algorithm for Noisy Speech," Proc. IEEE Trans. Speech and Audio, Vol. 11, No. 3, pp. 229-241, 2003. https://doi.org/10.1109/TSA.2003.811539
- M. Goto, "A Real-Time Music-Scene- Description System: Predominant -F0 Estimation for Detecting Melody and Bass Lines in Real-World Audio Signals," Speech Com., Vol. 43, No. 4, pp. 311-329, 2004. https://doi.org/10.1016/j.specom.2004.07.001
- A. Klapuri, "Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes," Proc. International Conference on Music Information Retrieval, pp. 216-212, 2006.
- H. Fujihara, M. Goto, J. Ogata, K. Komatani, T. Ogata, and H.G. Okuno, "Automatic Synchronization Between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals," IEEE International Symposium on Multimedia, pp. 257-264, 2006.
- S. Kim, J. Kim, and M. Hahn, "HMM-Based Korean Speech Synthesis System for Hand- Held Devices," IEEE Trans. Consumer Electronics, Vol. 52, No. 4, pp. 1384-1390, 2006. https://doi.org/10.1109/TCE.2006.273160
- S. Kim, J. Kim, and M. Hahn, "Implementation and Evaluation of an HMM-based Korean Speech Synthesis System," IEICE Transactions on Information and Systems, Vol. E89- D, No. 3, pp. 1116-1119, 2006. https://doi.org/10.1093/ietisy/e89-d.3.1116
- S. Kim, J. Kim, and M. Hahn, "Two-band Excitation for HMM-based Speech Synthesis," IEICE Trans. Information and Systems, Vol. E90-D, No. 1, pp. 378-381, 2007. https://doi.org/10.1093/ietisy/e90-1.1.378
- S. Han, S. Jeong, and M. Hahn, "Optimum MVF Estimation-Based Two-Band Excitation for HMM-Based Speech Synthesis," ETRI J ournal, Vol. 31, No. 4, pp. 457-459, 2009. https://doi.org/10.4218/etrij.09.0209.0112
- P.C. Loizou, Speech Enhancement: Theory and Practice, Talor & Francis, New York, 2009.
- ITU-R Recommendation, Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA), ITU, BS. 1543-1, 2001.
- T. Kim and J. Chang " A Study on Speech Period and Pitch Detection for Continuous Speech Recognition," Journal of Korea Multimedia Society, Vol. 8, no. 1, pp. 55-61, 2005.