http://dx.doi.org/10.5573/ieek.2013.50.12.197

Artificial Bandwidth Extension Based on Harmonic Structure Extension and NMF  

Kim, Kijun (Dept. of Electronics Engineering, Kwangwoon University)
Park, Hochong (Dept. of Electronics Engineering, Kwangwoon University)
Publication Information
Journal of the Institute of Electronics and Information Engineers / v.50, no.12, 2013, pp. 197-204
Abstract
In this paper, we propose a new method for artificial bandwidth extension of narrow-band signals in the frequency domain. In the proposed method, a narrow-band signal is decomposed into an excitation signal and a spectral envelope, which are extended independently in the frequency domain. The excitation signal is extended so that the harmonic structure of the low band is maintained in the high band, and the spectral envelope is extended based on sub-band energy using NMF. Finally, the spectral phase is determined from the signal correlation between frames in the time domain, yielding the final wide-band signal. A subjective evaluation confirmed that the wide-band signal generated by the proposed method has higher quality than the original narrow-band signal.
Keywords
artificial bandwidth extension; NMF (non-negative matrix factorization); harmonic structure; spectral envelope; phase
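The envelope-extension step named in the abstract (sub-band energy extended with NMF) can be illustrated with a minimal sketch. The abstract does not give the NMF configuration, so everything below is an assumption made for illustration: the function names `nmf`, `activations`, and `extend_envelope` are hypothetical, the cost function is the Euclidean one with the standard multiplicative updates of Lee and Seung, and the scheme of training a basis on wide-band sub-band energies and then fitting activations to the low-band rows only is one common way to use NMF for bandwidth extension, not necessarily the authors' procedure.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero in the updates


def nmf(V, rank, n_iter=200, seed=0):
    """Factor a non-negative matrix V (bands x frames) as W @ H.

    Uses multiplicative updates for the Euclidean cost, which keep
    W and H non-negative when V is non-negative.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((rank, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def activations(V, W, n_iter=200, seed=0):
    """Estimate activations H for a fixed basis W so that V ~ W @ H."""
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
    return H


def extend_envelope(E_low, W, n_low):
    """Predict high-band sub-band energies from low-band energies.

    W is a basis trained on wide-band energies (assumed layout: the
    first n_low rows are low-band sub-bands, the rest are high-band).
    Activations are fitted to the low-band rows only and then applied
    to the high-band rows of the same basis.
    """
    H = activations(E_low, W[:n_low])
    return W[n_low:] @ H
```

In this sketch, `nmf` would be run once on wide-band training energies to obtain `W`, and `extend_envelope` would then be applied per utterance to the narrow-band sub-band energies. The choice of rank, sub-band layout, and iteration count are free parameters here and are not taken from the paper.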