Browse > Article
http://dx.doi.org/10.5909/JBE.2014.19.1.64

Robust Primary-ambient Signal Decomposition Method using Principal Component Analysis with Phase Alignment  

Baek, Yong-Hyun (Computer Science. Dep., Yonsei Univ.)
Hyun, Dong-Il (Electrcal & Electronic Eng., Yonsei Univ.)
Park, Young-Cheol (Computer & Telecommunications Eng., Yonsei Univ.)
Publication Information
Journal of Broadcast Engineering / v.19, no.1, 2014 , pp. 64-74 More about this Journal
Abstract
The primary and ambient signal decomposition of a stereo sound is a key step to the stereo upmix. The principal component analysis (PCA) is one of the most widely used methods of primary-ambient signal decomposition. However, previous PCA-based decomposition algorithms assume that stereo sound sources are only amplitude-panned without any consideration of phase difference. So it occurs some performance degradation in case of live recorded stereo sound. In this paper, we propose a new PCA-based stereo decomposition algorithm that can consider the phase difference between the channel signals. The proposed algorithm overcomes limitation of conventional signal model using PCA with phase alignment. The phase alignment is realized by using inter-channel phase difference (IPD) which is widely used in parametric stereo coding. Moreover, Enhanced Modified PCA(EMPCA) is combined to solve the problem of conventional PCA caused by Primary to Ambient energy Ratio(PAR) and panning angle dependency. The simulation results are presented to show the improvements of the proposed algorithm.
Keywords
PCA; decomposition; primary singal; ambient signal; inter-channel phase difference;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 R. Dressler, "Dolby Surround Prologic II de-coder principles of operation," Tech. Rep. Dolby Laboratories, 2000
2 J. Usher and J. Benesty, "Enhancement of spatial sound quality: A new reverberation-extraction audio upmixer," Audio, Speech, and Language Processing, IEEE Trans, vol. 15, no. 7, pp. 2141-2150, 2007   DOI   ScienceOn
3 C. Faller, "Multiple-loudspeaker playback of stereo signals," J. AES, vol. 54, no.11, pp. 1051-1064, 2006.
4 S.W. Jeon, Y.C. Park, S.P. Lee, D.H. Yoon, "Robust Representation of Spatial Sound in Stereo-to Multichannel Upmix", AES convention, 2010
5 Pulkki, Ville. "Virtual Sound Source Positioning Using Vector Base Amplitude Panning", AES convention. vol 45, Issue 6, pp 456-466, 1997
6 jianjun He, Ee-Leng Tan, Woon-Seng Gan, "Time-Shifted Principal Component Analysis Based Cue Extraction for Stereo Audio Signal", ICASSP, 2013
7 Yong-Hyun Baek, Se-Woon Jeon, Seok-pil Lee, and Young-Cheol Park, "Efficient Primary-Ambient Decomposition Algorithm for Audio Upmix" JBE Vol. 17, No. 6, 924-932, November 2012
8 Petersen, T, and S. Boll., "Critical band analysis-synthesis." Acoustics, Speech and Signal Processing, IEEE Trans. Vol 31, no. 3, pp. 656-663, 1983.   DOI
9 MERIMAA, Juha, GOODWIN, Michael M., JOT Jean-Marc. "Correlation-based ambience extraction from stereo recordings", In AES convention123. Oct. 2007.
10 Hyun, Dong-il, Young-cheol Park, and Dae Hee Youn. "Estimation and quantization of ICC-dependent phase parameters for parametric stereo audio coding." EURASIP Journal on ASMP, pp. 1-12, 2012