Stereo-10.2Channel Blind Upmix Technique for the Enhanced 3D Sound

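  • Sun-Woong Choi (Department of Electrical and Electronic Engineering, Yonsei University) ;
  • Dong-Il Hyun (Department of Electrical and Electronic Engineering, Yonsei University) ;
  • Seok-Pil Lee (Digital Media Research Center, Korea Electronics Technology Institute (KETI)) ;
  • Young-Cheol Park (Division of Computer and Telecommunications Engineering, Yonsei University) ;
  • Dae-Hee Youn (Department of Electrical and Electronic Engineering, Yonsei University)
  • Received : 2012.02.29
  • Reviewed : 2012.05.16
  • Published : 2012.07.31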

Abstract

This paper proposes a blind stereo-to-10.2-channel upmix algorithm for enhanced 3D sound. Consumers increasingly want better spatial audio, and as a variety of multichannel formats have emerged, upmix algorithms have been actively studied. Conventional upmix algorithms, however, distort the spatial information of the original source. To solve this problem and improve spatial sound quality, we propose gain adjustment for the front and rear channels together with a mixing algorithm for each channel of the 10.2 layout. Subjective listening tests against existing commercial multichannel upmix algorithms show that the proposed algorithm preserves the spatial information of the stereo input while enhancing the 3D sound effect.
