Browse > Article
http://dx.doi.org/10.7776/ASK.2013.32.6.541

Audio Mixer Algorithm for Enhancing Speech Quality of Multi-party Audio Telephony  

Ryu, Sang-Hyeon (광운대학교 전파공학과)
Kim, Hyoung-Gook (광운대학교 전파공학과)
Abstract
The speech quality of multi-party audio telephony between two, three or more participants is decreased by audio volume imbalance, audio volume saturation and noise level increase. To solve this issue, this paper proposes an advanced audio mixing algorithm for software-based multi-point control unit. Our approach is based on the combined voice activity detection and gain control technique that consists of a set of algorithms that classify audio signals, estimate audio volumes, adjust gain factors and mix audio signals of all channels. The proposed audio mixing algorithm is computationally efficient, delivers high-quality speech, and is suitable for use in any practical multi-party audio telephony.
Keywords
Multi-party audio telephony; Voice activity detection; Gain control; Audio volume equalization;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 D. Song, Y. Mo, and F. Wang, "Architecture of multiparty conferencing using SIP," in Proc. IEEE Int'l. Conf. on WiCOM, 2, 1361-1364 (2005).
2 F. Xing G, U. Wei-Kang, and Y. Xiu-qing, "Research on fast real time adaptive audio mixing in multimedia conference," J. of Zhejiang Univ. Sci. 6, 507-512 (2005).   DOI
3 S. V. Gerven and F. Xie, "A comparative study of speech detection methods," in Proc. of EUROSPEECH, 3, 1095-098 (1997).
4 S. P. Chandra, K. M. Senthil, and M. P. P. Bala, "Audio mixer for multi-party conferencing in VoIP," in Poc. IEEE Int'l. Conf. on IMSAA, 1-6 (2009).
5 V. M. Baskaran, and K. Wong "Audio mixer with automatic gain controller for software based multipoint control unit," APCCAS, 164-167 (2010).
6 Y. S. Um, J. H. Chang, and D. K. Kim, "Signal subspace-based vioc activity detection using generalized gaussian distortion" (in Korean), J. Acoust. Soc. Kr. 32, 131-137 (2013).   DOI   ScienceOn
7 3GPP TS 26.194, AMR Wideband Speech Codec : Voice Activity Detection (VAD), 2004.