Browse > Article
http://dx.doi.org/10.7776/ASK.2011.30.6.308

Direction Estimation of Multiple Sound Sources Using Circular Probability Distributions  

Nam, Seung-Hyon (배재대학교 전자공학과)
Kim, Yong-Hoh (배재대학교 전자공학과)
Abstract
This paper presents techniques for estimating directions of multiple sound sources ranging from $0^{\circ}$ to $360^{\circ}$ using circular probability distributions having a periodic property. Phase differences containing direction information of sources can be modeled as mixtures of multiple probability distributions and source directions can be estimated by maximizing log-likelihood functions. Although the von Mises distribution is widely used for analyzing this kind of periodic data, we define a new class of circular probability distributions from Gaussian and Laplacian distributions by adopting a modulo operation to have $2{\pi}$-periodicity. Direction estimation with these circular probability distributions is done by implementing corresponding EM (Expectation-Maximization) algorithms. Simulation results in various reverberant environments confirm that Laplacian distribution provides better performance than von Mises and Gaussian distributions.
Keywords
Direction estimation; Circular probability distribution; Log-likelihood function; Expectation-Maximization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. R. Campbell, K. J. Palomaki, and G. J. Brown, "Roomsim, a matlab simulation of shoebox room acoustics for use in teaching and research," in http://media.paisley.ac.uk/-campbell/ Roomsim/, 2008.
2 N. T. Thom, and S. H. Nam, "An expectation-maximization method for the permutation problem in frequency-domain blind speech separation," in Proc. of ICASSP2010, 2010.
3 Y. Hioka, M. Matsuo, and N. Hamada, "Multiple-speechsource localization using advanced histogram mapping method," Acousitical Sicence and Technology, vol. 30, no. 2, 2009.
4 P. Smaragdis, and P. Boufounos, "Position and trajectory learning for microphone arrays," IEEE Trans. on Speech and Audio Proc., Jan. 2007.
5 L. A. Jeffress, "A place theory of sound localization," J. Comparative Physiol. Psychol., vol. 41, no. 1, pp. 35-39, 1948.   DOI
6 N. Mitianoudis and T. Stathaki, "Batch and online underdetermined source separation using laplacian mixture models," IEEE Trans. on Audio, Speech, and Lang. Proc. vol. 15, pp. 1818-1832, 2007.   DOI   ScienceOn
7 C. M. Bishop, Pattern recognition and machine learning, Springer, 2006.
8 C. Liu, B. C. Wheeler, Jr, R. C. Bilger, C. R. Lansing, and A. S. Feng, "Localization of multiple sound sources with two microphones," J. Acoust. Soc. Amer., vol. 108, no. 4, pp. 1888-1905, 2000.   DOI   ScienceOn
9 R. O. Schmidt, "Multiple emitter location and signal parameter estimation," IEEE Trans. Antennas Propag., vol. 34, pp. 276- 280, 1986.   DOI
10 H. Wang and M. Kaveh, "Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources," IEEE Trans. Acoust. Speech Signal Process., vol. 33, pp. 823-831, 1985.   DOI
11 P. Aarabi, "Self-localizing dynamic microphone arrays," IEEE Trans. Syst., Man, Cybern. C, vol. 32, no. 4, pp. 474-484, 2002.   DOI   ScienceOn
12 C. H. Knapp and G. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech Signal Process., vol. 24, pp. 320-327, 1976.   DOI
13 M. I. Mandel, D. P. W. Ellis, and T. Jebara, "An EM algorithm for localizing multiple sound sources in reverberant environments," in Adv. Neural Info. Process. Syst., B. Schölkopf, J. Platt, and T. Hoffman, Eds. Cambridge, MA: MIT Press, pp. 953- 960, 2007.
14 J. Benesty, J. Chen, and Y. Huang, Microphone array signal processing, Springer, 2008.