Search | Korea Science

Sound Source Localization Method Based on Deep Neural Network (깊은 신경망 기반 음원 추적 기법)

Park, Hee-Mun;Jung, Jong-Dae
- Journal of IKEEE
- /
- v.23 no.4
- /
- pp.1360-1365
- /
- 2019
In this paper, we describe a sound source localization(SSL) system which can be applied to mobile robot and automatic control systems. Usually the SSL method finds the Interaural Time Difference, the Interaural Level Difference, and uses the geometrical principle of microphone array. But here we proposed another approach based on the deep neural network to obtain the horizontal directional angle(azimuth) of the sound source. We pick up the sound source signals from the two microphones attached symmetrically on both sides of the robot to imitate the human ears. Here, we use difference of spectral distributions of sounds obtained from two microphones to train the network. We train the network with the data obtained at the multiples of 10 degrees and test with several data obtained at the random degrees. The result shows quite promising validity of our approach.
https://doi.org/10.7471/ikeee.2019.23.4.1360 인용 PDF KSCI

An efficient space dividing method for the two-dimensional sound source localization (2차원 상의 음원위치 추정을 위한 효율적인 영역분할방법)

Kim, Hwan-Yong;Choi, Hong-Sub
- The Journal of the Acoustical Society of Korea
- /
- v.35 no.5
- /
- pp.358-367
- /
- 2016
SSL (Sound Source Localization) has been applied to several applications such as man-machine interface, video conference system, smart car and so on. But in the process of sound source localization, angle estimation error is occurred mainly due to the non-linear characteristics of the sine inverse function. So an approach was proposed to decrease the effect of this non-linear characteristics, which divides the microphone's covering space into narrow regions. In this paper, we proposed an optimal space dividing way according to the pattern of microphone array. In addition, sound source's 2-dimensional position is estimated in order to evaluate the performance of this dividing method. In the experiment, GCC-PHAT (Generalized Cross Correlation PHAse Transform) method that is known to be robust with noisy environments is adopted and triangular pattern of 3 microphones and rectangular pattern of 4 microphones are tested with 100 speech data respectively. The experimental results show that triangular pattern can't estimate the correct position due to the lower space area resolution, but performance of rectangular pattern is dramatically improved with correct estimation rate of 67 %.
https://doi.org/10.7776/ASK.2016.35.5.358 인용 PDF KSCI

Performance analysis of GCC-PHAT-based sound source localization for intelligent robots (지능형 로봇을 위한 GCC-PHAT 기반 음원추적 기술의 성능분석)

Park, Beom-Chul;Ban, Kyu-Dae;Kwak, Keun-Chang;Yoon, Ho-Sup
- The Journal of Korea Robotics Society
- /
- v.2 no.3
- /
- pp.270-274
- /
- 2007
In this paper, we present a Sound Source Localization (SSL) based GCC (Generalized Cross Correlation)-PHAT (Phase Transform) and new measurement method of angle with robot auditory system for a network-based intelligent service robot. The main goal of this paper is to analysis performance of TDOA and GCC-PHAT sound source localization method and new angle measurement method is compared. We use GCC-PHAT for measuring time delays between several microphones. And sound source location is calculated by using time delays and new measurement method of angle. The robot platform used in this work is wever-R2, which is a network-based intelligent service robot developed at Intelligent Robot Research Division in ETRI.
PDF

Fast 360° Sound Source Localization using Signal Energies and Partial Cross Correlation for TDOA Computation

Yiwere, Mariam;Rhee, Eun Joo
- Journal of Information Technology Applications and Management
- /
- v.24 no.1
- /
- pp.157-167
- /
- 2017
This paper proposes a simple sound source localization (SSL) method based on signal energies comparison and partial cross correlation for TDOA computation. Many sound source localization methods include multiple TDOA computations in order to eliminate front-back confusion. Multiple TDOA computations however increase the methods' computation times which need to be as minimal as possible for real-time applications. Our aim in this paper is to achieve the same results of localization using fewer computations. Using three microphones, we first compare signal energies to predict which quadrant the sound source is in, and then we use partial cross correlation to estimate the TDOA value before computing the azimuth value. Also, we apply a threshold value to reinforce our prediction method. Our experimental results show that the proposed method has less computation time; spending approximately 30% less time than previous three microphone methods.
https://doi.org/10.21219/jitam.2017.24.1.157 인용 PDF KSCI

Model-based Clustering of DOA Data Using von Mises Mixture Model for Sound Source Localization

Dinh, Quang Nguyen;Lee, Chang-Hoon
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.13 no.1
- /
- pp.59-66
- /
- 2013
In this paper, we propose a probabilistic framework for model-based clustering of direction of arrival (DOA) data to obtain stable sound source localization (SSL) estimates. Model-based clustering has been shown capable of handling highly overlapped and noisy datasets, such as those involved in DOA detection. Although the Gaussian mixture model is commonly used for model-based clustering, we propose use of the von Mises mixture model as more befitting circular DOA data than a Gaussian distribution. The EM framework for the von Mises mixture model in a unit hyper sphere is degenerated for the 2D case and used as such in the proposed method. We also use a histogram of the dataset to initialize the number of clusters and the initial values of parameters, thereby saving calculation time and improving the efficiency. Experiments using simulated and real-world datasets demonstrate the performance of the proposed method.
https://doi.org/10.5391/IJFIS.2013.13.1.59 인용 PDF KSCI

3-D Sound Source Localization using Energy-Based Region Selection and TDOA (에너지 기반 영역 선택과 TDOA에 의한 3차원 음원 위치 추정)

Yiwere, Mariam;Rhee, Eun Joo
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.21 no.2
- /
- pp.294-300
- /
- 2017
This paper proposes a method for 3-D sound source localization (SSL) using region selection and TDOA. 3-D SSL involves the estimation of an azimuth angle and an elevation angle. With the aim of reducing the computation time, we compare signal energies to select one out of three regions. In the selected region, we compute only one TDOA value for the azimuth angle estimation. Also, to estimate the vertical angle, we choose the higher energy signal from the selected region and pair it up with the elevated microphone's signal for TDOA computation and elevation angle estimation. Our experimental results show that the proposed method achieves average error values of $0.778^{\circ}$ in azimuth and $1.296^{\circ}$ in elevation, which is similar to other methods. The method uses one energy comparison and two TDOA computations therefore, the total processing time is reduced.
https://doi.org/10.6109/jkiice.2017.21.2.294 인용 PDF KSCI

Search Result 6, Processing Time 0.023 seconds

Sound Source Localization Method Based on Deep Neural Network (깊은 신경망 기반 음원 추적 기법)

An efficient space dividing method for the two-dimensional sound source localization (2차원 상의 음원위치 추정을 위한 효율적인 영역분할방법)

Performance analysis of GCC-PHAT-based sound source localization for intelligent robots (지능형 로봇을 위한 GCC-PHAT 기반 음원추적 기술의 성능분석)

Fast 360° Sound Source Localization using Signal Energies and Partial Cross Correlation for TDOA Computation

Model-based Clustering of DOA Data Using von Mises Mixture Model for Sound Source Localization

3-D Sound Source Localization using Energy-Based Region Selection and TDOA (에너지 기반 영역 선택과 TDOA에 의한 3차원 음원 위치 추정)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)