• Title/Summary/Keyword: GCC-PHAT

Search Result 13, Processing Time 0.029 seconds

A study on robust generalized cross correlation-phase transform based time delay estimation in impulsive noise environment using nonlinear preprocessing and frequency domain low-pass filter (비선형 전처리와 주파수 영역 저역 필터에 의한 임펄스성 잡음 환경에 강인한 위상 변환 일반 상호 상관 시간 지연 추정기 연구)

  • Jun-Seok Lim;Keunwa Lee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.4
    • /
    • pp.406-413
    • /
    • 2024
  • The proposed method uses Generalized Cross Correlation - Phase Transform (GCC-PHAT) method with nonlinear preprocessing and a frequency domain low-pass filter. In this paper, by reinterpreting the calculation process of GCC-PHAT as DFT, we derive that there is an effective frequency band used for time delay estimation in GCC-PHAT, and by using only the effective band using a low-pass filter, the noise component is reduced and it improvesthe time delay performance in impulsive noise environments. By comparing the proposed method with the traditional GCC-PHAT in an impulsive noise environment, we show that the GCC-PHAT becomes more robust to the impulsive noise.

Performance analysis of GCC-PHAT-based sound source localization for intelligent robots (지능형 로봇을 위한 GCC-PHAT 기반 음원추적 기술의 성능분석)

  • Park, Beom-Chul;Ban, Kyu-Dae;Kwak, Keun-Chang;Yoon, Ho-Sup
    • The Journal of Korea Robotics Society
    • /
    • v.2 no.3
    • /
    • pp.270-274
    • /
    • 2007
  • In this paper, we present a Sound Source Localization (SSL) based GCC (Generalized Cross Correlation)-PHAT (Phase Transform) and new measurement method of angle with robot auditory system for a network-based intelligent service robot. The main goal of this paper is to analysis performance of TDOA and GCC-PHAT sound source localization method and new angle measurement method is compared. We use GCC-PHAT for measuring time delays between several microphones. And sound source location is calculated by using time delays and new measurement method of angle. The robot platform used in this work is wever-R2, which is a network-based intelligent service robot developed at Intelligent Robot Research Division in ETRI.

  • PDF

Improved generalized cross correlation-phase transform based time delay estimation by frequency domain autocorrelation (주파수영역 자기상관에 의한 위상 변환 일반 상호 상관 시간 지연 추정기 성능 개선)

  • Lim, Jun-Seok;Cheong, MyoungJun;Kim, Seongil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.37 no.5
    • /
    • pp.271-275
    • /
    • 2018
  • There are several methods for estimating the time delay between incoming signals to two sensors. Among them, the GCC-PHAT (Generalized Cross Correlation-Phase Transform) method, which estimates the relative delay from the signal whitening and the cross-correlation between the different signal inputs to the two sensors, is a traditionally well known method for achieving stable performance. In this paper, we have identified a part of GCC-PHAT that can improve the periodicity. Also, we apply the auto-correlation method that is widely used as a method to improve the periodicity. Comparing the proposed method with the GCC-PHAT method, we show that the proposed method improves the mean square error performance by 5 dB ~ 15 dB at the SNR above 0 dB for white Gaussian signal source and also show that the method improves the mean square error performance up to 15 dB at the SNR above 2 dB for the color signal source.

On the speaker's position estimation using TDOA algorithm in vehicle environments (자동차 환경에서 TDOA를 이용한 화자위치추정 방법)

  • Lee, Sang-Hun;Choi, Hong-Sub
    • Journal of Digital Contents Society
    • /
    • v.17 no.2
    • /
    • pp.71-79
    • /
    • 2016
  • This study is intended to compare the performances of sound source localization methods used for stable automobile control by improving voice recognition rate in automobile environment and suggest how to improve their performances. Generally, sound source location estimation methods employ the TDOA algorithm, and there are two ways for it; one is to use a cross correlation function in the time domain, and the other is GCC-PHAT calculated in the frequency domain. Among these ways, GCC-PHAT is known to have stronger characteristics against echo and noise than the cross correlation function. This study compared the performances of the two methods above in automobile environment full of echo and vibration noise and suggested the use of a median filter additionally. We found that median filter helps both estimation methods have good performances and variance values to be decreased. According to the experimental results, there is almost no difference in the two methods' performances in the experiment using voice; however, using the signal of a song, GCC-PHAT is 10% more excellent than the cross correlation function in terms of the recognition rate. Also, when the median filter was added, the cross correlation function's recognition rate could be improved up to 11%. And in regarding to variance values, both methods showed stable performances.

Direction Estimation of Multiple Sound Sources Using Non-negative Matrix Factorization and Generalized Cross-Correlation (비음수 행렬 분해 및 일반화된 상호상관계수 기법을 이용한 TV시청 환경에서의 다중 음원 방향 추정 방법)

  • Yu, Seung Woo;Jeon, Kwang Myung;Park, Ji Hyun;Kim, Hong Kook
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2015.11a
    • /
    • pp.16-17
    • /
    • 2015
  • 본 논문에서는 실내 환경 중 TV 시청환경에서 마이크로폰 어레이를 이용하여 다양한 다중 음원 방향을 추정하는 기법을 제안한다. 제안된 기법은 기존의 하나의 음원에 특화되어 있는 GCC-PHAT 기반의 방법을 GCC-PHAT 버퍼와 NMF를 도입하여 다중음원의 방향 추정을 가능하게 만들었다. 제안된 기법의 성능을 평가하기 위해서 실 거주 환경에서 발생하는 소음원과 TV 소리 방향 추정 결과에 대한 실측치와 추정치 간의 오차인 절대 평균오차를 측정하였으며, 실험 결과 제안한 기법이 기존의 방법인 GCC-PHAT보다 우수한 추정 성능을 보임을 확인하였다.

  • PDF

GPU-based Acceleration of Particle Filter Signal Processing for Efficient Moving-target Position Estimation (이동 목표물의 효율적인 위치 추정을 위한 파티클 필터 신호 처리의 GPU 기반 가속화)

  • Kim, Seongseop;Cho, Jeonghun;Park, Daejin
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.12 no.5
    • /
    • pp.267-275
    • /
    • 2017
  • Time of difference of arrival (TDOA) method using passive sonar sensor array has normally been used to estimate the location of a concealed moving target in underwater environment. Particle filter has been introduced for effective target estimation for non-Gaussian and nonlinear systems. In this paper, we propose a GPU-based acceleration of target position estimation using particle filter and propose efficient embedded system and software architecture. For the TDOA measurement from the passive sonar sensor, we use the generalized cross correlation phase transform (GCC-PHAT) method to obtain the correlation coefficient of the signal using FFT and we try to accelerate the calculation of GCC-PHAT based TDOA measurements using FFT with GPU CUDA. We also propose parallelization method of the target position estimation algorithm using the GPU CUDA to update the state of each particle for the target position estimation using the measured values. The target estimation algorithm was verified using Matlab and implemented using GPU CUDA. Then, we realized the proposed signal processing acceleration system using NVIDIA Jetson TX1 as the target board to analyze in terms of the execution time. The execution time of the algorithm is reduced by 55% to the CPU standalone-operation on the target board. Experiment results show that the proposed architecture is a feasible solution in terms of high-performance and area-efficient architecture.

Real-Time Sound Localization System For Reverberant And Noisy Environment (반향음과 잡음 환경을 고려한 실시간 소리 추적 시스템)

  • Kee, Chang-Don;Kim, Ghang-Ho;Lee, Taik-Jin
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.38 no.3
    • /
    • pp.258-263
    • /
    • 2010
  • Sound localization algorithm usually adapts three step process: sampling sound signals, estimating time difference of arrival between microphones, estimate location of sound source. To apply this process in indoor environment, sound localization algorithm must be strong enough against reverberant and noisy condition. Additionally, calculation efficiency must be considered in implementing real-time sound localization system. To implement real-time robust sound localization system we adapt four low cost condenser microphones which reduce the cost and total calculation load. And to get TDOA(Time Differences of Arrival) of microphones we adapt GCC-PHAT(Generalized Cross Correlation-Phase Transform) which is robust algorithm to the reverberant and noise environment. The position of sound source was calculated by using iterative least square algorithm which produce highly accurate position data.

An efficient space dividing method for the two-dimensional sound source localization (2차원 상의 음원위치 추정을 위한 효율적인 영역분할방법)

  • Kim, Hwan-Yong;Choi, Hong-Sub
    • The Journal of the Acoustical Society of Korea
    • /
    • v.35 no.5
    • /
    • pp.358-367
    • /
    • 2016
  • SSL (Sound Source Localization) has been applied to several applications such as man-machine interface, video conference system, smart car and so on. But in the process of sound source localization, angle estimation error is occurred mainly due to the non-linear characteristics of the sine inverse function. So an approach was proposed to decrease the effect of this non-linear characteristics, which divides the microphone's covering space into narrow regions. In this paper, we proposed an optimal space dividing way according to the pattern of microphone array. In addition, sound source's 2-dimensional position is estimated in order to evaluate the performance of this dividing method. In the experiment, GCC-PHAT (Generalized Cross Correlation PHAse Transform) method that is known to be robust with noisy environments is adopted and triangular pattern of 3 microphones and rectangular pattern of 4 microphones are tested with 100 speech data respectively. The experimental results show that triangular pattern can't estimate the correct position due to the lower space area resolution, but performance of rectangular pattern is dramatically improved with correct estimation rate of 67 %.

Comparison of the sound source localization methods appropriate for a compact microphone array (소형 마이크로폰 배열에 적용 가능한 음원 위치 추정법 비교)

  • Jung, In-Jee;Ih, Jeong-Guon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.1
    • /
    • pp.47-56
    • /
    • 2020
  • The sound source localization technique has various application fields in the era of internet-of-things, for which the probe size becomes critical. The localization methods using the acoustic intensity vector has an advantage of downsizing the layout of the array owing to a small finite-difference error for the short distance between adjacent microphones. In this paper, the acoustic intensity vector and the Time Difference of Arrival (TDoA) method are compared in the viewpoint of the localization error in the far-field. The comparison is made according to the change of spacing between adjacent microphones of the three-dimensional microphone array arranged in a tetrahedral shape. An additional test is conducted in the reverberant field by varying the reverberation time to verify the effectiveness of the methods applied to the actual environments. For estimating the TDoA, the Generalized Cross Correlation-Phase transform (GCC-PHAT) algorithm is adopted in the computation. It is found that the mean localization error of the acoustic intensimetry is 2.9° and that of the GCC-PHAT is 7.3° for T60 = 0.4 s, while the error increases as 9.9°, 13.0° for T60 = 1.0 s, respectively. The data supports that a compact array employing the acoustic intensimetry can localize of the sound source in the actual environment with the moderate reflection conditions.

Nonnegative Matrix Factorization Based Direction-of-Arrival Estimation of Multiple Sound Sources Using Dual Microphone Array (이중 마이크로폰을 이용한 비음수 행렬분해 기반 다중음원 도래각 예측)

  • Jeon, Kwang Myung;Kim, Hong Kook;Yu, Seung Woo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.2
    • /
    • pp.123-129
    • /
    • 2017
  • This paper proposes a new nonnegative matrix factorization (NMF) based direction-of-arrival (DOA) estimation method for multiple sound sources using a dual microphone array. First of all, sound signals coming from the dual microphone array are segmented into consecutive analysis frames, and a steered-response power phase transform (SRP-PHAT) beamformer is applied to each frame so that stereo signals of each frame are represented in a time-direction domain. The time-direction outputs of SRP-PHAT are stored for a pre-defined number of frames, which is referred to as a time-direction block. Next, In order to estimate DOAs robust to noise, each time-direction block is normalized along the time by using a block subtraction technique. After that, an unsupervised NMF method is applied to the normalized time-direction block in order to cluster the directions of each sound source in a multiple sound source environments. In particular, the activation and basis matrices are used to estimate the number of sound sources and their DOAs, respectively. The DOA estimation performance of the proposed method is evaluated by measuring a mean absolute error (MAE) and the standard deviation of errors between the oracle and estimated DOAs under a three source condition, where the sources are located in [$-35{\circ}$, 5m], [$12{\circ}$, 4m], and [$38{\circ}$, 4.m] from the dual microphone array. It is shown from the experiment that the proposed method could relatively reduce MAE by 56.83%, compared to a conventional SRP-PHAT based DOA estimation method.