Search | Korea Science

Integrated Algorithm of Sound Source Separation and Localization (음원 분리와 음원 위치 추정 통합 알고리즘)

Han, Taek-Jin;Park, Hochong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2014.11a
- /
- pp.111-114
- /
- 2014
본 논문에서는 다양한 스테레오 환경에서도 정확한 음원 위치 추정이 가능한 방법을 제안한다. 기존의 음원 위치 추정 방법은 방향성을 가지고 있는 주성분 신호와 방향성이 없는 주변 성분으로 구성된 스테레오 환경에서만 음원의 위치 추정이 가능했다. 그러나 현재 제공되고 있는 스테레오 신호는 방향성을 가지는 다수의 음원으로 구성되어있고, 기존의 음원 위치 추정 방법으로는 정확한 음원 위치 추정이 어렵다. 이와 같은 문제 때문에 다수의 음원을 분리한 뒤, 음원의 위치를 추정하는 방법이 제안되었다. 그러나 음원의 분리 과정에서 생기는 분리 오차가 커서 음원 위치 추정이 정확하지 않다. 이에 본 논문에서는 정확한 음원 위치 추정을 위하여 음원 분리와 음원 위치 추정이 통합된 새로운 알고리즘을 제안한다. 제안한 알고리즘은 음원 위치를 기존의 방법보다 정확하게 추정하는 것을 확인할 수 있었다.
PDF

Direction Estimation of Multiple Sound Sources Using Non-negative Matrix Factorization and Generalized Cross-Correlation (비음수 행렬 분해 및 일반화된 상호상관계수 기법을 이용한 TV시청 환경에서의 다중 음원 방향 추정 방법)

Yu, Seung Woo;Jeon, Kwang Myung;Park, Ji Hyun;Kim, Hong Kook
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.11a
- /
- pp.16-17
- /
- 2015
본 논문에서는 실내 환경 중 TV 시청환경에서 마이크로폰 어레이를 이용하여 다양한 다중 음원 방향을 추정하는 기법을 제안한다. 제안된 기법은 기존의 하나의 음원에 특화되어 있는 GCC-PHAT 기반의 방법을 GCC-PHAT 버퍼와 NMF를 도입하여 다중음원의 방향 추정을 가능하게 만들었다. 제안된 기법의 성능을 평가하기 위해서 실 거주 환경에서 발생하는 소음원과 TV 소리 방향 추정 결과에 대한 실측치와 추정치 간의 오차인 절대 평균오차를 측정하였으며, 실험 결과 제안한 기법이 기존의 방법인 GCC-PHAT보다 우수한 추정 성능을 보임을 확인하였다.
PDF

A Source Separation Algorithm for Stereo Panning Sources (스테레오 패닝 음원을 위한 음원 분리 알고리즘)

Baek, Yong-Hyun;Park, Young-Cheol
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.4 no.2
- /
- pp.77-82
- /
- 2011
In this paper, we investigate source separation algorithms for stereo audio mixed using amplitude panning method. This source separation algorithms can be used in various applications such as up-mixing, speech enhancement, and high quality sound source separation. The methods in this paper estimate the panning angles of individual signals using the principal component analysis being applied in time-frequency tiles of the input signal and independently extract each signal through directional filtering. Performances of the methods were evaluated through computer simulations.
https://doi.org/10.17661/jkiiect.2011.4.2.077 인용 PDF

Passive Range Estimation Based on Towed Line Array in Multi-Target Environment (다중 음원 환경에서의 수동 거리 추정)

양인식;김준환;김기만
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2000.05a
- /
- pp.367-370
- /
- 2000
Various methods of enhancing the performance of passive range sonar arrays have been discussed, triangulation, wavefront curvature method etc. But they are not appropriate to the methods because of very low SNR in underwater environment. We made appropriate sub-arrays in a linear array and applied to the beamformers such as a minimum variance with null constraints.
PDF

The Design of IoT Device System for Disaster Prevention using Sound Source Detection and Location Estimation Algorithm (음원탐지 및 위치 추정 알고리즘을 이용한 방재용 IoT 디바이스 시스템 설계)

Ghil, Min-Sik;Kwak, Dong-Kurl
- Journal of Convergence for Information Technology
- /
- v.10 no.8
- /
- pp.53-59
- /
- 2020
This paper relates to an IoT device system that detects sound source and estimates the sound source location. More specifically, it is a system using a sound source direction detection device that can accurately detect the direction of a sound source by analyzing the difference of arrival time of a sound source signal collected from microphone sensors, and track the generation direction of a sound source using an IoT sensor. As a result of a performance test by generating a sound source, it was confirmed that it operates very accurately within 140dB of the acoustic detection area, within 1 second of response time, and within 1° of directional angle resolution. In the future, based on this design plan, we plan to commercialize it by improving the reliability by reflecting the artificial intelligence algorithm through big data analysis.
https://doi.org/10.22156/CS4SMB.2020.10.08.053 인용 PDF KSCI

A study on sound source segregation of frequency domain binaural model with reflection (반사음이 존재하는 양귀 모델의 음원분리에 관한 연구)

Lee, Chai-Bong
- Journal of the Institute of Convergence Signal Processing
- /
- v.15 no.3
- /
- pp.91-96
- /
- 2014
For Sound source direction and separation method, Frequency Domain Binaural Model(FDBM) shows low computational cost and high performance for sound source separation. This method performs sound source orientation and separation by obtaining the Interaural Phase Difference(IPD) and Interaural Level Difference(ILD) in frequency domain. But the problem of reflection occurs in practical environment. To reduce this reflection, a method to simulate the sound localization of a direct sound, to detect the initial arriving sound, to check the direction of the sound, and to separate the sound is presented. Simulation results show that the direction is estimated to lie close within 10% from the sound source and, in the presence of the reflection, the level of the separation of the sound source is improved by higher Coherence and PESQ(Perceptual Evaluation of Speech Quality) and by lower directional damping than those of the existing FDBM. In case of no reflection, the degree of separation was low.
PDF KSCI

Direction Estimation of Multiple Sound Sources Using Circular Probability Distributions (순환 확률분포를 이용한 다중 음원 방향 추정)

Nam, Seung-Hyon;Kim, Yong-Hoh
- The Journal of the Acoustical Society of Korea
- /
- v.30 no.6
- /
- pp.308-314
- /
- 2011
This paper presents techniques for estimating directions of multiple sound sources ranging from $0^{\circ}$ to $360^{\circ}$ using circular probability distributions having a periodic property. Phase differences containing direction information of sources can be modeled as mixtures of multiple probability distributions and source directions can be estimated by maximizing log-likelihood functions. Although the von Mises distribution is widely used for analyzing this kind of periodic data, we define a new class of circular probability distributions from Gaussian and Laplacian distributions by adopting a modulo operation to have $2{\pi}$-periodicity. Direction estimation with these circular probability distributions is done by implementing corresponding EM (Expectation-Maximization) algorithms. Simulation results in various reverberant environments confirm that Laplacian distribution provides better performance than von Mises and Gaussian distributions.
https://doi.org/10.7776/ASK.2011.30.6.308 인용 PDF KSCI

The Estimaion of Sound source of DIFAR Sonobuoy in Time Domain (DIFAR Sonobuoy의 시간영역에서의 음원 방향 추정)

Kim Jung-Hwa;Lee Baek-Lyeol;Bae Hyeon-Gee;Park Soon-Jong;Kim Chun-Duck;Lim Jung-Bin;Lee Yung-Yook
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.241-244
- /
- 2002
시간영역에서의 음원 방향 추정 알고리즘을 이용하여 수동형 DIFAR Sonobuoy 의 도래각 추정 성능 평가 시스템을 구성하고 추정 오차에 대하여 고찰하였다. 일반 실내에서 음원주파수 $f_0(700Hz\~1.7kHz)$로 입사하는 음원에 대하여 도래각을 추정한 결과 한 주기당 한계 ${\pm}10^{\circ}$ 이내로 약 $80\%$ 이상 추정 결과로 나타났으며 특히, 1.7kHz 의 경우는 ${\pm}2.97^{\circ}$로 적은 오차를 보임에 따라 이 대역에서의 기준 주파수로 평가 시스템에 적용할 수 있음을 확인하였다.
PDF

Nonnegative Matrix Factorization Based Direction-of-Arrival Estimation of Multiple Sound Sources Using Dual Microphone Array (이중 마이크로폰을 이용한 비음수 행렬분해 기반 다중음원 도래각 예측)

Jeon, Kwang Myung;Kim, Hong Kook;Yu, Seung Woo
- Journal of the Institute of Electronics and Information Engineers
- /
- v.54 no.2
- /
- pp.123-129
- /
- 2017
This paper proposes a new nonnegative matrix factorization (NMF) based direction-of-arrival (DOA) estimation method for multiple sound sources using a dual microphone array. First of all, sound signals coming from the dual microphone array are segmented into consecutive analysis frames, and a steered-response power phase transform (SRP-PHAT) beamformer is applied to each frame so that stereo signals of each frame are represented in a time-direction domain. The time-direction outputs of SRP-PHAT are stored for a pre-defined number of frames, which is referred to as a time-direction block. Next, In order to estimate DOAs robust to noise, each time-direction block is normalized along the time by using a block subtraction technique. After that, an unsupervised NMF method is applied to the normalized time-direction block in order to cluster the directions of each sound source in a multiple sound source environments. In particular, the activation and basis matrices are used to estimate the number of sound sources and their DOAs, respectively. The DOA estimation performance of the proposed method is evaluated by measuring a mean absolute error (MAE) and the standard deviation of errors between the oracle and estimated DOAs under a three source condition, where the sources are located in [$-35{\circ}$, 5m], [$12{\circ}$, 4m], and [$38{\circ}$, 4.m] from the dual microphone array. It is shown from the experiment that the proposed method could relatively reduce MAE by 56.83%, compared to a conventional SRP-PHAT based DOA estimation method.
https://doi.org/10.5573/ieie.2017.54.2.123 인용 PDF KSCI

A Real-time Audio Surveillance System Detecting and Localizing Dangerous Sounds for PTZ Camera Surveillance (PTZ 카메라 감시를 위한 실시간 위험 소리 검출 및 음원 방향 추정 소리 감시 시스템)

Nguyen, Viet Quoc;Kang, HoSeok;Chung, Sun-Tae;Cho, Seongwon
- Journal of Korea Multimedia Society
- /
- v.16 no.11
- /
- pp.1272-1280
- /
- 2013
In this paper, we propose an audio surveillance system which can detect and localize dangerous sounds in real-time. The location information about dangerous sounds can render a PTZ camera to be directed so as to catch a snapshot image about the dangerous sound source area and send it to clients instantly. The proposed audio surveillance system firstly detects foreground sounds based on adaptive Gaussian mixture background sound model, and classifies it into one of pre-trained classes of foreground dangerous sounds. For detected dangerous sounds, a sound source localization algorithm based on Dual delay-line algorithm is applied to localize the sound sources. Finally, the proposed system renders a PTZ camera to be oriented towards the dangerous sound source region, and take a snapshot against over the sound source region. Experiment results show that the proposed system can detect foreground dangerous sounds stably and classifies the detected foreground dangerous sounds into correct classes with a precision of 79% while the sound source localization can estimate orientation of the sound source with acceptably small error.
https://doi.org/10.9717/kmms.2013.16.11.1272 인용 PDF KSCI KPUBS HTML

Search Result 42, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)