Search | Korea Science

Deep learning-based approach to improve the accuracy of time difference of arrival - based sound source localization (도달시간차 기반의 음원 위치 추정법의 정확도 향상을 위한 딥러닝 적용 연구)

Iljoo Jeong;Hyunsuk Huh;In-Jee Jung;Seungchul Lee
- The Journal of the Acoustical Society of Korea
- /
- v.43 no.2
- /
- pp.178-183
- /
- 2024
This study introduces an enhanced sound source localization technique, bolstered by a data-driven deep learning approach, to improve the precision and accuracy of direction of arrival estimation. Focused on refining Time Difference Of Arrival (TDOA) based sound source localization, the research hinges on accurately estimating TDOA from cross-correlation functions. Accurately estimating the TDOA still remains a limitation in this research field because the measured value from actual microphones are mixed with a lot of noise. Additionally, the digitization process of acoustic signals introduces quantization errors, associated with the sampling frequency of the measurement system, that limit the precision of TDOA estimation. A deep learning-based approach is designed to overcome these limitations in TDOA accuracy and precision. To validate the method, we conduct comprehensive evaluations using both two and three-microphone array configurations. Moreover, the feasibility and real-world applicability of the suggested method are further substantiated through experiments conducted in an anechoic chamber.
https://doi.org/10.7776/ASK.2024.43.2.178 인용 PDF

Speech Enhancement Using Microphone Array with MMSE-STSA Estimator Based Post-Processing (MMSE-STSA 추정치에 기반한 후처리를 갖는 마이크로폰 배열을 이용한 음성 개선)

Kwon Hong Seok;Son Jong Mok;Bae Keun Sung
- Proceedings of the KSPS conference
- /
- 2002.11a
- /
- pp.187-190
- /
- 2002
In this paper, a speech enhancement system using microphone array with MMSE-STSA (Minimum Mean Square Error-Short Time Spectral Amplitude) estimator based post-processing is proposed. Speech enhancement is first carried out by conventional delay-and-sum beamforming (DSB). A new MMSE-STSA estimator is then obtained by refining MMSE-STSA estimators from each microphone, which is applied to the output of conventional DSB to obtain additional speech enhancement. Computer simulation for white and pink noises show that the proposed system is superior to other approaches.
PDF

Frequency Domain Blind Source Seperation Using Cross-Correlation of Input Signals (입력신호 상호상관을 이용한 주파수 영역 블라인드 음원 분리)

Sung Chang Sook;Park Jang Sik;Son Kyung Sik;Park Keun-Soo
- Journal of Korea Multimedia Society
- /
- v.8 no.3
- /
- pp.328-335
- /
- 2005
This paper proposes a frequency domain independent component analysis (ICA) algorithm to separate the mixed speech signals using a multiple microphone array By estimating the delay timings using a input cross-correlation, even in the delayed mixture case, we propose a good initial value setting method which leads to optimal convergence. To reduce the calculation, separation process is performed at frequency domain. The results of simulations confirms the better performances of the proposed algorithm.
PDF

Microphone Array Based Speech Enhancement Using Independent Vector Analysis (마이크로폰 배열에서 독립벡터분석 기법을 이용한 잡음음성의 음질 개선)

Wang, Xingyang;Quan, Xingri;Bae, Keunsung
- Phonetics and Speech Sciences
- /
- v.4 no.4
- /
- pp.87-92
- /
- 2012
Speech enhancement aims to improve speech quality by removing background noise from noisy speech. Independent vector analysis is a type of frequency-domain independent component analysis method that is known to be free from the frequency bin permutation problem in the process of blind source separation from multi-channel inputs. This paper proposed a new method of microphone array based speech enhancement that combines independent vector analysis and beamforming techniques. Independent vector analysis is used to separate speech and noise components from multi-channel noisy speech, and delay-sum beamforming is used to determine the enhanced speech among the separated signals. To verify the effectiveness of the proposed method, experiments for computer simulated multi-channel noisy speech with various signal-to-noise ratios were carried out, and both PESQ and output signal-to-noise ratio were obtained as objective speech quality measures. Experimental results have shown that the proposed method is superior to the conventional microphone array based noise removal approach like GSC beamforming in the speech enhancement.
https://doi.org/10.13064/KSSS.2012.4.4.087 인용 PDF

Compensation for Spectral Variance in Scan-Based Planar Acoustical Holography (스캐닝 평면 음향 홀로그래피에서의 스펙트럴 분산 보정)

;;J. S. Bolton
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2002.05a
- /
- pp.520-524
- /
- 2002
Multi-reference, scan-based Acoustical Holography is a useful measurement technique when insufficient microphones are available to measure a complete hologram at once. When the sound sources are stationary, the whole hologram can be constructed by joining together sub-holograms captured using a relatively small scan array. Here that approach is extended by the development of a formulation that explicitly includes the acoustical transfer functions between the reference microphones and the scanning microphones. Based on those expressions, a compensation procedure of spectral variance due to source-non-stationarity is proposed. It has been verified both numerically and experimentally that this procedure can help suppress spatially distributed noise caused by the source level non-stationarity that is always present in a measurement.
PDF

A Study for the Sound Reinforcement System in Conference Room Using Linear Array Microphone (직선배열 마이크로폰을 사용한 회의용확성장치에 관한 연구)

;Masa-to, Abe;Ken-Ich, Kido
- The Journal of the Acoustical Society of Korea
- /
- v.4 no.1
- /
- pp.35-42
- /
- 1985
This paper suggests a method on the use of very long linear array microphone in order to obtain clear loud sound reinforcement system without howling. According to the results of the theoretical investigation, we have made that a linear array microphone. This is made of one hundred small condenser microphone having 2 cm of spatical period. To estimate the effect of sound reinforcement system physically and subjectively, four cases have been experimented : In case of using no sound reinforcement, nondirectivity microphone, rectangular window and Hnning window in linear array microphone. The experimental results prove that the case of Hnning window in linear array microphone is more excellent.
PDF

Development of Directional Sound Source in Air by Using Parametric Array- (파라메트릭 송파 방식을 이용한 기중 지향성 음원의 개발)

Moon Byung-Cheon;Kim Moo-Joon;Ha Kang-Lyeol;Kim Chun-Duck
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.291-294
- /
- 2000
음파의 비선형 현상을 통해 수행되는 파라메트릭 송파 방식을 공기 중에서의 음향 변환기에 적용시키기 위해 비교적 공진 주파수가 낮은 기중 초음파 발생 소자들을 배열시켜 기중 지향성 음원을 제작하였다. 이 송파기를 이용하여 비선형 왜곡이 발생함을 알아보기 위하여 2kHz의 차주파수를 갖도록 37.12kHz와 39.12kHz의 메인 주파수 음파를 신호발생기로부터 인가하고 이를 방사한 후 마이크로폰을 통해 수신하였다. 공기 중에서의 파라메트릭 효과의 확인을 통하여 파라메트릭 송파기에서 방사되는 메인 주파수 음파와 전파 경로 중에서 생성되는 차주파수 음파의 거동을 고찰하고, 실험 결과로부터 기중 파라메트릭 송파기 개발의 가능성을 검토하여 보았다.
PDF

Noise Statistics Estimation Using Target-to-Noise Contribution Ratio for Parameterized Multichannel Wiener Filter (변수내장형 다채널 위너필터를 위한 목적신호대잡음 기여비를 이용한 잡음추정기법)

Hong, Jungpyo
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.26 no.12
- /
- pp.1926-1933
- /
- 2022
Parameterized multichannel Wiener filter (PMWF) is a linear filter that can control the trade-off between residual noise and signal distortion using the embedded parameter. To apply the PMWF to noisy inputs, accurate noise estimation is important and multichannel minima-controlled recursive averaging (MMCRA) is widely used. However, in the case of the MMCRA, the accuracy of noise estimation decreases when a directional interference is involved into the array inputs. Consequently, the performance of the PMWF is degraded. Therefore, we propose a noise power spectral density (PSD) estimation method for the PMWF in this paper. The proposed method is based on a consecutive process of eigenvalue decomposition on noisy input PSD, estimation of the target component contribution using directional information, and exponential weighting for improved estimation of the target contribution. For evaluation, four objective measures were compared with the MMCRA and we verify that the PMWF with the proposed noise estimation method can improve performance in environments where directional interfereces exist.
https://doi.org/10.6109/jkiice.2022.26.12.1926 인용 PDF KSCI

A Name Recognition Based Call-and-Come Service for Home Robots (가정용 로봇의 호출음 등록 및 인식 시스템)

Oh, Yoo-Rhee;Yoon, Jae-Sam;Park, Ji-Hun;Kim, Min-A;Kim, Hong-Kook;Kong, Dong-Geon;Myung, Hyun;Bang, Seok-Won
- 한국HCI학회:학술대회논문집
- /
- 2008.02a
- /
- pp.360-365
- /
- 2008
We propose an efficient robot name registration and recognition method in order to enable a Call-and-Come service for home robots. In the proposed method for the name registration, the search space is first restricted by using monophone-based acoustic models. Second, the registration of robot names is completed by using triphone-based acoustic models in the restricted search space. Next, the parameter for the utterance verification is calculated to reduce the acceptance rate of false calls. In addition, acoustic models are adapted by using a distance speech database to improve the performance of distance speech recognition, Moreover, the location of a user is estimated by using a microphone array. The experimental result on the registration and recognition of robot names shows that the word accuracy of speech recognition is 98.3%.
PDF

Can We Hear the Shape of a Noise Source\ulcorner (소음원의 모양을 들어서 상상할 수 있을까\ulcorner)

Kim, Yang-Hann
- Transactions of the Korean Society for Noise and Vibration Engineering
- /
- v.14 no.7
- /
- pp.586-603
- /
- 2004
One of the subtle problems that make noise control difficult for engineers is “the invisibility of noise or sound.” The visual image of noise often helps to determine an appropriate means for noise control. There have been many attempts to fulfill this rather challenging objective. Theoretical or numerical means to visualize the sound field have been attempted and as a result, a great deal of progress has been accomplished, for example in the field of visualization of turbulent noise. However, most of the numerical methods are not quite ready to be applied practically to noise control issues. In the meantime, fast progress has made it possible instrumentally by using multiple microphones and fast signal processing systems, although these systems are not perfect but are useful. The state of the art system is recently available but still has many problematic issues : for example, how we can implement the visualized noise field. The constructed noise or sound picture always consists of bias and random errors, and consequently it is often difficult to determine the origin of the noise and the spatial shape of noise, as highlighted in the title. The first part of this paper introduces a brief history, which is associated with “sound visualization,” from Leonardo da Vinci's famous drawing on vortex street (Fig. 1) to modern acoustic holography and what has been accomplished by a line or surface array. The second part introduces the difficulties and the recent studies. These include de-Dopplerization and do-reverberation methods. The former is essential for visualizing a moving noise source, such as cars or trains. The latter relates to what produces noise in a room or closed space. Another mar issue associated this sound/noise visualization is whether or not Ivecan distinguish mutual dependence of noise in space : for example, we are asked to answer the question, “Can we see two birds singing or one bird with two beaks?"
https://doi.org/10.5050/KSNVN.2004.14.7.586 인용 PDF KSCI

Search Result 41, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)