• Title/Summary/Keyword: Sound-image localization

Search Result 35, Processing Time 0.022 seconds

Sound Source Localization using HRTF database

  • Hwang, Sung-Mok;Park, Young-Jin;Park, Youn-Sik
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.751-755
    • /
    • 2005
  • We propose a sound source localization method using the Head-Related-Transfer-Function (HRTF) to be implemented in a robot platform. In conventional localization methods, the location of a sound source is estimated from the time delays of wave fronts arriving in each microphone standing in an array formation in free-field. In case of a human head this corresponds to Interaural-Time-Delay (ITD) which is simply the time delay of incoming sound waves between the two ears. Although ITD is an excellent sound cue in stimulating a lateral perception on the horizontal plane, confusion is often raised when tracking the sound location from ITD alone because each sound source and its mirror image about the interaural axis share the same ITD. On the other hand, HRTFs associated with a dummy head microphone system or a robot platform with several microphones contain not only the information regarding proper time delays but also phase and magnitude distortions due to diffraction and scattering by the shading object such as the head and body of the platform. As a result, a set of HRTFs for any given platform provides a substantial amount of information as to the whereabouts of the source once proper analysis can be performed. In this study, we introduce new phase and magnitude criteria to be satisfied by a set of output signals from the microphones in order to find the sound source location in accordance with the HRTF database empirically obtained in an anechoic chamber with the given platform. The suggested method is verified through an experiment in a household environment and compared against the conventional method in performance.

  • PDF

HRTF Enhancement Algorithm for Stereo ground Systems (스테레오 시스템을 위한 머리전달함수의 개선)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.4
    • /
    • pp.207-214
    • /
    • 2008
  • To create 3D sound, we usually use two methods which are two channels or multichannel sound systems. Because of cost and space problems, we prefer two channel sound system to multi-channel. Using a headphone or two speakers, the most typical method to create 3D sound effects is a technology of head related transfer function (HRTF) which contains the information that sound arrives from a sound source to the ears of the listener. But it causes a problem to localize a sound source around a certain places which is called cone-of-confusion. In this paper, we proposed the new algorithm to reduce the confusion of sound image localization. HRTF grouping and psychoacoustics theory are used to boost the spectral cue with spectrum difference among each directions. Informal listening tests show that the proposed method improves the front-back sound localization characteristics much better than conventional methods.

A Real-time Audio Surveillance System Detecting and Localizing Dangerous Sounds for PTZ Camera Surveillance (PTZ 카메라 감시를 위한 실시간 위험 소리 검출 및 음원 방향 추정 소리 감시 시스템)

  • Nguyen, Viet Quoc;Kang, HoSeok;Chung, Sun-Tae;Cho, Seongwon
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.11
    • /
    • pp.1272-1280
    • /
    • 2013
  • In this paper, we propose an audio surveillance system which can detect and localize dangerous sounds in real-time. The location information about dangerous sounds can render a PTZ camera to be directed so as to catch a snapshot image about the dangerous sound source area and send it to clients instantly. The proposed audio surveillance system firstly detects foreground sounds based on adaptive Gaussian mixture background sound model, and classifies it into one of pre-trained classes of foreground dangerous sounds. For detected dangerous sounds, a sound source localization algorithm based on Dual delay-line algorithm is applied to localize the sound sources. Finally, the proposed system renders a PTZ camera to be oriented towards the dangerous sound source region, and take a snapshot against over the sound source region. Experiment results show that the proposed system can detect foreground dangerous sounds stably and classifies the detected foreground dangerous sounds into correct classes with a precision of 79% while the sound source localization can estimate orientation of the sound source with acceptably small error.

Improvement of front/back Sound Localization Characteristics using Psychoacoustics of Head Related Transfer Function (머리전달함수의 심리음향적 특성을 이용한 전/후 음상정위 특성 개선)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • Journal of Broadcast Engineering
    • /
    • v.11 no.4 s.33
    • /
    • pp.448-457
    • /
    • 2006
  • HRTF DB, including the information of the sounds which is arrived to our ears, is generally used to make a 3D sound. But it can decline some three-dimensional effects by the confusion between front and back directions due to the non-individual HRTF depending on each listener. In this paper, we propose a new method to use psychoacoustic theory that reduces the confusion of sound image localization. And we make use of an excitation energy by the sense of hearing. This method is brought HRTF spectrum characteristics into relief to draw out the energy ratio about the bark band. Informal listening tests show that the proposed method improves the front-back sound localization characteristics much better than the conventional methods.

Soundsource Localization and Tracking System of Intruder for Intelligent Surveillance System (지능형 감시 시스템 구축을 위한 침입자의 음원 위치 파악 및 추적 시스템)

  • Park, Jung-Hyun;Yeom, Hong-Gi;Jung, Bong-Gyu;Jang, In-Hun;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.6
    • /
    • pp.786-791
    • /
    • 2007
  • In the place that its security is crucial, the necessity of system which can tract and recognize random person is getting more important. In this paper, we'd like to develop the invader tracking system which consists of the sound source tracking-sensor and the pan-tilt camera for wide-area guard. After detecting the direction of any sound with the sound source tracking-sensor at first, our system make move the pan-tilt camera to that direction and extract reference image from that camera. This reference image is compared and updated by the next captured image after some interval time. By keeping on it over again, we can realize the guard system which can tract an invader using the difference image and the result of another image processing. By linking home network security system, the suggested system can provide some interfacing functions for the security service of the public facilities as well as that of home.

IIR Filter Design of HRTF for Implementation of 3D Sound (입체음향 구현을 위한 머리전달함수의 IIR필터 설계)

  • Kim Pan-Gon;Park Jang-Sik;Kim Hyun-Tae
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2005.05a
    • /
    • pp.341-345
    • /
    • 2005
  • In this paper, we propose an algorithm for the approximation of FIR filters by IIR filters. The algorithm is based on a concept of the balanced model reduction. Head-related transfer functions(HRTFs) of dummy-head are approximated by 32-order IIR filters. The binaural sounds using the approximated HRTFs are reproduced by headphone, and serves as a cue of sound image localization. Experiment of sound image are carried out for 10 participants with computer simulation and DSP board respectively. The results of the experiments show that the localization using the approximated HRTFs by IIR filters is the same accuracy as the case of FIR filters that simulate the HRTFs.

  • PDF

Sound Diffusion Control for the Localized Sound Image Using Time Delay (방향 정위된 음원에 시간지연을 이용한 확산감 제어에 관한 연구)

  • 김익형;정의필
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.135-138
    • /
    • 2001
  • Many researchers have developed the techniques of an efficient 3-D sound system based on the psycho-acoustics of spatial hearing with multimedia or virtual reality In this paper, we propose an idea for the improved 3-D sound system using conventional stereo headphones to obtain a better sound diffusion from the mono-sound recorded at an anechoic chamber. We use the HRTF (Head Related Transfer Function) for the sound localization and the wavelet filter bank with time delay for the sound diffusion. We investigate the effects of the 3-B sound depending on the length of time delay at lowest frequency band. Also the correlation coefficient of the signals between the left channel and the right channel is measured to identify the sound diffusion.

  • PDF

A Study on Enhancement of 3D Sound Using Improved HRTFS (개선된 머리전달함수를 이용한 3차원 입체음향 성능 개선 연구)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.6
    • /
    • pp.557-565
    • /
    • 2009
  • To perceive the direction and the distance of a sound, we always use a couple of information. Head Related Transfer Function (HRTF) contains the information that sound arrives from a sound source to the ears of the listener, like differences of level, phase and frequency spectrum. For a reproduction system using 2 channels, we apply HRTF to many algorithms which make 3d sound. But it causes a problem to localize a sound source around a certain places which is called the cone-of-confusion. In this paper, we proposed the new algorithm to reduce the confusion of sound image localization. The difference of frequency spectrum and psychoacoustics theory are used to boost the spectral cue among each directions. To confirm the performance of the algorithm, informal listening tests are carried out. As a result, we can make the improved 3d sound in 2 channel system based on a headphone. Also sound quality of improved 3d sound is much better than conventional methods.

Improving a Sound Localization Using 1/3-octave Band Pass Filter (1/3-옥타브 대역통과필터를 이용한 음상정위기법 성능 향상)

  • Hwang, Shin;Yang, Jin-Woo;Cheung, Wan-Sup;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.98-103
    • /
    • 2001
  • The binaural auditory system of human has the capability of differentiating the direction and distance of sound sources. This feature is well characterised in terms of the inter-aural intensity difference (IID), the inter-aural time difference (ITD) and/or the spectral shape difference (SSD) arising from the acoustic transfer of a sound source to the outer ears. This paper proposes an effective way of extracting the three sound perception factors (IID, ITD, SSD) from the head-related transfer functions (HRTF's) that depends on the direction and distance of the acoustic source from the listener. It includes the estimation method of the equivalent ITD and 1/3-octave band-based IID factors and their usage to locate a sound source in space. Subjective and objective tests were carried out to examine the effectiveness of the proposed methodology and its applicability to real sound systems. Those experimental results are illustrated in this paper.

  • PDF

Impulsive sound localization using crest factor of the time-domain beamformer output (빔형성기 출력의 파고율을 이용한 충격음의 방향 추정)

  • Seo, Dae-Hoon;Choi, Jung-Woo;Kim, Yang-Hann
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2014.10a
    • /
    • pp.713-717
    • /
    • 2014
  • This paper presents a beamforming technique for locating impulsive sound source. The conventional frequency-domain beamformer is advantageous for localizing noise sources for a certain frequency band of concern, but the existence of many frequency components in the wide-band spectrum of impulsive noise makes the beamforming image less clear. In contrast to a frequency-domain beamformer, it has been reported that a time-domain beamformer can be better suited for transient signals. Although both frequency- and time-domain beamformers produce the same result for the beamforming power, which is defined as the RMS value of its output, we can use alternative directional estimators such as the peak value and crest factor to enhance the performance of a time-domain beamformer. In this study, the performance of three different directional estimators, the peak, crest factor and RMS output values, are investigated and compared with the incoherent interfering noise embedded in multiple microphone signals. The proposed formula is verified via experiments in an anechoic chamber using a uniformly spaced linear array. The results show that the peak estimation of beamformer output determines the location with better spatial resolution and a lower side lobe level than crest factor and RMS estimation in noise free condition, but it is possible to accurately estimate the direction of the impulsive sound source using crest factor estimation in noisy environment with stationary interfering noise.

  • PDF