• Title/Summary/Keyword: Binaural sound

Search Result 57, Processing Time 0.031 seconds

Externalization of sound image in 3D sound system based on headphone

  • Youngsik Yoon;Park, Youngjin
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2002.10a
    • /
    • pp.51.3-51
    • /
    • 2002
  • 3D sound user often finds the results that the sound image appear to originate either inside, or close to, the head when he uses headphone-based binaural system. This phenomenon is called in-head localization(IHL). The main factors were chosen to evaluate externalization performance : individualized HRTFs, near-field HRTF characteristics and reverberation. Direct comparison was conducted among them, especially two factors\ulcorner reverberation and near-field HRTFs.

  • PDF

Evaluation of a signal segregation by FDBM (FDBM의 음원분리 성능평가)

  • Lee, Chai-Bong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.12
    • /
    • pp.1793-1802
    • /
    • 2013
  • Various approaches for sound source segregation have been proposed. Among these approaches, frequency domain binaural model(FDBM) has the advantages of low computational load and effective howling cancellation. A binaural hearing assistance system based on FDBM has been proposed. This system can enhance desired signal based on the directivity information. Although FDBM has been evaluated in terms of signal-to-noise ratio (SNR) and coherence function, the evaluation results do not always agree with the human impressions. These evaluation methods provide physical measures, and do not take account of perceptual aspect of human being. Considering a binaural hearing assistance system as a one of major applications, the quality of segregated sound should keep level enough. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and Perceptual Evaluation of Speech Quality(PESQ), to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and PESQ, to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions.

Speaker Separation Based on Directional Filter and Harmonic Filter (Directional Filter와 Harmonic Filter 기반 화자 분리)

  • Baek, Seung-Eun;Kim, Jin-Young;Na, Seung-You;Choi, Seung-Ho
    • Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.125-136
    • /
    • 2005
  • Automatic speech recognition is much more difficult in real world. Speech recognition according to SIR (Signal to Interface Ratio) is difficult in situations in which noise of surrounding environment and multi-speaker exists. Therefore, study on main speaker's voice extractions a very important field in speech signal processing in binaural sound. In this paper, we used directional filter and harmonic filter among other existing methods to extract the main speaker's information in binaural sound. The main speaker's voice was extracted using directional filter, and other remaining speaker's information was removed using harmonic filter through main speaker's pitch detection. As a result, voice of the main speaker was enhanced.

  • PDF

Audio-Visual Localization and Tracking of Sound Sources Using Kalman Filter (칼만 필터를 이용한 시청각 음원 정위 및 추적)

  • Song, Min-Gyu;Kim, Jin-Young;Na, Seung-You
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.4
    • /
    • pp.519-525
    • /
    • 2007
  • With the high interest on robot technology and application, the research on artificial auditory systems for robot is very active. In this paper we discuss sound source localization and tracing based on audio-visual information. For video signals we use face detection based on skin color model. Also, binaural-based DOA is used as audio information. We integrate both informations using Kalman filter. The experimental results show that audio-visual person tracking Is useful, specially in the case that some informations are not observed.

Audio Format Comparative Study and Suggestion for Next Generation DTV (차세대 디지털 TV 방송을 위한 오디오 규격 비교 분석 및 제언)

  • Lee, Jae-Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.6
    • /
    • pp.337-343
    • /
    • 2011
  • With commencing trial 3D digital broadcasting, the studies on next generation digital broadcasting technology for coming UHDTV era is being actively progressing. In this paper, I propose surround audio formats for next-generation digital TV broadcasting, along with comparative study of major surround audio formats in use or under development. I did comparative study on current major competing surround formats such as Dolby True HD and DTS HD MA, along with NHK proposed 22.2 channel surround format for UHDTV system. Upon this comparative study and our housing situation consideration, I propose lossy compression 3D surround 7.1 channel surround format along with loosless 2.0 and 4.0 hi-fi format as next generation digital TV broadcasting standard. In lieu with this, I also propose transmitting binaural 2 channel audio data as sub-audio. It will give holographic sound experience when properly processed with individual HRTF (Head Related Transfer Function) with headphone. The table for data rate of each proposed audio format is also presented.

Headphone-based multi-channel 3D sound generation using HRTF (HRTF를 이용한 헤드폰 기반의 다채널 입체음향 생성)

  • Kim Siho;Kim Kyunghoon;Bae Keunsung;Choi Songin;Park Manho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.1
    • /
    • pp.71-77
    • /
    • 2005
  • In this paper we implement a headphone-based 5.1 channel 3-dimensional (3D) sound generation system using HRTF (Head Related Transfer Function). Each mono sound source in the 5.1 channel signal is localized on its virtual location by binaural filtering with corresponding HRTFs, and reverberation effect is added for spatialization. To reduce the computational burden, we reduce the number of taps in the HRTF impulse response and model the early reverberation effect with several tens of impulses extracted from the whole impulse sequences. We modified the spectrum of HRTF by weighing the difference of front-back spec01m to reduce the front-back confusion caused by non-individualized HRTF DB. In informal listening test we can confirm that the implemented 3D sound system generates live and rich 3D sound compared with simple stereo or 2 channel down mixing.

Development of Obstacle Alarm for the Visually Impaired (시각 장애인을 위한 장애물 경보기의 개발)

  • 심현민;이응혁;민홍기;홍승홍
    • Proceedings of the IEEK Conference
    • /
    • 2002.06e
    • /
    • pp.113-116
    • /
    • 2002
  • In this paper, we propose the sound-mapping algorithm of the detected obstacle by ultrasonic sensors. We apply this algorithm to a Obstacle alarm for the visually impaired. In our system, we acquire obstacles information using ultrasonic sensors, and transform two-dimensional and distance information into sound-imaging information and vibrator with azimuth (direction) and distance. We implement this system with ultrasonic sensors to more effective expression of the obstacle information. The distance of an obstacle can be expressed by sound pressure level, and azimuth of the obstacles can be expressed by inter-aural time difference (ITD) and inter-aural level difference (ILD) that are two important cues in a binaural system. These are the principal cues for sound localization, to detect sound source. In this system, the obstacle is substituted with a sound source. The visually impaired receive sound information of obstacles by headphone.

  • PDF

Sleep Health Care System using Binaural Beats (바이노럴 비트를 활용한 수면 헬스 케어 시스템)

  • Kim, Kang-Hyeon;Yang, Yoon-Jeong;Park, Jun-Mo;Jeong, Do-Un
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.05a
    • /
    • pp.527-528
    • /
    • 2016
  • In this paper, to provide a binaural beat in the method of treatment of sleep disorders. Let the sound of different frequencies, the waveform of the brain resonate at a specific frequency, to induce a stable state, is converted over to a high-frequency audio frequency, improves the listening experience. To target the three college students for the results performance evaluation of, conducted an experiment on the sense of stability at the time of sleep, in the process, the brain waves were measured as a means to check the degree of sense of stability.

  • PDF

Binaural Interaction Component in Auditory Brainstem Responses with Asymmetric Simultaneous Acoustic Stimulation (비대칭 음 강도 양이 동시 자극 청성뇌간유발반응의 양이간섭치)

  • Heo, S.D.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.8 no.2
    • /
    • pp.95-99
    • /
    • 2014
  • Binaural interaction can recognize the same intensity sound by stimulating two ears alternatively, and it can be record auditory brainstem responses (ABR). However, We needs to be researched about binaural interaction in asymmetric binaural acoustic stimulation. 17 normal young hearing university students were participated. Clicks were presented at the intensity of 90 dB nHL to one ear and the click intensity was increased from 0 to 90 dB nHL with a separation of 10 dB to another ear, simultaneous. BI waveform was obtained by subtracting the sum of the asymmetrically evoked potentials from the binaurally evoked potentials; i.e. BI = B - (L + R). Latency and amplitude was measured 'peak to following trough' of IV-V complex of BI waveform. Threshold of BIC (t-BIC) was obtained using amplitude depend on stimulus intensities (paired sample t-test). Latency shifted in 4.65, 4.63, 4.57, 4.58, 4.62, 4.6, 4.48, 4.36, 4.23 ms for peak, 5.57, 5.51, 5.51, 5.59, 5.61, 5.55, 5.44, 5.28, 5.19 ms for trough, and amplitude shifted in .0.32, -0.3, -0.34, -0.32, -0.42, -0.53, -0.54, -0.61, $-0.67{\mu}V$ from 0 to 90 dB nHL in every 10 dB, respectively. t-BIC was observed 40 dB nHL(p=.001).

  • PDF

HRTF-field Reproduction for Robust Virtual Source Imaging (머리 전달 함수장 재현을 통한 광대역 입체 음향 구현)

  • Choi, Joung-Woo
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.18 no.2
    • /
    • pp.199-207
    • /
    • 2008
  • A hybrid technique that combines the advantages of binaural reproduction and sound field reproduction technique is proposed. The concept of HRTF-field, which is defined as the set of HRTFs corresponding to the various head dislocations, enables us to realize virtual source imaging over a wide area. Conventional binaural($2{\times}2$) reproduction system is redefined as a MIMO system composed of multiple control sources and multiple head locations, and HRTF variations corresponding to various head movement are quantified. Through the direct control of HRTF-field, reproduction error induced by head dislocation can be minimized in least-square-error sense, and consequential disturbances on the virtual source image can be reduced within a selected area. Simple lateralization examples are investigated, and the reproduction error of the proposed technique is compared to that of higher-order Ambisonics.