• Title/Summary/Keyword: Head-related Impulse Response

Search Result 14, Processing Time 0.028 seconds

Modeling of individual head-related impulse responses using a set of general basis functions (보편적인 기저함수를 이용한 개인의 머리전달함수 모델링)

  • Hwang, Sung-Mok;Park, Young-Jin;Park, Youn-Sik
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.1430-1436
    • /
    • 2007
  • A principal components analysis (PCA) of the median head-related impulse responses (HRIRs) in the CIPIC HRTF database reveals that the individual HRIRs can be adequately reconstructed by a linear combination of 12 orthonormal basis functions. These basis functions can be used generally to model arbitrary HRIRs, which are not included in the process to obtain the basis functions. To clarify whether these basis functions can be used to model other set of arbitrary HRIRs, an numerical error analysis for modeling and a series of subjective listening tests were carried out using the measured and modeled HRIRs. The results showed that the set of individual HRIRs, which were measured in our lab using different measurement conditions, techniques, and source positions, can be well modeled with reasonable accuracy. Furthermore, all subjects reported not only the accurate vertical perception but also the front-back discrimination with the modeled HRIRs based on 12 basis functions. However, as less basis functions were used for HRIR modeling, the modeling accuracy and localization performance deteriorated.

  • PDF

Modeling of Median-plane Head-related Impulse Responses Using a Set of General Basis Functions (보편적인 기저함수를 이용한 중앙면상의 머리전달함수 모델링)

  • Hwang, Sung-Mook;Park, Young-Jin;Park, Youn-Sik
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.18 no.4
    • /
    • pp.448-457
    • /
    • 2008
  • A principal components analysis (PCA) of the median-plane head-related impulse responses (HRIRs) in the CIPIC HRTF database reveals that the individual HRIRs in the median plane can be adequately reconstructed by a linear combination of 12 orthonormal basis functions. These basis functions can be used to model arbitrary median-plane HRIRs, which are not included in the process to obtain the basis functions. Memory size can be reduced up to 5-fold depending on the number of HRIRs to be modeled. To clarify whether these basis functions can be used to model other set of arbitrary median plane HRIRs, a numerical error analysis for modeling and a series of subjective listening tests were carried out using the measured and modeled HRIRs. The results showed that the set of individual HRIRs in the median plane, which were measured in our lab using different measurement conditions, techniques, and source positions, can be modeled with reasonable accuracy. All subjects, involved in the subjective listening test, reported not only the accurate vertical perception but also the front-back discrimination with the modeled HRIRs based on 12 basis functions.

A DNN-Based Personalized HRTF Estimation Method for 3D Immersive Audio

  • Son, Ji Su;Choi, Seung Ho
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.1
    • /
    • pp.161-167
    • /
    • 2021
  • This paper proposes a new personalized HRTF estimation method which is based on a deep neural network (DNN) model and improved elevation reproduction using a notch filter. In the previous study, a DNN model was proposed that estimates the magnitude of HRTF by using anthropometric measurements [1]. However, since this method uses zero-phase without estimating the phase, it causes the internalization (i.e., the inside-the-head localization) of sound when listening the spatial sound. We devise a method to estimate both the magnitude and phase of HRTF based on the DNN model. Personalized HRIR was estimated using the anthropometric measurements including detailed data of the head, torso, shoulders and ears as inputs for the DNN model. After that, the estimated HRIR was filtered with an appropriate notch filter to improve elevation reproduction. In order to evaluate the performance, both of the objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated HRTF are measured. For subjective evaluation, the MUSHRA test and preference test are conducted. As a result, the proposed method can make listeners experience more immersive audio than the previous methods.

Proposal on the Diagnostic Criteria of Definite Isolated Otolith Dysfunction

  • Park, Han Gyeol;Lee, Jun Ho;Oh, Seung Ha;Park, Moo Kyun;Suh, Myung-Whan
    • Journal of Audiology & Otology
    • /
    • v.23 no.2
    • /
    • pp.103-111
    • /
    • 2019
  • Background and Objectives: Dizzy patients with abnormal otolith function tests, despite a normal caloric response, are defined as having specific (isolated) otolith organ dysfunction. This study was performed to compare the differences in clinical presentation between isolated otolith dysfunction (iOD) patients with lab- and Sx-based iOD group and lab-based iOD symptoms. Subjects and Methods: The medical records of 23 iOD patients with normal caloric response but abnormal cervical vestibular evoked myogenic potential (VEMP), ocular VEMP, or subjective visual vertical were reviewed. Non-spinning vertigo was considered as otolith-related symptoms. The patients' age, onset of dizziness, Numeric Rating Scale on the severity of dizziness, and concomitant vestibular disorders were analyzed. Results: Patients in the lab-based iOD group were significantly older than those in the lab- and Sx-based iOD group. Known vestibular disorders were significantly more common in the lab-based iOD group (83.3%) compared to the lab- and Sx-based iOD group (18.2%). Despite the normal caloric response, catch-up saccade was found in the video head impulse test in more than half (54.5%) of the lab-based iOD group patients. There was no catch-up saccade in the lab- and Sx-based iOD group. There were no significant differences in gender ratio, frequency of dizziness attacks, and duration of illness. Conclusions: We propose new definitions of definite iOD (lab- and Sx-based iOD) and probable iOD (lab- or Sx-based iOD). These new definitions may help researchers to identify patients who are more likely to have true iOD, and facilitate comparisons of results between different studies.

Proposal on the Diagnostic Criteria of Definite Isolated Otolith Dysfunction

  • Park, Han Gyeol;Lee, Jun Ho;Oh, Seung Ha;Park, Moo Kyun;Suh, Myung-Whan
    • Korean Journal of Audiology
    • /
    • v.23 no.2
    • /
    • pp.103-111
    • /
    • 2019
  • Background and Objectives: Dizzy patients with abnormal otolith function tests, despite a normal caloric response, are defined as having specific (isolated) otolith organ dysfunction. This study was performed to compare the differences in clinical presentation between isolated otolith dysfunction (iOD) patients with lab- and Sx-based iOD group and lab-based iOD symptoms. Subjects and Methods: The medical records of 23 iOD patients with normal caloric response but abnormal cervical vestibular evoked myogenic potential (VEMP), ocular VEMP, or subjective visual vertical were reviewed. Non-spinning vertigo was considered as otolith-related symptoms. The patients' age, onset of dizziness, Numeric Rating Scale on the severity of dizziness, and concomitant vestibular disorders were analyzed. Results: Patients in the lab-based iOD group were significantly older than those in the lab- and Sx-based iOD group. Known vestibular disorders were significantly more common in the lab-based iOD group (83.3%) compared to the lab- and Sx-based iOD group (18.2%). Despite the normal caloric response, catch-up saccade was found in the video head impulse test in more than half (54.5%) of the lab-based iOD group patients. There was no catch-up saccade in the lab- and Sx-based iOD group. There were no significant differences in gender ratio, frequency of dizziness attacks, and duration of illness. Conclusions: We propose new definitions of definite iOD (lab- and Sx-based iOD) and probable iOD (lab- or Sx-based iOD). These new definitions may help researchers to identify patients who are more likely to have true iOD, and facilitate comparisons of results between different studies.

HRTF Interpolation Using a Spherical Head Model (원형 머리 모델을 이용한 머리 전달 함수의 보간)

  • Lee, Ki-Seung;Lee, Seok-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.7
    • /
    • pp.333-341
    • /
    • 2008
  • In this paper, a new interpolation model for the head related transfer function (HRTF) was proposed. In the method herein, we assume that the impulse response of the HRTF for each azimuth angle is given by linear interpolation of the time-delayed neighboring impulse responses of HRTFs. The time delay of the HRTF for each azimuth angle is given by sum of the sound wave propagation time from the ears to the sound source, which can be estimated by using azimuth angle, the physical shape of the underlying head and the distance between the head and sound source, and the refinement time yielding the minimum mean square error. Moreover, in the proposed model, the interpolation intervals were not fixed but varied, which were determined by minimizing the total number of HRTFs while the synthesized signals have no perceptual difference from the original signals in terms of sound location. To validate the usefulness of the proposed interpolation model, the proposed model was applied to the several HRTFs that were obtained from one dummy-head and three human heads. We used the HRTFs that have 5 degree azimuth angle resolution at 0 degree elevation (horizontal plane). The experimental results showed that using only $30\sim40%$ of the original HRTFs were sufficient for producing the signals that have no audible differences from the original ones in terms of sound location.

Headphone-based multi-channel 3D sound generation using HRTF (HRTF를 이용한 헤드폰 기반의 다채널 입체음향 생성)

  • Kim Siho;Kim Kyunghoon;Bae Keunsung;Choi Songin;Park Manho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.1
    • /
    • pp.71-77
    • /
    • 2005
  • In this paper we implement a headphone-based 5.1 channel 3-dimensional (3D) sound generation system using HRTF (Head Related Transfer Function). Each mono sound source in the 5.1 channel signal is localized on its virtual location by binaural filtering with corresponding HRTFs, and reverberation effect is added for spatialization. To reduce the computational burden, we reduce the number of taps in the HRTF impulse response and model the early reverberation effect with several tens of impulses extracted from the whole impulse sequences. We modified the spectrum of HRTF by weighing the difference of front-back spec01m to reduce the front-back confusion caused by non-individualized HRTF DB. In informal listening test we can confirm that the implemented 3D sound system generates live and rich 3D sound compared with simple stereo or 2 channel down mixing.

Design and Implementation of Crosstalk Canceller Using Warped Common Acoustical Poles (주파수 워핑된 공통 극점을 이용한 음향 간섭제거기의 설계 및 구현)

  • Jeong, Jae-Woong;Park, Young-Cheol;Youn, Dae-Hee;Lee, Seok-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.5
    • /
    • pp.339-346
    • /
    • 2010
  • For the implementation of the crosstalk canceller, the filters with large length are needed, which is because that the length of the filters greatly depends on the length of the head-related impulse responses. In order to reduce the length of the crosstalk cancellation filters, many methods such as frequency warping, common acoustical pole and zero (CAPZ) modeling have been researched. In this paper, we propose a new method combining these two methods. To accomplish this, we design the filters using the CAPZ modeling on the warped domain, and then, we implement the filters using the poles and zeros de-warped to the linear domain. The proposed method provides improved channel separation performance through the frequency warping and significant reduction of the complexity through the CAPZ modeling. These are confirmed through various computer simulations.

Sound Externalization using Multichannel Room Impulse Response (멀티채널 룸임펄스 응답 기반 외재화 알고리즘)

  • Jang, In-Seon;Lee, Yong-Ju;Jang, Dae-Young;Kang, Kyeong-Ok
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2008.02a
    • /
    • pp.139-142
    • /
    • 2008
  • 헤드폰 또는 이어폰으로 오디오 청취 시 흔히 음상이 머리 내부에 맺히는 현상이 발생하게 되며, 이러한 현상을 음상 내재화(Inside Head Localization; IHL)라 한다. 오디오의 음상이 머리 주변 혹은 머리 내부에 맺히게 되면 공간감이나 입체감이 떨어지게 되어 음향의 현실감을 저하시키게 되며 또한 청취에 따른 피로도가 증가하게 된다. 이러한 음상 내재화 현상을 제거하여, 헤드폰/이어폰을 통해 오디오 청취 시 음상이 머리의 외부에 맺히도록(Out of Head Localization; OHL) 하는 기술을 음상 외재화(Sound Externalization) 기술이라 한다. 룸 임펄스 응답이 방향 큐와 연계하여 생성되었을 경우 외재화가 가능하다는 실험적 사실을 바탕으로 기존의 음상 외재화 방법은 일반적인 HRTF (Head Related Transfer Function)를 이용하여 외재화 필터를 구성해왔다. 본 논문에서는 구체마이크로폰을 이용하여 녹음한 멀티채널 룸 임펄스 응답을 기반으로 모델링 된 외재화 필터를 이용한 음원 외재화 방법을 제안한다. 또한 실험 및 결과 분석을 통하여 본 알고리즘의 전방 음원 외재화 성능의 우수성을 입증하고, 외재화 알고리즘 적용 후의원 신호 음상 보존 성능을 확인한다.

  • PDF

A study on the simplification of HRTF within high frequency region (고역 주파수 영역에서 HRTF의 간략화에 관한 연구)

  • Lee, Chai-Bong
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.12 no.1
    • /
    • pp.1-6
    • /
    • 2011
  • In this study, we investigated the effect of the simplification for high frequency region in Head-Related Transfer Function(HRTF) on the sound localization. For this purpose, HRTF was measured and analyzed. The result in the HRTF frequency characteristic of the back sound source showed that the decrease revel of high frequency was smaller than that of low frequency region, which means the possibility of simplification in the high frequency region. Simplification was performed by flattening of the high frequency amplitude characteristics with the insertion of the low-pass filter, whose cutoff frequency is given by boundary frequency. Auditory experiments were performed to evaluate the simplified HRTF. The result showed that direction perception was not influenced by the simplification of the frequency characteristics of HRTF for the error of sound localization. The rate of confusion for the front and back was not affected by the simplification of the frequency characteristics over 8kHz of HRTF. Finally, we made it clear that the sound localization was not affected by the simplification of frequency characteristics of HRTF over 8kHz.