Search | Korea Science

Speaker Linear Distortion Canceller Considering Properties of Human Ear (귀의 특성을 고려한 스피커의 선형 왜곡보상 시스템의 설계 및 구현에 관한 연구)

이원우;허정권;김진영;성굉모
- The Journal of the Acoustical Society of Korea
- /
- v.12 no.5
- /
- pp.38-46
- /
- 1993
스피커는 입력이 전기신호이고 출력이 소리인 전기 음향 장치로서, 고유의 전달함수를 갖는 하나의 시스템으로 생각할 수 있다. 일한 스피커 시스템의 전달함수는 여러 가지 원인으로 인해 주파수 영역에서 평탄한 특성을 가지지 못하며 선형위상의 조건을 만족시키기가 힘들다. 음색을 변화시키는 스피커의 전달함수를 선형이라고 가정할 때 스피커의 앞단에서 스피커의 선형 전달함수의 역에 해당하는 선형 필터를 추가함으로쏘, 음색의 왜곡은 보정할 수 있다. 본 연구에서는 이러한 스피커의 선형왜곡을 보상하기 위한 선형필터로서 FIR 필터를 사용하였으나, 2장에 제시되어 있는 기존의 알고리듬과는달리 귀의 청감특성을 이용하여 가청주파수 대역에 대해 몇 개의 대역으로 나눈 후, 각 대역마다 보상 할 주파수 해상도를 달리하는 새로운 알고리듬에 대해 기술하였다. 이러한 다해상도 선형왜곡 보상시스템은 각 대역마다 다른 해상도로 보상한다는점에서는 필터뱅크 보상 시스템과 유사하지만, 대역통과필터가 아닌 저역통과필터만을 이용하여 설계의 복잡서응ㄹ 줄였으며 각 대역마다 독립적으로 보상하는 필터뱅크 보상시스템과는 달리 1차 대역에서 보상하지 ath한 오차를 2차, 3차에서 보상하는 새로운 기법을 이용하였다. 본 알고리듬을 적용한 모의실험을 통하여 성능을 평가한 결과 가청음대역에서 ±6dB의 왜곡을 보인 스피커가 실효 tap수를 272로 했을 때 ±0.5dB의 리플을 보이는 양호한 결과를 얻었다.
PDF

Speech Recognition in Noise Environments Using SPLICE with Phonetic Information (음성학적인 정보를 포함한 SPLICE를 이용한 잡음환경에서의 음성인식)

Kim Doo Hee;Kim Hyung Soon
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.83-86
- /
- 2002
훈련과정과 인식과정에서의 주변환경 잡음과 채널 특성 등의 불일치는 음성인식 성능을 급격히 저하시킨다. 이러한 불일치를 보상하기 위해서 켑스트럼 영역에서의 다양한 전처리 방법이 시도되고 있으며 최근에는 stereo 데이터와 잡음 음성의 Gaussian Mixture Model (GMM)을 이용해 보상벡터를 구하는 SPLICE 방법이 좋은 결과를 보이고 있다(1). 기존의 SPLICE가 전체 발성에 대해서 음향학적인 정보만으로 Gaussian 모델을 구하는 반면 본 논문에서는 발성에 해당하는 음소정보를 고려하여 전체 음향 공간을 각 음소에 대해 나누어서 모델링하고 각 음소에 대한 Gaussian 모델과 그 음소에 해당하는 음성데이터만을 이용하여 음소별 보상벡터가 훈련되도록 하였다. 이 경우 보상벡터는 잡음이 각 음소에 미치는 영향을 보다 자세히 나타내게 된다. Aurora 2 데이터베이스를 이용한 실험결과, 제안된 방법이 기존의 SPLICE방법에 비해 성능향상을 보였다.
PDF

Performance Improvement of Speech Recognition based on Stereo Data with Dimensionally Weighted Bias Compensation (스테레오 데이터에 기반한 차원별 가중 보상에 의한 음성 인식 성능 향상)

Kim Jong Hyeon;Song Hwa Jeon;Kim Hyung Soon
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.139-142
- /
- 2004
훈련 과정과 인식 과정사이의 주변 잡음과 채널 특성으로 인한 환경의 불일치는 음성 인식 성능을 급격히 저하시킨다. 이러한 차이를 극복하기 위해 다양한 전처리 방법이 제안되어 왔으며, 최근에는 스테레오 데이터와 잡음 음성의 Gaussian Mixture Model(GMM)을 이용하여 보상벡터를 구하는 SPLICE 방법이 좋은 성능을 보여주고 있다. 하지만 차원별로 특징벡터를 보상해주는 추정된 보상벡터는 underestimation되는 경향이 있으며, 그 정도가 각각의 차원마다 달라짐이 관찰되었다. 본 논문에서는 SPLICE 방법에 기반하여 추정된 보상벡터와 실제 보상벡터 사이의 관계를 관찰하여 차원별로 다른 가중치를 적용하는 차원별 가중 보상 방법을 제안하였다. 제안한 방법은 Aurora2 Clean-condition인 경우 baseline 실험 결과에 비해 $68\%$의 높은 상대적인 인식 향상율을 얻었다.
PDF

Compensation of the Non-linearity of the Audio Power Amplifier Converged with Digital Signal Processing Technic (디지털 신호 처리 기술을 융합한 음향 전력 증폭기의 비선형 보상)

Eun, Changsoo;Lee, Yu-chil
- Journal of the Korea Convergence Society
- /
- v.7 no.3
- /
- pp.77-85
- /
- 2016
We propose a digital signal processing technic that can compensate the non-linearity inherent in audio amplifiers, and present the result of the simulation. The inherent non-linearity of the audio power amplifier arising from analog devices is compensated via a digital signal processing technic consisting of indirect learning architecture and an adaptive filter. The simulation results show that the compensator can be realized using a third-order polynomial and compensates odd-order non-linearity efficiently. The even-oder non-linearity is mainly due to the dc offset at the output, which is difficult to eliminate with the proposed method. Care must be taken in designing the bias circuit to avoid the DC offset at the output. The proposed technic has significance in that digital signal processing technic can compensate for the impairment that is an inherent characteristic of an analog system.
https://doi.org/10.15207/JKCS.2016.7.3.077 인용 PDF KSCI

Robust Beamforming Method by Linear Array Shape Estimation using Kanlman Filter (칼만필터 선배열 형상 추정에 의한 견실한 빔형성 기법 연구)

한정우
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1996.06a
- /
- pp.18-21
- /
- 1996
예인형 선배열 소나에서 기존의 선배열 빔형성 기법을 적용하면 예인함 기동시 선배열 형상 변형에 의해 심각한 성능 저하 현상이 나타난다. 이러한 문제 해결을 위해 센서배열 변형을 고려한 견실한 빔형성 기법은 선배열 형상 추정을 위해 칼만필터를 이용한 형상 추정기법을 이용하고, 변형보상 빔형성을 위해 선배열 세그먼트 기울기 추정값을 이용하는 변형보상 빔형성 알고리즘을 제안하였다. 제안된 기법을 기존의 빔형성 기법을 적용하여 시뮬레이션에 의해 제안된 기법의 보상성능을 입증 하였다.
PDF

A Study on the Audio Compensation System (음향 보상 시스템에 관한 연구)

Jeoung, Byung-Chul;Won, Chung-Sang
- The Journal of the Acoustical Society of Korea
- /
- v.32 no.6
- /
- pp.509-517
- /
- 2013
In this paper, we researched a method that makes a good acoustic-speech system using a digital signal processing technique with dynamic microphone as a transducer. Good acoustic-speech system should deliver the original sound input to electric signal without distortion. By measuring the frequency response of the microphone, adjustment factors are obtained by comparing measured data and standard frequency response of microphone for each frequency band. The final sound levels are obtained using the developed adjustment factors of frequency responses from the microphone and speaker to match the original sound levels using the digital signal processing technique. Then, we minimize the changes in the frequency response and level due to the variation of the distance from source to microphone, where the frequency responses were measured according to the distance changes.
https://doi.org/10.7776/ASK.2013.32.6.509 인용 PDF KSCI

A New Analysis and a Reduction Method of Computational Complexity for the Lattice Transversal Joint (LTJ) Adaptive Filter (격자 트랜스버설 결합 (LTJ) 적응필터의 새로운 해석과 계산량 감소 방법)

유재하
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.5
- /
- pp.438-445
- /
- 2002
In this paper, the necessity of the filter coefficients compensation for the lattice transversal joint (LTJ) adaptive filter was explained in general and with ease by analyzing it with respect to the time-varying transform domain adaptive filter. And also the reduction method of computational complexity for filter coefficients compensation was proposed using the property that speech signal is stationary during a short time period and its effectiveness was verified through experiments using artificial and real speech signals. The proposed adaptive filter reduces the computational complexity for filter coefficients compensation by 95%, and when the filter is applied to the acoustic echo canceller with 1000 taps, the total complexity is reduced by 82%.
PDF KSCI

Application of deep learning for accurate source localization using sound intensity vector (음향인텐시티 벡터를 통해 정확한 음원 위치 추정을 위한 딥러닝 적용)

Iljoo Jeong;In-Jee Jung;Seungchul Lee
- The Journal of the Acoustical Society of Korea
- /
- v.43 no.1
- /
- pp.72-77
- /
- 2024
Recently, the necessity for sound source localization has grown significantly across various industrial sectors. Among the sound source localization methods, sound intensimetry has the advantage of having high accuracy even with a small microphone array. However, the increase in localization error at high Helmholtz numbers have been pointed out as a limitation of this method. The study proposes a method to compensate for the bias error of the measured sound intensity vector according to the Helmholtz numbers by applying deep learning. The method makes it possible to estimate the accurate direction of arrival of the source by applying a dense layer-based deep learning model that derives compensated sound intensity vectors when inputting the sound intensity vectors measured by a tetrahedral microphone array for the Helmholtz numbers. The model is verified based on simulation data for all sound source directions with 0.1 < kd < 3.0. One can find that the deep learning-based approach expands the measurement frequency range when implementing the sound intensimetry-based sound source localization method, also one can make it applicable to various microphone array sizes.
https://doi.org/10.7776/ASK.2024.43.1.072 인용 PDF

Compensation Method for Improvement of Speech Recognition in Wireless Communication Network (무선 통신망에서 음성인식률 개선을 위한 보상기법 연구)

Seo Jin-Ho;Park Ho-Chong
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.65-68
- /
- 2004
이동통신 기술의 발전으로 이동통신 사용이 폭발적으로 증가하였고 그에 따라 이동통신망을 이용한 많은 서비스가 제공되고 있다. 이동통신망에서의 음성 인식 서비스에서 음성 인식기에 입력되는 음성신호는 통신망을 통해 음성 압축기를 거치게 되고 이에 음성신호가 왜곡되어 인식기의 인식성능이 저하된다. 본 논문에서는 무선통신 환경에서 음성인식기의 성능을 개선하기 위한 보상 방법을 제안한다. 기존의 제안된 방법은 음성 데이터에 의존하는 방법을 사용하나 본 논문에서는 음성 데이터와는 독립적 방법인 음성 압축기에 의해 손상된 입력 신호의 스펙트럼 보상방법과 Cepstrum 보정방법을 통해 인식률을 향상시키는 방법을 제안한다. 즉, 음성 압축기에 의하여 왜곡된 스펙트럼을 단계적 방법으로 보상하고 그를 토대로 왜곡된 신호에서 만들어진 Cepstrum을 보정하여 음성 인식기의 성능을 향상시키는 방법을 연구하였으며, 그 견과 손상된 음성신호의 인식률 $64.88\%$에 대하여, 본 논문에서 제안하는 보상 방법을 적용한 음성신호의 인식률은 $79.73\%$로서 $14.85\%$가 향상된 결과를 얻을 수 있었다.
PDF

Performance Comparision of Channel distortion Compensation Techniques in Keyword Spotting System over the Telephone Network (전화망을 통한 핵심어 검출 시스템에서의 채널왜곡 보상벙법의 성능비교)

이교혁
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1996.10a
- /
- pp.56-60
- /
- 1996
본 논문에서 핵심어 검출(Keyword spotting ) 시스템에서의 채널 왜곡에 대한 보상방법등의 성능을 비교하였다. 훈련을 음성과 인식실험용 음성은 서로 다른 환경에서 수집되었으며, 특별히 인식실험용 음성으로는 전화망을 통한 음성 데이터를 이용하였다. 전화망을 통한 음성인식에서는 채널왜곡과 부가잡음에 의해서 음성신호에 왜곡이 생기므로 이들에 대한 적적한 보상이 필요하다. 본 논문에서는 채널 왜곡보상을 위한 처리방법으로 널리 사용되고 있는 global cepstral mean substraction (GCMS), local cepstral mean subtraction(LCMS) 그리고 RASTA processing을 적용하였다. 그리고 인식성능의 개선을 위해 이들 방법을 likelihood ration scorning 에 의한 후처리 과정을 적용하였다. 인식실험결과 이들 방법 모두 채널왜곡 보상을 하지 않았을 경우와 비교하여 더 좋은 인식성능을 얻을 수 있었으며, 그 중 후처리를 적용한 LCMS 방법이 가장 우수한 성능을 나타내었다.
PDF

Search Result 174, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)