Search | Korea Science

Hands-free Speech Recognition based on Echo Canceller and MAP Estimation (에코제거기와 MAP 추정에 기초한 핸즈프리 음성 인식)

Sung-ill Kim;Wee-jae Shin
- Journal of the Institute of Convergence Signal Processing
- /
- v.4 no.3
- /
- pp.15-20
- /
- 2003
For some applications such as teleconference or telecommunication systems using a distant-talking hands-free microphone, the near-end speech signals to be transmitted is disturbed by an ambient noise and by an echo which is due to the coupling between the microphone and the loudspeaker. Furthermore, the environmental noise including channel distortion or additive noise is assumed to affect the original input speech. In the present paper, a new approach using echo canceller and maximum a posteriori(MAP) estimation is introduced to improve the accuracy of hands-free speech recognition. In this approach, it was shown that the proposed system was effective for hands-free speech recognition in ambient noise environment including echo. The experimental results also showed that the combination system between echo canceller and MAP environmental adaptation technique were well adapted to echo and noise environment.
PDF

A Study on the Design of Integrated Speech Enhancement System for Hands-Free Mobile Radiotelephony in a Car

Park, Kyu-Sik;Oh, Sang-Hun
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.2E
- /
- pp.45-52
- /
- 1999
This paper presents the integrated speech enhancement system for hands-free mobile communication. The proposed integrated system incorporates both acoustic echo cancellation and engine noise reduction device to provide signal enhancement of desired speech signal from the echoed plus noisy environments. To implement the system, a delayless subband adaptive structure is used for acoustic echo cancellation operation. The NLMS based adaptive noise canceller then applied to the residual echo removed noisy signal to achieve the selective engine noise attenuation in dominant frequency component. Two sets of computer simulations are conducted to demonstrate the effectiveness of the system; one for the fixed acoustical environment condition, the other for the robustness of the system in which, more realistic situation, the acoustic transmission environment change. Simulation results confirm the system performance of 20-25dB ERLE in acoustic echo cancellation and 9-19 dB engine noise attenuation in dominant frequency component for both cases.
PDF

A Design of Multi-channel Speech Pickup Embedded System for Hands-free Comuunication (핸즈프리 통신을 위한 다중채널 음성픽업 임베디드 시스템 설계)

Ju, Hyng-Jun;Park, Chan-Sub;Jeon, Jae-Kuk;Kim, Ki-Man
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.11 no.2
- /
- pp.366-373
- /
- 2007
In this paper we propose a multi-channel speech pickup system for calling quality enhancement of hands-free communication using ALTERA Nios-II processor. Multi-channel speech pickup system uses Delay-and-Sum beamformer with zero-padding interpolator. This paper implements speech pickup system using the Nios-II processor with real-time I/O data processing speed. The proposes speech pickup embedded system shows a good agreement with those of computer simulation(MATLAB) and conventional DSP processor(TMS320C6711) result. The proposed method is effective more than previous methods in cost and design processing time. As a result, LE(Logic Element) of hardware used 3,649/5,980(61%) on a chip.
https://doi.org/10.6109/jkiice.2007.11.2.366 인용 PDF KSCI

A User friendly Remote Speech Input Unit in Spontaneous Speech Translation System

Lee, Kwang-Seok;Kim, Heung-Jun;Song, Jin-Kook;Choo, Yeon-Gyu
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2008.05a
- /
- pp.784-788
- /
- 2008
In this research, we propose a remote speech input unit, a new method of user-friendly speech input in speech recognition system. We focused the user friendliness on hands-free and microphone independence in speech recognition applications. Our module adopts two algorithms, the automatic speech detection and speech enhancement based on the microphone array-based beamforming method. In the performance evaluation of speech detection, within-200msec accuracy with respect to the manually detected positions is about 97percent under the noise environments of 25dB of the SNR. The microphone array-based speech enhancement using the delay-and-sum beamforming algorithm shows about 6dB of maximum SNR gain over a single microphone and more than 12% of error reduction rate in speech recognition.
PDF

Two-Microphone Generalized Sidelobe Canceller with Post-Filter Based Speech Enhancement in Composite Noise

Park, Jinsoo;Kim, Wooil;Han, David K.;Ko, Hanseok
- ETRI Journal
- /
- v.38 no.2
- /
- pp.366-375
- /
- 2016
This paper describes an algorithm to suppress composite noise in a two-microphone speech enhancement system for robust hands-free speech communication. The proposed algorithm has four stages. The first stage estimates the power spectral density of the residual stationary noise, which is based on the detection of nonstationary signal-dominant time-frequency bins (TFBs) at the generalized sidelobe canceller output. Second, speech-dominant TFBs are identified among the previously detected nonstationary signal-dominant TFBs, and power spectral densities of speech and residual nonstationary noise are estimated. In the final stage, the bin-wise output signal-to-noise ratio is obtained with these power estimates and a Wiener post-filter is constructed to attenuate the residual noise. Compared to the conventional beamforming and post-filter algorithms, the proposed speech enhancement algorithm shows significant performance improvement in terms of perceptual evaluation of speech quality.
https://doi.org/10.4218/etrij.16.0115.0472 인용 PDF KSCI

The Wavelet Transform Based Subband Adaptive Acoustic Echo Canceller with Noise Cancellation Property (잡음제거 특성을 갖는 웨이브릿변환 기반 서브밴드 적응 음향반향제거기)

박재우;안주원;권기룡;문광석;김강언
- Proceedings of the IEEK Conference
- /
- 2000.09a
- /
- pp.7-10
- /
- 2000
This paper focuses on the development of speech enhancement techniques for hands-free audio terminals, including two major problems : noise cancellation and acoustic echo cancellation. The objective is to find a joint structure to get a near-end speech signal with minimum distortion and low levels of echo and noise. To solve the two problems, a new promising technique is studied and tested in computer simulation conditions.
PDF

A User-friendly Remote Speech Input Method in Spontaneous Speech Recognition System

Suh, Young-Joo;Park, Jun;Lee, Young-Jik
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.2E
- /
- pp.38-46
- /
- 1998
In this paper, we propose a remote speech input device, a new method of user-friendly speech input in spontaneous speech recognition system. We focus the user friendliness on hands-free and microphone independence in speech recognition applications. Our method adopts two algorithms, the automatic speech detection and the microphone array delay-and-sum beamforming (DSBF)-based speech enhancement. The automatic speech detection algorithm is composed of two stages; the detection of speech and nonspeech using the pitch information for the detected speech portion candidate. The DSBF algorithm adopts the time domain cross-correlation method as its time delay estimation. In the performance evaluation, the speech detection algorithm shows within-200 ms start point accuracy of 93%, 99% under 15dB, 20dB, and 25dB signal-to-noise ratio (SNR) environments, respectively and those for the end point are 72%, 89%, and 93% for the corresponding environments, respectively. The classification of speech and nonspeech for the start point detected region of input signal is performed by the pitch information-base method. The percentages of correct classification for speech and nonspeech input are 99% and 90%, respectively. The eight microphone array-based speech enhancement using the DSBF algorithm shows the maximum SNR gaing of 6dB over a single microphone and the error reductin of more than 15% in the spontaneous speech recognition domain.
PDF

A Residual Echo and Noise Reduction Scheme with Linear Prediction for Hands-Free Telephony (핸즈프리 전화기를 위한 선형 예측기를 이용한 잔여반향 및 잡음 제거 구조)

Hwang, Kyung-Rok;Son, Kyung-Sik;Kim, Hyun-Tae
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.5
- /
- pp.454-460
- /
- 2009
In this paper, we propose a residual echo and noise reduction scheme by using linear predictor for hands-free telephony applications. The proposed scheme whitens residual echo by the linear prediction during the non double-talk. But whitened residual echo signal still has speech characteristics. In this scheme, the whitened residual echo signal is more whitened by using the power of the linear prediction error signal and the linear predicted signal. After whitening process, near-end speech and ambient noise is present during double-talk but white noise will appear during non double-talk situation. By linearly predicting again the combined signal of the near-end speech and the whitened signal, the ambient noise is removed. Through computer simulation, it is shown that the proposed method performs well at the side of AIC (acoustic interference cancellation).
https://doi.org/10.7776/ASK.2009.28.5.454 인용 PDF KSCI

Performance Improvement in Distant-Talking Speech Recognition by an Integration of N-best results using Naive Bayesian Network (다채널 마이크 환경에서 Naive Bayesian Network의 Decision에 의한 음성인식 성능향상)

Ji, Mi-kyong;Kim, Hoi-Rin
- Proceedings of the KSPS conference
- /
- 2005.11a
- /
- pp.151-154
- /
- 2005
원거리 음성인식에서 인식률의 성능향상을 위해 필수적인 다채널 마이크 환경에서 방 안의 도처에 분산되어있는 원거리 마이크를 사용하여 TV, 조명 등의 주변 환경을 음성으로 제어하고자 한다. 이를 위해 각 채널의 인식결과를 통합하여 최적의 결과를 얻고자 채널의N-best 결과와 N-best 결과에 포함된 hypothesis의 frame-normalized likelihood 값을 사용하여 Bayesian network을 훈련하고 인식결과를 통합하여 최선의 결과를 decision 하는데 사용함으로써 원거리 음성인식의 성능을 향상시키고 또한 hands-free 응용을 현실화하기위한 방향을 제시한다.
PDF

Speech Feature based Double-talk Detector for Acoustic Echo Cancellation (반향제거를 위한 음성특징 기반의 동시통화 검출 기법)

Park, Jun-Eun;Lee, Yoon-Jae;Kim, Ki-Hyeon;Ko, Han-Seok
- Journal of IKEEE
- /
- v.13 no.2
- /
- pp.132-139
- /
- 2009
In this paper, a speech feature based double-talk detector method is proposed for an acoustic echo cancellation in hands-free communication system. The double-talk detector is an important element, since it controls the update of the adaptive filter for an acoustic echo cancellation. In previous research, the double talk detector is considered in the signal processing stage without taking the speech characteristics into account. However, in the proposed method, speech features which are used for the speech recognition is used for the discriminative features between the far-end and near-end speech. We obtained a substantial improvement over the previous double-talk detector methods using the only signal in time domain.
PDF

Search Result 17, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)