Search | Korea Science

Speech Recognition in Noise Environment by Independent Component Analysis and Spectral Enhancement (독립 성분 분석과 스펙트럼 향상에 의한 잡음 환경에서의 음성인식)

Choi Seung-Ho
- MALSORI / no.48 / pp.81-91 / 2003
In this paper, we propose a speech recognition method based on independent component analysis (ICA) and spectral enhancement techniques. While ICA tries to separate the speech signal from noisy speech using multiple channels, some noise remains because of its algorithmic limitations. Spectral enhancement techniques can compensate for the limits of ICA's signal separation ability. Through speech recognition experiments in instantaneous and convolved mixing environments, we show that the proposed approach yields much higher recognition accuracy than conventional methods.

Speech Estimators Based on Generalized Gamma Distribution and Spectral Gain Floor Applied to an Automatic Speech Recognition (잡음에 강인한 음성인식을 위한 Generalized Gamma 분포기반과 Spectral Gain Floor를 결합한 음성향상기법)

Kim, Hyoung-Gook;Shin, Dong;Lee, Jin-Ho
- The Journal of The Korea Institute of Intelligent Transport Systems / v.8 no.3 / pp.64-70 / 2009
This paper presents a speech enhancement technique based on the generalized Gamma distribution in order to obtain robust speech recognition performance. For robust speech enhancement, noise estimation based on recursive averaging of spectral values, controlled by a spectral noise floor, is applied to speech estimation under the generalized Gamma distribution and a spectral gain floor. The proposed speech enhancement technique is based on spectral component, spectral amplitude, and log spectral amplitude. The performance of the three methods is measured by the recognition accuracy of automatic speech recognition (ASR).

Speech enhancement using psychoacoustics model (사이코어쿠스틱스 모델을 이용한 음성 향상)

Kwon, Chul-Hyun;Shin, Dae-Kyu;Park, Sang-Hui
- Proceedings of the KIEE Conference / 1999.11c / pp.748-750 / 1999
In this study, a speech enhancement method is presented that is based on a well-known auditory mechanism, noise masking. The approach adopted here is to derive a modifier that achieves audible noise suppression. This modification selectively affects the perceptually significant spectral values; it is therefore less prone to introducing unwanted distortions than methods that affect the complete STSA, and it produces better-enhanced results at low SNR as well as at high SNR. The method needs an exact estimate of the minimum spectral value per critical band because it uses only that value. For this purpose, it uses a modified spectral subtraction that is more flexible than power spectral subtraction. As a result, the experiments showed a better SNR than before.

A single-channel speech enhancement method based on restoration of both spectral amplitudes and phases for push-to-talk communication (Push-to-talk 통신을 위한 진폭 및 위상 복원 기반의 단일 채널 음성 향상 방식)

Cho, Hye-Seung;Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea / v.36 no.1 / pp.64-69 / 2017
In this paper, we propose a single-channel speech enhancement method based on restoration of both spectral amplitudes and phases for PTT (Push-To-Talk) communication. The proposed method combines spectral amplitude and phase enhancement to provide high-quality speech, unlike other single-channel speech enhancement methods, which use only spectral amplitudes. We carried out side-by-side comparison experiments in various non-stationary noise environments to evaluate the performance of the proposed method. The experimental results show that the proposed method provides higher-quality speech than other methods under different noise conditions.
https://doi.org/10.7776/ASK.2017.36.1.064

Critical Banded Wavelet Packet-Based Spectral Subtractions for Speech Enhancement (음성신호개선을 위한 임계대역 웨이블렛 패킷 기반의 스펙트럼 차감법)

Chang, Sung-Wook;Yang, Sung-Il
- The Journal of the Acoustical Society of Korea / v.23 no.4E / pp.125-133 / 2004
In this paper, we propose a critical banded wavelet packet-based spectral subtraction for speech enhancement. Critical banded wavelet packet, which reflects the human auditory system, may lead to minimization of intelligibility loss and quality improvement of the enhanced speech in the spectral domain, when combined with an appropriate spectral subtraction gain function. The proposed method shows better performance than the conventional one in comparative assessments. We also show that, for effective evaluation of enhanced speech, it is essential to consider the characteristics of speech quality measures.

Simultaneous Spectral Resolution and Sensitivity Enhancement in MR spectrum: Maximum Likelihood Deconvolution Reconstruction

Jeong, Gwang-Woo;Jeong, Jenny Eunice;Kang, Heoung-Keun
- Journal of the Korean Magnetic Resonance Society / v.15 no.2 / pp.157-174 / 2011
Although the use of apodization functions in postprocessing a 2D NMR spectrum improves spectral quality, there is usually a trade-off between resolution enhancement and noise suppression due to a classical "uncertainty principle." In this study, therefore, a mathematical deconvolution technique called "Maximum Likelihood Deconvolution (MLD)" was adopted to achieve spectral resolution and sensitivity enhancement simultaneously. The MLD technique greatly facilitates visualization and restoration of the genuine spectral information from complex 2D NMR spectra, which would be problematic with conventional apodization/FT processing. In particular, applying MLD to the 2D-NOE spectrum would be very useful for deriving the important proton connectivities, which are essential for elucidating the 3D molecular structure.
https://doi.org/10.6564/JKMRS.2011.15.2.157

Two-Microphone Generalized Sidelobe Canceller with Post-Filter Based Speech Enhancement in Composite Noise

Park, Jinsoo;Kim, Wooil;Han, David K.;Ko, Hanseok
- ETRI Journal / v.38 no.2 / pp.366-375 / 2016
This paper describes an algorithm to suppress composite noise in a two-microphone speech enhancement system for robust hands-free speech communication. The proposed algorithm has four stages. The first stage estimates the power spectral density of the residual stationary noise, which is based on the detection of nonstationary signal-dominant time-frequency bins (TFBs) at the generalized sidelobe canceller output. Second, speech-dominant TFBs are identified among the previously detected nonstationary signal-dominant TFBs, and power spectral densities of speech and residual nonstationary noise are estimated. In the final stage, the bin-wise output signal-to-noise ratio is obtained with these power estimates and a Wiener post-filter is constructed to attenuate the residual noise. Compared to the conventional beamforming and post-filter algorithms, the proposed speech enhancement algorithm shows significant performance improvement in terms of perceptual evaluation of speech quality.
https://doi.org/10.4218/etrij.16.0115.0472

Speech Recognition through Speech Enhancement (음질 개선을 통한 음성의 인식)

Cho, Jun-Hee;Lee, Kee-Seong
- Proceedings of the KIEE Conference / 2003.11c / pp.511-514 / 2003
Humans use speech signals to exchange information. When background noise is present, speech recognizers suffer performance degradation. This work studied speech recognition through speech enhancement in noisy environments. A histogram method was introduced as a reliable noise estimation approach for spectral subtraction, using the MFCC method. The experimental results show the effectiveness of the proposed algorithm.

Performance Enhancement of Speech Communication System using Reverberation Rejection (잔향제거를 이용한 음성통신 시스템 성능 향상)

Kim, Se-Young;Kang, Suk-Youb;Kim, Ki-Man
- Journal of the Korea Institute of Information and Communication Engineering / v.13 no.10 / pp.2211-2217 / 2009
In this paper, we propose a speech enhancement algorithm that uses a single microphone in a reverberant room environment. Spectral subtraction is an effective method for reducing the reverberation component and the noise in the spectral domain. Because spectral subtraction requires correct separation of voiced and silent sections, voice activity detection (VAD) based on entropy is applied in the proposed method to improve performance. We test the performance of the proposed method by comparing it with a conventional method that uses VAD based on energy detection. The reverberation reduction ratio, as a function of SNR and reverberation time, is used as the test index. The simulation results show that the proposed method performs better than the conventional method.
https://doi.org/10.6109/JKIICE.2009.13.10.2211

Spectral Resolution Enhancement of Acousto-Optic Tunable Filter(AOTF) using Incident Angle Variation (음향광학변조필터의 입사각 변화를 이용한 분해능 향상 방법)

You, Jang-Woo;Ahn, Jeong-Ho;Kim, Dae-suk;Kwak, Yoon-Keun;Kim, Soo-Hyun;Lee, Yun-Woo;Whang, In-Duk
- Proceedings of the KSME Conference / 2004.11a / pp.607-612 / 2004
A method for enhancing the spectral resolution of an Acousto-Optic Tunable Filter (AOTF) by varying the incident light angle is described. An AOTF is a small, mechanically rigid, high-speed tunable light filter with high spectral resolution. The basic theory of the AOTF and its experimental verification are described. An AOTF can simultaneously generate two oppositely polarized light beams whose wavelengths can be changed by varying the incident angle. We focused on the common region of the two filtered beams at a specific incident angle. This region can be used to enhance the spectral resolution of the AOTF.

Search Result 208, Processing Time 0.02 seconds