Search | Korea Science

A Spectral Compensation Method for Noise Robust Speech Recognition (잡음에 강인한 음성인식을 위한 스펙트럼 보상 방법)

Cho, Jung-Ho
- 전자공학회논문지 IE
- /
- v.49 no.2
- /
- pp.9-17
- /
- 2012
One of the problems on the application of the speech recognition system in the real world is the degradation of the performance by acoustical distortions. The most important source of acoustical distortion is the additive noise. This paper describes a spectral compensation technique based on a spectral peak enhancement scheme followed by an efficient noise subtraction scheme for noise robust speech recognition. The proposed methods emphasize the formant structure and compensate the spectral tilt of the speech spectrum while maintaining broad-bandwidth spectral components. The recognition experiments was conducted using noisy speech corrupted by white Gaussian noise, car noise, babble noise or subway noise. The new technique reduced the average error rate slightly under high SNR(Signal to Noise Ratio) environment, and significantly reduced the average error rate by 1/2 under low SNR(10 dB) environment when compared with the case of without spectral compensations.
PDF KSCI

A Study on the Revised Method using Normalized RGB Features in the Moving Object Detection by Background Subtraction (배경분리 방법에 의한 이동 물체 검출에서 개선된 색정보 정규화 기법에 관한 연구)

Park, Jong-Beom
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.12 no.6
- /
- pp.108-115
- /
- 2013
A developed skill of an intelligent CCTV is also advancing by using its Image Acquisition Device. In this field, area for technique can be divided into Foreground Subtraction which detects individuals and objects in a potential observing area and a tracing technology which figures out moving route of individuals and objects. In this thesis, an improved algorism for a settled engine development, which is stable to change in both noise and illumination for detecting moving objects is suggested. The proposed algorism from this thesis is focused on designing a stable and real time processing method which is perfect model in detecting individuals, animals, and also low-speeding transports and catching a change in an illumination and noise.
https://doi.org/10.12815/kits.2013.12.6.108 인용 PDF KSCI

Noise Reduction of Geomagnetic Signals From Randomly Oriented Sensors

Song, Yong J.;Lee, Choong S.;Kim, Ki C.;Lim, Sun-Ho;Kim, Duk-Yung;Son, Dong-Hwan;Kim, Dae Y.
- Journal of Magnetics
- /
- v.9 no.3
- /
- pp.69-74
- /
- 2004
A method of processing signals of unaligned geomagnetic sensors placed on the seabed is presented. The offset drifts of the fluxgate sensors are processed by polynomial fitting and the orientations of the sensor axes are found by minimizing the noise power using wavelet analysis. The noise power was reduced by 9.1 dB by processing the components of magnetic field separately using subtraction filter, polynomial fitting and wavelet analysis.
https://doi.org/10.4283/JMAG.2004.9.3.069 인용 PDF KSCI

Footstep Detection in Noisy Environment via Non-Linear Spectral Subtraction and Cross-Correlation (잡음 환경에서 비선형 주파수 차감 및 교차 상관을 이용한 사람 발자국 탐지 방안)

Kim, Tae-Bok;Ko, Hanseok
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.39C no.1
- /
- pp.60-69
- /
- 2014
Footstep detection using seismic sensors for security is a very meaningful task, but readings can easily fluctuate due to noise in outdoor environment. We propose NSSC method based on nonlinear spectral subtraction and cross-correlation using prime footstep model signal as a footstep signal refining process that enhances the signal-to-noise ratio (SNR) and attenuates noise. After de-noising, a detection event classification method is presented as further refining process to ensure that the detection result is a footstep. To validate the proposed algorithm, representative experiments including sunny and rainy-day cases are demonstrated.
https://doi.org/10.7840/kics.2014.39C.1.60 인용 PDF KSCI

Speech Enhancement Using Multiresolutional Signal Analysis Methods (다해상도 신호해석 방법을 이용한 음성개선)

Seok, Jong-Won;Han, Mi-Kyung;Bae, Keun-Sung
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.36S no.7
- /
- pp.134-135
- /
- 1999
This paper presents a speech enhancement method with spectral subtraction using wavelet, wavelet packet and cosine packet transforms which are known as multiresolutional signal analysis method. The performance of each method is compared with the conventional spectral subtraction method. Performance assessments based on average SNR, cepstral distance and informal subjective listening test are carried out. Experimental result demonstrate that cosine packet shows the best result in objective performance measure as well as subjective shows less musical noise than the conventional spectral subtraction method after removing the noise components.
PDF

The Effect of the Speech Enhancement Algorithm for Sensorineural Hearing Impaired Listeners

Kim, Dong-Wook;Lee, Young-Woo;Lee, Jong-Shill;Chee, Young-Joon;Lee, Sang-Min;Kim, In-Young;Kim, Sun-I.
- Journal of Biomedical Engineering Research
- /
- v.28 no.6
- /
- pp.732-743
- /
- 2007
Background noise is one of the major complaints of not only hearing impaired persons but also normal listeners. This paper describes the results of two experiments in which speech recognition performance was determined for listeners with normal hearing and sensorineural hearing loss in noise environment. First, we compared speech enhancement algorithms by evaluation speech recognition ability in various speech-to-noise ratios and types of noise. Next, speech enhancement algorithms by reducing background noise were presented and evaluated to improve speech intelligibility for sensorineural hearing impairment listeners. We tested three noise reduction methods using single-microphone, such as spectrum subtraction and companding, Wiener filter method, and maximum likelihood envelop estimation. Their responses in background noise were investigated and compared with those by the speech enhancement algorithm that presented in this paper. The methods improved speech recognition test score for the sensorineural hearing impaired listeners, but not for normal listeners. The results suggest the speech enhancement algorithm with the loudness compression can improve speech intelligibility for listeners with sensorineural hearing loss.
https://doi.org/10.9718/JBER.2007.28.6.732 인용 PDF KSCI

Speech Recognition in Noisy Environments using the NOise Spectrum Estimation based on the Histogram Technique (히스토그램 처리방법에 의한 잡음 스펙트럼 추정을 이용한 잡음환경에서의 음성인식)

Kwon, Young-Uk;Kim, Hyung-Soon
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.5
- /
- pp.68-75
- /
- 1997
Spectral subtraction is widely-used preprocessing technique for speech recognition in additive noise environments, but it requires a good estimate of the noise power spectrum. In this paper, we employ the histogram technique for the estimation of noise spectrum. This technique has advantages over other noise estimation methods in that it does not requires speech/non-speech detection and can estimate slowly-varying noise spectra. According to the speaker-independent isolated word recognition in both colored Gaussian and car noise environments under various SNR conditions. Histogram-technique-based spectral subtraction method yields superier performance to the one with conventional noise estimation method using the spectral average of initial frames during non-speech period.
PDF

Adaptive Threshold for Speech Enhancement in Nonstationary Noisy Environments (비정상 잡음환경에서 음질향상을 위한 적응 임계 치 알고리즘)

Lee, Soo-Jeong;Kim, Sun-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.7
- /
- pp.386-393
- /
- 2008
This paper proposes a new approach for speech enhancement in highly nonstationary noisy environments. The spectral subtraction (SS) is a well known technique for speech enhancement in stationary noisy environments. However, in real world, noise is mostly nonstationary. The proposed method uses an auto control parameter for an adaptive threshold to work well in highly nonstationary noisy environments. Especially, the auto control parameter is affected by a linear function associated with an a posteriori signal to noise ratio (SNR) according to the increase or the decrease of the noise level. The proposed algorithm is combined with spectral subtraction (SS) using a hangover scheme (HO) for speech enhancement. The performances of the proposed method are evaluated ITU-T P.835 signal distortion (SIG) and the segment signal to-noise ratio (SNR) in various and highly nonstationary noisy environments and is superior to that of conventional spectral subtraction (SS) using a hangover (HO) and SS using a minimum statistics (MS) methods.
https://doi.org/10.7776/ASK.2008.27.7.386 인용 PDF KSCI

SFMOG : Super Fast MOG Based Background Subtraction Algorithm (SFMOG : 초고속 MOG 기반 배경 제거 알고리즘)

Song, Seok-bin;Kim, Jin-Heon
- Journal of IKEEE
- /
- v.23 no.4
- /
- pp.1415-1422
- /
- 2019
Background subtraction is the major task of computer vision and image processing to detect changes in video. The best performing background subtraction is computationally expensive that cannot be used in real time in a typical computing environment. The proposed algorithm improves the background subtraction algorithm of the widely used MOG with the image resizing algorithm. The proposed image resizing algorithm is designed to drastically reduce the amount of computation and to utilize local information, which is robust against noise such as camera movement. Experimental results of the proposed algorithm have a classification capability that is close to the state of the art background subtraction method and the processing speed is more than 10 times faster.
https://doi.org/10.7471/ikeee.2019.23.4.1415 인용 PDF KSCI

An Improvement of Stochastic Feature Extraction for Robust Speech Recognition (강인한 음성인식을 위한 통계적 특징벡터 추출방법의 개선)

김회린;고진석
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.2
- /
- pp.180-186
- /
- 2004
The presence of noise in speech signals degrades the performance of recognition systems in which there are mismatches between the training and test environments. To make a speech recognizer robust, it is necessary to compensate these mismatches. In this paper, we studied about an improvement of stochastic feature extraction based on band-SNR for robust speech recognition. At first, we proposed a modified version of the multi-band spectral subtraction (MSS) method which adjusts the subtraction level of noise spectrum according to band-SNR. In the proposed method referred as M-MSS, a noise normalization factor was newly introduced to finely control the over-estimation factor depending on the band-SNR. Also, we modified the architecture of the stochastic feature extraction (SFE) method. We could get a better performance when the spectral subtraction was applied in the power spectrum domain than in the mel-scale domain. This method is denoted as M-SFE. Last, we applied the M-MSS method to the modified stochastic feature extraction structure, which is denoted as the MMSS-MSFE method. The proposed methods were evaluated on isolated word recognition under various noise environments. The average error rates of the M-MSS, M-SFE, and MMSS-MSFE methods over the ordinary spectral subtraction (SS) method were reduced by 18.6%, 15.1%, and 33.9%, respectively. From these results, we can conclude that the proposed methods provide good candidates for robust feature extraction in the noisy speech recognition.
PDF KSCI

Search Result 154, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)