Search | Korea Science

배경잡음 하에서의 신경회로망에 의한 남성화자 및 여성화자의 성별인식 알고리즘

Choe, Jae-Seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2013.05a
- /
- pp.515-517
- /
- 2013
본 논문에서는 잡음 환경 하에서 남녀 성별인식이 가능한 신경회로망에 의한 화자종속 음성인식 알고리즘을 제안한다. 본 논문에서 제안한 음성인식 알고리즘은 남성화자 및 여성화자를 인식하기 위하여 LPC 켑스트럼 계수를 사용하여 신경회로망에 의하여 학습된다. 본 실험에서는 백색잡음 및 자동차잡음에 대하여 신경회로망의 네크워크에 대한 인식결과를 나타낸다. 인식실험의 결과로부터 백색잡음에 대해서는 최대 96% 이상의 인식률, 자동차잡음에 대해서는 최대 88% 이상의 인식률을 구하였다.
PDF

Speaker-dependent Speech Recognition Algorithm for Male and Female Classification (남녀성별 분류를 위한 화자종속 음성인식 알고리즘)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.4
- /
- pp.775-780
- /
- 2013
This paper proposes a speaker-dependent speech recognition algorithm which can classify the gender for male and female speakers in white noise and car noise, using a neural network. The proposed speech recognition algorithm is trained by the neural network to recognize the gender for male and female speakers, using LPC (Linear Predictive Coding) cepstrum coefficients. In the experiment results, the maximal improvement of total speech recognition rate is 96% for white noise and 88% for car noise, respectively, after trained a total of six neural networks. Finally, the proposed speech recognition algorithm is compared with the results of a conventional speech recognition algorithm in the background noisy environment.
https://doi.org/10.6109/jkiice.2013.17.4.775 인용 PDF KSCI

Improvement of Signal-to-Noise Ratio for Speech under Noisy Environment (잡음환경 하에서의 음성의 SNR 개선)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.7
- /
- pp.1571-1576
- /
- 2013
This paper proposes an improvement algorithm of signal-to-noise ratios (SNRs) for speech signals under noisy environments. The proposed algorithm first estimates the SNRs in a low SNR, mid SNR and high SNR areas, in order to improve the SNRs in the speech signal from background noise, such as white noise and car noise. Thereafter, this algorithm subtracts the noise signal from the noisy speech signal at each bands using a spectrum sharpening method. In the experiment, good signal-to-noise ratios (SNR) are obtained for white noise and car noise compared with a conventional spectral subtraction method. From the experiment results, the maximal improvement in the output SNR results was approximately 4.2 dB and 3.7 dB better for white noise and car noise compared with the results of the spectral subtraction method, in the background noisy environment, respectively.
https://doi.org/10.6109/jkiice.2013.17.7.1571 인용 PDF KSCI

Reduction of Background Noise using FFT cepstrum (FFT 켑스트럼을 사용한 배경잡음의 제거)

Choi, Jae-Seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2010.10a
- /
- pp.264-267
- /
- 2010
본 논문에서는 오차역전파 학습 알고리즘을 사용하여 신경회로망을 학습시켜, 각 프레임에서의 음성 및 잡음 구간의 검출에 의한 음성인식 알고리즘을 제안한다. 그리고 신경회로망에 의하여 음성 및 잡음 구간의 검출에 따라서 각 프레임에서 잡음을 제거하는 스펙트럼 차감법을 제안한다. 본 실험에서는 원음성에 백색잡음 및 자동차잡음을 부가하여 음성인식의 인식율을 평가한다. 또한 인식시스템에 의하여 검출된 음성 및 잡음 구간을 이용하여 각 프레임에서의 스펙트럼 차감법에 의한 잡음제거의 실험결과를 나타낸다.
PDF

A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Whitening Transformation (유색 잡음에 오염된 음성의 향상을 위한 백색 변환을 이용한 일반화 부공간 접근)

Lee, Jeong-Wook;Son, Kyung-Sik;Park, Jang-Sik;Kim, Hyun-Tae
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.15 no.8
- /
- pp.1665-1674
- /
- 2011
In this paper, we proposed an algorithm for speech enhancement of speeches corrupted by colored noise. When there is no correlation between colored noise and speech signal, the colored noise turns into white noise through whitening transformation. This transformed signal has been applied to the generalized subspace approach for speech enhancement. The speech spectral distortion, produced by the whitening transformation as pre-processing, has been restored by using the inverse whitening transformation as post-processing of the proposed algorithm. The performance of the proposed algorithm for speech enhancement has been confirmed by computer simulation. The colored noises used in this experiment were car noise and multi-talker babble. It is confirmed that the proposed algorithm shows better performance from SNR and SSD viewpoint over the previous approach with the data from the AURORA and TIMIT data base.
https://doi.org/10.6109/jkiice.2011.15.8.1665 인용 PDF KSCI

Estimation method of noise intensity by neural network for application in speech enhancement (음성강조에의 응용을 위한 신경회로망에 의한 잡음량의 추정법)

Choi Jae-Seung
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.42 no.3 s.303
- /
- pp.129-136
- /
- 2005
To reduce the noise in the noisy speech, it is desirable to change the parameters of the speech processing system according to the noise intensity to reproduce a good quality speech. This paper proposes an estimation method of noise intensity using a three layered neural network, which is able to learn the three graded speeches that is degraded by white noise or road noise. Experimental results demonstrate that the noise intensity could be estimated by the neural network. Even if the speakers and speech data are different from the training data, estimation rates for the noise intensity can be estimated by the neural network with an average accuracy of $95\%$ or more for white noise.
PDF KSCI

Speech Enhancement System Using a Model of Auditory Mechanism (청각기강의 모델을 이용한 음성강조 시스템)

최재승
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.295-302
- /
- 2004
On the field of speech processing the treatment of noise is still important problems for speech research. Especially, it has been noticed that the background noise causes remarkable reduction of speech recognition ratio. As the examples of the background noise, there are such various non-stationary noises existing in the real environment as driving noise of automobiles on the road or typing noise of printer. The treatment for these kinds of noises is not so simple as could be eliminated by the former Wiener filter, but needs more skillful techniques. In this paper as one of these trials, we show an algorithm which is a speech enhancement method using a model of mutual inhibition for noise reduction in speech which is contaminated by white noise or background noise mentioned above. It is confirmed that the proposed algorithm is effective for the speech degraded not only by white noise but also by colored noise, judging from the spectral distortion measurement.
PDF KSCI

Reduction of Environmental Background Noise using Speech and Noise Recognition (음성 및 잡음 인식 알고리즘을 이용한 환경 배경잡음의 제거)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.15 no.4
- /
- pp.817-822
- /
- 2011
This paper first proposes the speech recognition algorithm by detection of the speech and noise sections at each frame using a neural network training by back-propagation algorithm, then proposes the spectral subtraction method which removes the noises at each frame according to detection of the speech and noise sections. In this experiment, the performance of the proposed recognition system was evaluated based on the recognition rate using various speeches that are degraded by white noise and car noise. Moreover, experimental results of the noise reduction by the spectral subtraction method demonstrate using the speech and noise sections detecting by the speech recognition algorithm at each frame. Based on measuring signal-to-noise ratio, experiments confirm that the proposed algorithm is effective for the speech by corrupted the noise using signal-to-noise ratio.
https://doi.org/10.6109/jkiice.2011.15.4.817 인용 PDF KSCI

Signal Processing for Speech Recognition in Noisy Environment (잡음 환경에서 음성 인식을 위한 신호처리)

Kim, Weon-Goo;Lim, Yong-Hoon;Cha, Il-Whan;Youn, Dae-Hee
- The Journal of the Acoustical Society of Korea
- /
- v.11 no.2
- /
- pp.73-84
- /
- 1992
This paper studies noise subtraction methods and distance measures for speech recognition in a noisy environment, and investigates noise robustness of the distance measures applied to the problem of isolated word recognition in white Gaussian and colored noise (vehicle noise) environments. Noise subtraction methods which can be used as a pre-processor for the speech recognition system, such as the spectral subtraction method, autocorrelation subtraction method, adaptive noise cancellation and acoustic beamforming are studied, and distance measures such and Log Likelihood Ratio ($d_{LLR}$), cepstral distance measure ($d_{CEP}$), weighted cepstral distance measure ($d_{WCEP}$), spectral slope distance measure ($d_{RPS}$) and cepstral projection distance measure ($d_{CP},\;d_{BCP},\;d_{WCP},\;d_{BWCP}$) are also investigated. Testing of the distance measures for speaker-dependent isolated word recognition in a noisy environment indicate that $d_{RPS}\;and\;d_{WCEP}$ which weigh higher order cepstral coefficients more heavily give considerable performance improvement over $d_{CEP}and\;d_{LLR}$. In addition, when no pre-emphasis is performed, the recognizer can maintain higher performance under high noise conditions.
PDF

Speech and Noise Recognition System by Neural Network (신경회로망에 의한 음성 및 잡음 인식 시스템)

Choi, Jae-Sung
- The Journal of the Korea institute of electronic communication sciences
- /
- v.5 no.4
- /
- pp.357-362
- /
- 2010
This paper proposes the speech and noise recognition system by using a neural network in order to detect the speech and noise sections at each frame. The proposed neural network consists of a layered neural network training by back-propagation algorithm. First, a power spectrum obtained by fast Fourier transform and linear predictive coefficients are used as the input to the neural network for each frame, then the neural network is trained using these power spectrum and linear predictive coefficients. Therefore, the proposed neural network can train using clean speech and noise. The performance of the proposed recognition system was evaluated based on the recognition rate using various speeches and white, printer, road, and car noises. In this experiment, the recognition rates were 92% or more for such speech and noise when training data and evaluation data were the different.
PDF KSCI

Search Result 17, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)