Search | Korea Science

Evaluation for speech signal based on human sense and signal quality

Mekada, Yoshito;Hasegawa, Hiroshi;Kumagai, Takeshi;Kasuga, Masao
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 1997.06a
- /
- pp.13-18
- /
- 1997
Each reproducing speech signal has each particular signal property, because of the processing of encoding and decoding for communications through various media. In this paper, we examine the correlation between speech signal quality and sensory pleasure for the sensory improvement of that signal. In experiments, we evaluate the quality of speech signals through various media by psychological auditory test and physical features of these signals.
PDF

Voiced/Unvoiced/Silence Classification of Speech Signal by Level Crossing and DPCM (Level Crossing과 DPCM을 사용한 유성음/무성음/묵음의 분류)

Kim, Jin-Young;Sung, Koeng-Mo
- Proceedings of the KIEE Conference
- /
- 1987.07b
- /
- pp.1615-1618
- /
- 1987
This paper proposes new algorithm for classifying speech signal frame into voiced, unvoiced, silence frame, using the parameters extracted from time domain behavior of speech signal The prameters used in this paper are absolute magnitude, the sum of peaks lager than reference level (T-peak), the ratio of T-peak to absolute magnitude and the magnitude of signal outputs of DPCM. Using this parameters, speech signal is more easily classified into voiced/ unvoiced/silence frame.
PDF

Speech Recognition in Noise Environment by Independent Component Analysis and Spectral Enhancement (독립 성분 분석과 스펙트럼 향상에 의한 잡음 환경에서의 음성인식)

Choi Seung-Ho
- MALSORI
- /
- no.48
- /
- pp.81-91
- /
- 2003
In this paper, we propose a speech recognition method based on independent component analysis (ICA) and spectral enhancement techniques. While ICA tris to separate speech signal from noisy speech using multiple channels, some noise remains by its algorithmic limitations. Spectral enhancement techniques can compensate for lack of ICA's signal separation ability. From the speech recognition experiments with instantaneous and convolved mixing environments, we show that the proposed approach gives much improved recognition accuracies than conventional methods.
PDF

A Performance of a Remote Speech Input Unit in Speech Recognition System (음성인식 시스템에서의 원격 음성입력기의 성능평가)

Lee, Gwang-seok
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2009.10a
- /
- pp.723-726
- /
- 2009
In this research, We simulated performances of error reduction algorithm for the speech signal based on the microphone array-based beamforming method in speech recognition system and analyzed its performance. Also, we processed speech signal adopted from microphone array and maximum signal to noise ratio for each channel, and then compared them with signal to noise ratio of speech signal. Speech recognition rate is improved from 54.2% to 61.4% in case 1 and is improved from 41.2% to 50.5% in case 2 of the lower signal to noise ratio. Therefore the average reduction rates are showed 15.7% in case 1.
PDF

A STUDY ON THE SPEECH SYNTHESIS-BY-RULE SYSTEM APPLIED MULTIBAND EXCITATION SIGNAL

Kyung, Younjeong;Kim, Geesoon;Lee, Hwangsoo;Lee, Yanghee
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06a
- /
- pp.1098-1103
- /
- 1994
In this paper, we design and implement the Korean speech synthesis by rule system. This system is applied the multiband excitation signal on voiced sounds. The multiband excitation signal is obtained by mixing impluse spectrum and which noise spectrum. We find that the quality of synthesized speech is improved using this application. Also, we classify the voiced sounds by cepstral euclidian distance measure for reducing overhead memory. The representative excitation signal of the same group's voiced sounds is used as excitation signal on synthesis. This method does not affect the quality of synthesized speech. As the result of experiment, this method eliminates the "buzziness" of synthesized speech and reduces the spectral distortion of synthesized speech.ed speech.
PDF

Improved Leakage Signal Blocking Methods for Two Channel Generalized Sidelobe Canceller

Kim, Ki-Hyeon;Ko, Han-Seok
- Speech Sciences
- /
- v.13 no.1
- /
- pp.117-128
- /
- 2006
The two-channel Generalized Sidelobe Canceller (GSC) scheme suffers from the presence of leakage signal in the reference channel. The leakage signal is caused by the dissimilar impulse responses between microphones, and different paths from speech source to microphones. Such leakage is detrimental to speech enhancement of the GSC since the desired reference signal becomes corrupted. In order to suppress the signal leakage, two matrix injection methods are proposed. In the first method, a simple gain compensation matrix is used. In the second, a projection matrix for reducing the error between the actual and the ideal primary and reference signals, is used. This paper describes the performance degradation resulting from leakage, and proposes effective methods to resolve the problem. Representative experiments were conducted to demonstrate the effectiveness of the proposed methods on recorded speech and noise in an actual automobile environment.
PDF

RECOGNITION SYSTEM USING VOCAL-CORD SIGNAL (성대 신호를 이용한 인식 시스템)

Cho, Kwan-Hyun;Han, Mun-Sung;Park, Jun-Seok;Jeong, Young-Gyu
- Proceedings of the KIEE Conference
- /
- 2005.10b
- /
- pp.216-218
- /
- 2005
This paper present a new approach to a noise robust recognizer for WPS interface. In noisy environments, performance of speech recognition is decreased rapidly. To solve this problem, We propose the recognition system using vocal-cord signal instead of speech. Vocal-cord signal has low quality but it is more robust to environment noise than speech signal. As a result, we obtained 75.21% accuracy using MFCC with CMS and 83.72% accuracy using ZCPA with RASTA.
PDF

IMBE Model Based SNR Estimation of Continuous Speech Signals (연속음성신호에서 IMBE 모델을 이용한 SNR 추정 연구)

Park, Hyung-Woo;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.2
- /
- pp.148-153
- /
- 2010
In speech signal processing, speech signal corrupted by noise should be enhanced to improve quality. Usually noise estimation methods need flexibility for variable environment. Noise profile is renewed on silence region to avoid effects of speech properties. So we have to preprocess finding voice region before noise estimation. However, if received signal does not have silence region, we cannot apply that method. In this paper, we proposed SNR estimation method for continuous speech signal. A Speech signal consists of Voice and Unvoiced Band in The MBE excitation model. And the energy of speech signal is mostly distributed on voiced region, so we can estimate SNR by the ratio of voiced region energy to unvoiced. We use the IMBE vocoder for the Voice or Unvoice band of segmented speech signal. Continuously we calculate the segmented SNR using that information and the energy of each band. And we estimate the SNR of continuous speech signal.
https://doi.org/10.7776/ASK.2010.29.2.148 인용 PDF KSCI

Noise Cancellation Algorithm of Bone Conduction Speech Signal using Feature of Noise in Separated Band (밴드 별 잡음 특징을 이용한 골전도 음성신호의 잡음 제거 알고리즘)

Lee, Jina;Lee, Gihyoun;Na, Sung Dae;Seong, Ki Woong;Cho, Jin Ho;Kim, Myoung Nam
- Journal of Korea Multimedia Society
- /
- v.19 no.2
- /
- pp.128-137
- /
- 2016
In mobile communication, air conduction(AC) speech signal had been commonly used, but it was easily affected by ambient noise environment such as emergency, military action and rescue. To overcome the weakness of the AC speech signal, bone conduction(BC) speech signal have been used. The BC speech signal is transmitted through bone vibration, so it is affected less by the background noise. In this paper, we proposed noise cancellation algorithm of the BC speech signal using noise feature of decomposed bands. The proposed algorithm consist of three steps. First, the BC speech signal is divided into 17 bands using perceptual wavelet packet decomposition. Second, threshold is calculated by noise feature during short time of separated-band and compared to absolute average of the signal frame. Therefore, the speech and noise parts are detected. Last, the detected noise parts are removed and then, noise eliminated bands are re-synthesised. In order to confirm the efficiency of the proposed algorithm, we compared the proposed algorithm with conventional algorithm. And the proposed algorithm has better performance than the conventional algorithm.
https://doi.org/10.9717/kmms.2016.19.2.128 인용 PDF KSCI KPUBS HTML

Pitch Detection Using Wavelet Transform (웨이브렛 변환을 이용한 피치검출)

Seok, Jong-Won;Son, Young-Ho;Bae, Keun-Sung
- Speech Sciences
- /
- v.5 no.1
- /
- pp.23-33
- /
- 1999
Mallat has shown that, with a proper choice of wavelet function, the local maxima of wavelet transformed signal indicate a sharp variation in the signal. Since the glottal closure causes sharp discontinuities in the speech signal, dyadic wavelet transform can be useful for detecting abrupt change in the voiced sounds, i.e., epochs. In this paper, we investigate the glottal closure instants obtained from the wavelet analysis of speech signal and compare them with those obtained from the EGG signal. Then, we detect pitch period of speech signal on the basis of these results. Experimental results demonstrated that local maxima of wavelet transformed signal give accurate estimation of epoch and pitch periods of voiced sound obtained by the proposed algorithm also correspond to those from EGG well.
PDF

Search Result 1,172, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)