Search | Korea Science

A Study on the Channel Normalized Pitch Synchronous Cepstrum for Speaker Recognition (채널에 강인한 화자 인식을 위한 채널 정규화 피치 동기 켑스트럼에 관한 연구)

김유진;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.1
- /
- pp.61-74
- /
- 2004
In this paper, a contort- and speaker-dependent cepstrum extraction method and a channel normalization method for minimizing the loss of speaker characteristics in the cepstrum were proposed for a robust speaker recognition system over the channel. The proposed extraction method creates a cepstrum based on the pitch synchronous analysis using the inherent pitch of the speaker. Therefore, the cepstrum called the 〃pitch synchronous cepstrum〃 (PSC) represents the impulse response of the vocal tract more accurately in voiced speech. And the PSC can compensate for channel distortion because the pitch is more robust in a channel environment than the spectrum of speech. And the proposed channel normalization method, the 〃formant-broadened pitch synchronous CMS〃 (FBPSCMS), applies the Formant-Broadened CMS to the PSC and improves the accuracy of the intraframe processing. We compared the text-independent closed-set speaker identification on 56 females and 112 males using TIMIT and NTIMIT database, respectively. The results show that pitch synchronous km improves the error reduction rate by up to 7.7% in comparison with conventional short-time cepstrum and the error rates of the FBPSCMS are more stable and lower than those of pole-filtered CMS.
PDF KSCI

Parameter Considering Variance Property for Speech Recognition in Noisy Environment (잡음환경에서의 음성인식을 위한 변이특성을 고려한 파라메터)

Park, Jin-Young;Lee, Kwang-Seok;Koh, Si-Young;Hur, Kang-In
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- v.9 no.2
- /
- pp.469-472
- /
- 2005
This paper propose about effective speech feature parameter that have robust character in effect of noise in realizing speech recognition system. Established MFCC that is the basic parameter used to ASR(Automatic Speech Recognition) and DCTCs that use DCT in basic parameter. Also, proposed delta-Cepstrum and delta-delta-Cepstrum parameter that reconstruct Cepstrum to have information for variation of speech. And compared recognition performance in using HMM. For dimension reduction of each parameter LDA algorithm apply and compared recognition. Results are presented reduced dimension delta-delta-Cepstrum parameter in using LDA recognition performance that improve more than existent parameter in noise environment of various condition.
PDF

Detection of Impulse Signal in Noise Using a Minimum Variance Cepstrum -Application on Faults Detection in a Bearing System (최소 분산 캡스트럼을 이용한 노이즈 속에 묻힌 임펄스 검출 방법-베어링 결함 검출에의 적용)

최영철;김양한
- Journal of KSNVE
- /
- v.10 no.6
- /
- pp.985-990
- /
- 2000
The signals that can be obtained from rotating machines often convey the information of machine. For example, if the machine under investigation has faults, then these signals often have pulse signals, embedded in noise. Therefore the ability to detect the fault signal in noise is major concern of fault diagnosis of rotating machine, In this paper, minimum variance cepstrum (MV cepstrum) . which can easily detect impulse in noise, has been applied to detect the type of faults of ball bearing system. To test the performance of this technique. various experiments have been performed for ball bearing elements that have man made faults. Results show that minimum variance cepstrum can easily detect the periodicity due to faults and also shows the pattern of excitation by the faults.
PDF

Noise Analysis of Geared Motor using Cepstrum and Comb Lifter (Cepstrum과 Comb Lifter를 이용한 기어드 모터의 소음 분석)

Lee Min Hwan;Kang Dong Bae;Kim Hwa Young;Ahn Jung Hwan
- Journal of the Korean Society for Precision Engineering
- /
- v.22 no.5 s.170
- /
- pp.72-79
- /
- 2005
Gearing system emits inconsistent noise from gear teeth impact in case of gear defects. But, it is not easy for inspection operator in production line to distinguish objectively the defective product. Therefore, customer complains continuously bad noise of the geared motor. Because impulsive signal at low frequency has a tendency not to appear in frequency domain, it is difficult to separate the gear inconsistent noise of defective gear from overall geared motor's noise using general signal processing method such as FFT. In this paper, the method to estimate more objectively the inconsistent noise of gearing system and to measure the quantities is suggested. Suggested method uses Cepstrum, Autocorrelation, Comb Lifter and Inverse Cepstrum by turns to make objective quantities about noise level.
PDF KSCI

Speaker Verification Performance Improvement Using Weighted Residual Cepstrum (가중된 예측 오차 파라미터를 사용한 화자 확인 성능 개선)

위진우;강철호
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.5
- /
- pp.48-53
- /
- 2001
In speaker verification based on LPC analysis the prediction residues are ignored and LPCC(LPC cepstrum) are only used to compose feature vectors. In this study, LPCC and RCEP (residual cepstrum) extracted from residues are used as feature parameters in the various environmental speaker verification. We propose the weighting function which can enlarge inter-speaker variation by weighting pitch, speaker inherent vector, included in residual cepstrum. Simulation results show that the average speaker verification rate is improved in the rate of 6％ with RCEP and LPCC at the same time and is improved in the rate of 2.45% with the proposed weighted RCEP and LPCC at the same time compared with no weighting.
PDF

Formant-broadened CMS Using the Log-spectrum Transformed from the Cepstrum (켑스트럼으로부터 변환된 로그 스펙트럼을 이용한 포먼트 평활화 켑스트럴 평균 차감법)

김유진;정혜경;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.4
- /
- pp.361-373
- /
- 2002
In this paper, we propose a channel normalization method to improve the performance of CMS (cepstral mean subtraction) which is widely adopted to normalize a channel variation for speech and speaker recognition. CMS which estimates the channel effects by averaging long-term cepstrum has a weak point that the estimated channel is biased by the formants of voiced speech which include a useful speech information. The proposed Formant-broadened Cepstral Mean Subtraction (FBCMS) is based on the facts that the formants can be found easily in log spectrum which is transformed from the cepstrum by fourier transform and the formants correspond to the dominant poles of all-pole model which is usually modeled vocal tract. The FBCMS evaluates only poles to be broadened from the log spectrum without polynomial factorization and makes a formant-broadened cepstrum by broadening the bandwidths of formant poles. We can estimate the channel cepstrum effectively by averaging formant-broadened cepstral coefficients. We performed the experiments to compare FBCMS with CMS, PFCMS using 4 simulated telephone channels. In the experiment of channel estimation, we evaluated the distance cepstrum of real channel from the cepstrum of estimated channel and found that we were able to get the mean cepstrum closer to the channel cepstrum due to an softening the bias of mean cepstrum to speech. In the experiment of text-independent speaker identification, we showed the result that the proposed method was superior than the conventional CMS and comparable to the pole-filtered CMS. Consequently, we showed the proposed method was efficiently able to normalize the channel variation based on the conventional CMS.
PDF KSCI

Vowel Recognition Using the Fractal Dimension (프랙탈 차원을 이용한 모음인식)

최철영;김형순;김재호;손경식
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.19 no.6
- /
- pp.1140-1148
- /
- 1994
In this paper, we carried out some experiments on the Korean vowel recognition using the fractal dimension of the speech signals. We chose the Minkowski-Bouligand dimension as the fractal dimension, and computed it using the morphological covering method. For our experiments, we used both the fractal dimension and the LPC cepstrum which is conventionally known to be one of the best parameters for speech recognition, and examined the usefulness of the fractal dimension. From the vowel recognition experiments under various consonant contexts, we achieved the vowel recognition error rates of 5.6% and 3.2% for the case with only LPC cepstrum and that with both LPC cepstrum and the fractal dimension, respectively. The results indicate that the incorporation of the fractal dimension with LPC cepstrum gives more than 40% reduction in recognition errors, and indicates that the fractal dimension is a useful feature parameter for speech recognition.
PDF

A Study on Cepstrum Analysis for Wheel Flat Detection in Railway Vehicles (차륜의 찰상결함 진단을 위한 켑스트럼 분석 방법 연구)

Kim, Geoyoung;Kim, Hyuntae;Koo, Jeongseo
- Journal of the Korean Society of Safety
- /
- v.31 no.3
- /
- pp.28-33
- /
- 2016
Since defects in the wheels of railway vehicles, which occur due to wears with the rail, cause serious damage to the running device, the diagnostic monitoring system for condition-based maintenance is required to secure the driving safety. In this paper, we studied to apply a useful Cepstrum analysis to detect periodic structure in spectrum among the vibration signal processing techniques for the fault diagnosis of a rotating body such as wheel. In order to analyze in variations of train velocity, the Cepstrum analysis was performed after a domain change of the vibration signal from time domain to rotation angle domain. When domains change, it is important to use a interpolation for a uniform interval of the rotation angle. Finally, the Cepstrum analysis for wheel flat detection was verified by using the vibration signal including the disturbance resulting from the rail irregularities and the vibration of bogie components.
https://doi.org/10.14346/JKOSOS.2016.31.3.28 인용 PDF KSCI

The study on Korean isolated-word recognition using LPC cepstrum and clustering (LPC cepstrum 과 집단화를 이용한 한국어 고립단어 인식에 관한 연구)

김진영
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1987.11a
- /
- pp.70-74
- /
- 1987
본 논문은 화자독립 고립단어 인식에 있어서 LP 모델의 문제점과 그 해결 방안으로서 cepstrum 영역에 있어서 lifter를 이용한 해결에 대해서 고찰하였다. 한편, 각 인식 단어의 기준 패턴을 구하기 위한 방법으로서 집단화의 방법에 대해 논하였다. 집단화의 방법으로서는 UWA 방법과 K-iteration 방법을 변형시킨 KMA 방법을 제시 비교하였다. 인식 실험결과 정현파 lifter와 KMA의 집단화 방법을 사용하였을 때 95%의 최고 인식률을 보였다.
PDF

A Study on the CEPSTRUM Method for the Function Classification of EMG Signal (EMG 신호의 기능 분류에 적용되는 CEPSTRUM 기법에 관한 연구)

Wang, Moon-Sung;Byun, Yoon-Shik;Park, Sang-Hui
- Proceedings of the KOSOMBE Conference
- /
- v.1992 no.11
- /
- pp.79-82
- /
- 1992
Under the assumption that the EMG signal was used as the reference signal for driving a prosthetic arm, function discrimination of EMG signal from the biceps and triceps of subject was achived with LPC CEPSTRUM coefficients. By varying the number of coefficients, the types of windows, window size, and window overlaping rates, the best conditions for the function discrimination of EMG signal were obtained.
PDF

Search Result 274, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)