• Title/Summary/Keyword: Cepstrum Analysis

Search Result 91, Processing Time 0.033 seconds

Speaker Verification Performance Improvement Using Weighted Residual Cepstrum (가중된 예측 오차 파라미터를 사용한 화자 확인 성능 개선)

  • 위진우;강철호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.5
    • /
    • pp.48-53
    • /
    • 2001
  • In speaker verification based on LPC analysis the prediction residues are ignored and LPCC(LPC cepstrum) are only used to compose feature vectors. In this study, LPCC and RCEP (residual cepstrum) extracted from residues are used as feature parameters in the various environmental speaker verification. We propose the weighting function which can enlarge inter-speaker variation by weighting pitch, speaker inherent vector, included in residual cepstrum. Simulation results show that the average speaker verification rate is improved in the rate of 6% with RCEP and LPCC at the same time and is improved in the rate of 2.45% with the proposed weighted RCEP and LPCC at the same time compared with no weighting.

  • PDF

A study on the acoustical inversion method using cepstrum analysis of underwater ship radiated noise (선박 수중방사소음의 셉스트럼 분석을 이용한 음향역산법 연구)

  • Park, Cheolsoo;Kim, Gun Do;Yim, Geuntae;Moon, Il-Sung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.1
    • /
    • pp.73-81
    • /
    • 2019
  • This paper proposes an acoustical inversion method using cepstrum analysis of underwater ship noise. Through the cepstrum analysis, multipath structure can be extracted from the recorded ship noise. The multipath structure comes from interferences between a direct arrival and multiple reflections from the sea surface and the bottom. The acoustic inversion is the optimization process to find the best parameters which show good correlation between cepstrums of the measured signal and the replica. The inversion method was applied to the underwater ship radiated noise data measured at Straits of Korea in order to estimate the acoustic center of the ship and the hydrophone position. The inversion results showed good agreement with the measured information.

Analysis of Impedance of Multilayer Structure using Cepstrum Technique (켑스트럼 기법을 이용한 다층구조물의 임피던스 해석)

  • Shin, Jin-Seob;Jun, Kye-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.4
    • /
    • pp.85-89
    • /
    • 1997
  • In this paper, the imdedance for each layer using triple cepstrum signal processing for reflected ultrasonic signal from the multilayer structure has been analyzed. The reflection coefficient can be obtained from the amplitude and the polarity of the peaks in the triple cepstrum, and then the impedance of each layer has been reconstructed by the reflection coefficient. In this experiment, four types of multilayers consisting of different metal layers were manufactured. The reflected signals from the multilayer structure have been detected by pulse-echo method. The impedances have been reconstructed by triple cepstrum technique. The experimental results have been in good agreement with the theoretical results.

  • PDF

On a Pitch Alteration Technique by Cepstrum Analysis of Flattened Excitation Spectrum (평탄화된 여기 스펙트럼에서 켑스트럼 피치 변경법에 관한 연구)

  • 조왕래
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.159-162
    • /
    • 1998
  • Speech synthesis coding is classified into three categories: waveform coding, source coding and hybrid coding. To obtain the synthetic speech with high quality, the synthesis by waveform coding is desired. However, it is difficult to apply waveform coding to synthesis by syllable or phoneme unit, because it does not divide the speech into excitation and formant component. Thus it is required to alter the excitation in waveform coding for applying waveform coding to synthesis by rule. In this paper we propose a new pitch alteration method that minimizes the spectrum distortion by using the behavior of cepstrum. This method splits the spectrum of speech signal into excitation spectrum and formant spectrum and transforms the excitation spectrum into cepstrum domain. The pitch of excitation cepstrum is altered by zero insertion or zero deletion and the pitch altered spectrum is reconstructed in spectrum domain. As a result of performance test, the average spectrum distortion was below 2.29%.

  • PDF

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme

  • Kumar, Sandeep
    • ETRI Journal
    • /
    • v.38 no.3
    • /
    • pp.425-434
    • /
    • 2016
  • A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.

Cepstrum analysis on the chatter vibration generated by the machine tool (공작기계의 채터진동에 대한 켑스트럼 분석)

  • 김명구;최봉학;이흥식;조종두
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2004.05a
    • /
    • pp.77-82
    • /
    • 2004
  • There were many researches about the chatter vibration occur in the cutting process of machine tools. But there are in sufficient research parts ; the frequency about the chatter vibration and its characteristics and its nonlinear properties. This paper measured signals of vibration that occur before and immediately after and after the chatter vibration. This signals were analyzed through autospectrum obtained by the Fast Fourier Transform(FFT). And then, the nonlinear characteristis were analyzed by cepstrum analysis through FFT of autospectrun.

  • PDF

Real-time implementation and performance evaluation of speech classifiers in speech analysis-synthesis

  • Kumar, Sandeep
    • ETRI Journal
    • /
    • v.43 no.1
    • /
    • pp.82-94
    • /
    • 2021
  • In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR-E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear-predictive-coding-based speech analysis-synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN-based speech classifier performs better than the ACF-, AMDF-, cepstrum-, WACF- and ZCR-E-based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF-based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN-based speech classifier is greater compared with other classifiers.

Visibility Enhancement of the Ultrasonic Signal Reflected from Adhesive Layers (접착층에서 반사된 초음파 신호의 가시도 개선)

  • Shin, Jin Seob;Lee, Jeong-Ihll
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.8 no.6
    • /
    • pp.153-157
    • /
    • 2008
  • Recently, electronic devices is produced by multilayer structure, therefore analysis for hidden layers is important nondestructive inspection. This paper presents visibility enhancement methods for the ultrasonic multiple echoes reflected from adhesive layer in the multilayers using digital signal processing. The reflected signals from the multilayers come out interval of the peaks in the power cepstrum. In the experiment, the adhesive layers of settled thickness using epoxy were formed. The reflected signals from the multilayer is detected by pulse-echo method and power cepstrum is processed for enhancement of visibility.

  • PDF

A Study on Speech Recognition using Vocal Tract Area Function (성도 면적 함수를 이용한 음성 인식에 관한 연구)

  • 송제혁;김동준
    • Journal of Biomedical Engineering Research
    • /
    • v.16 no.3
    • /
    • pp.345-352
    • /
    • 1995
  • The LPC cepstrum coefficients, which are an acoustic features of speech signal, have been widely used as the feature parameter for various speech recognition systems and showed good performance. The vocal tract area function is a kind of articulatory feature, which is related with the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. The linear predictive analysis using Burg algorithm and the vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are executed using the conventional LPC cepstrum coefficients and the vocal tract area function. The recognitions using the area function showed the slightly better results than those using the conventional LPC cepstrum coefficients.

  • PDF

Spectral Characteristics and Nasalance Scores of Hypernasality in Patient with Cleft Palate

  • Soh, Byung-Soo;Shin, Hyo-Keun;Kim, Hyun-Gi
    • Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.27-35
    • /
    • 2005
  • Differential instrumentation for the diagnoses of individuals with Cleft palate has been used to objectively measure speech problems. The Cepstrum Method was used to study the vocal tract transfer function. The vocal tract transfer function and the source spectrum should be considered in the evaluation of nasal resonance. The aim of this study was to collect quantitative data on the acoustic Instrumentation used for evaluating hypernasality. Normal subjects (9 male, 21 female; 37 male children, 20 female children) and individuals with VPI (13 male, 8 female; 16 male children, 9 female) participated in this study. The vowel /i/ was selected to gauge the severances of hypernasality Spectral and Cepstral studies using CSL was used to identify the acoustic characteristics. Cepstrum analysis shows significant differences in quefrency and amplitude. The quefrency of normal groups was shorter than that of the VPI groups, while the amplitude of normal groups was lower than that of the VPI groups. This may have significance in the evaluation 'of nasal resonance.

  • PDF