• 제목/요약/키워드: Cepstrum Analysis

검색결과 91건 처리시간 0.027초

가중된 예측 오차 파라미터를 사용한 화자 확인 성능 개선 (Speaker Verification Performance Improvement Using Weighted Residual Cepstrum)

  • 위진우;강철호
    • 한국음향학회지
    • /
    • 제20권5호
    • /
    • pp.48-53
    • /
    • 2001
  • LPC분석 기반 화자 확인에서 잔여성분(residue) 예측은 보통 무시되고, LPCC(LPC-cepstrum)만이 특징 파라미터로 사용된다. 본 연구에서는 잔여성분으로부터 추출된 예측파라미터인 잔여 켑스트럼(residual cepstrum)을 LPCC와 함께 여러 환경에서 구축된 데이터 베이스에서 화자특징 파라미터로 사용하였다. 또한, 잔여 켑스트럼에 포함되어있는 화자 고유성분인 피치(pitch)성분에 큰 가중치(weighting)를 줌으로써 화자간 변이(inter-speaker variation)가 커지도록 하는 가중치 함수를 제안한다. 실험 결과, LPCC만을 특징 파라미터로 사용하였을 경우보다 잔여 켑스트럼 (RCEP)과 LPCC를 동시에 사용했을 경우 약 6%가량의 인식 오류율이 향상 되었으며, 제안한 가중치 함수를 적용한 잔여 켑스트럼 (RCEP)과 LPCC를 동시에 사용했을 경우 인식 오류율이 가중치를 주지 않은 경우보다 약 2.45%가량 개선되었다.

  • PDF

선박 수중방사소음의 셉스트럼 분석을 이용한 음향역산법 연구 (A study on the acoustical inversion method using cepstrum analysis of underwater ship radiated noise)

  • 박철수;김건도;임근태;문일성
    • 한국음향학회지
    • /
    • 제38권1호
    • /
    • pp.73-81
    • /
    • 2019
  • 본 논문에서는 선박 수중방사소음의 셉스트럼(cepstrum) 분석을 이용한 음향역산법을 제안하였다. 셉스트럼 분석을 통해 수중 청음기에서 계측된 선박 소음으로부터 직접 도달파와 해수면과 해저면에서 반사파와의 간섭에서 기인한 음파의 다중반사 구조를 추출할 수 있다. 음향학적 역산은 계측 신호의 셉스트럼과 모의 신호의 셉스트럼을 비교하여 최적의 역산인자를 찾는 방식으로 구성되었다. 본 논문에서 제안된 역산기법을 대한해협에서 계측한 선박 수중방사소음 데이터에 적용하여 대상 선박의 음원중심과 수중청음기의 위치를 추정하였다.

켑스트럼 기법을 이용한 다층구조물의 임피던스 해석 (Analysis of Impedance of Multilayer Structure using Cepstrum Technique)

  • 신진섭;전계석
    • 한국음향학회지
    • /
    • 제16권4호
    • /
    • pp.85-89
    • /
    • 1997
  • 본 논문에서는 다층구조물에 초음파를 입사시켰을 때 반사된 신호를 켑스트럼 기법으로 신호처리하여 각 층의 임피던스 변화를 해석하였다. 이를 위하여 각 층에서 반사된 초음파 반사신호에 삼중 켑스트럼 기법을 적용하여 최대 진폭과 극성을 구하고 이러한 값으로 각 층의 반사계수를 산출하므로써 임피던스를 구할 수 있었다. 실험을 위하여 얻어진 반사신호를 켑스트럼 처리하여 임피던스를 측정한 결과 이론치와 잘 일치하였다.

  • PDF

평탄화된 여기 스펙트럼에서 켑스트럼 피치 변경법에 관한 연구 (On a Pitch Alteration Technique by Cepstrum Analysis of Flattened Excitation Spectrum)

  • 조왕래
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 학술발표대회 논문집 제17권 1호
    • /
    • pp.159-162
    • /
    • 1998
  • Speech synthesis coding is classified into three categories: waveform coding, source coding and hybrid coding. To obtain the synthetic speech with high quality, the synthesis by waveform coding is desired. However, it is difficult to apply waveform coding to synthesis by syllable or phoneme unit, because it does not divide the speech into excitation and formant component. Thus it is required to alter the excitation in waveform coding for applying waveform coding to synthesis by rule. In this paper we propose a new pitch alteration method that minimizes the spectrum distortion by using the behavior of cepstrum. This method splits the spectrum of speech signal into excitation spectrum and formant spectrum and transforms the excitation spectrum into cepstrum domain. The pitch of excitation cepstrum is altered by zero insertion or zero deletion and the pitch altered spectrum is reconstructed in spectrum domain. As a result of performance test, the average spectrum distortion was below 2.29%.

  • PDF

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme

  • Kumar, Sandeep
    • ETRI Journal
    • /
    • 제38권3호
    • /
    • pp.425-434
    • /
    • 2016
  • A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.

공작기계의 채터진동에 대한 켑스트럼 분석 (Cepstrum analysis on the chatter vibration generated by the machine tool)

  • 김명구;최봉학;이흥식;조종두
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2004년도 춘계학술대회논문집
    • /
    • pp.77-82
    • /
    • 2004
  • There were many researches about the chatter vibration occur in the cutting process of machine tools. But there are in sufficient research parts ; the frequency about the chatter vibration and its characteristics and its nonlinear properties. This paper measured signals of vibration that occur before and immediately after and after the chatter vibration. This signals were analyzed through autospectrum obtained by the Fast Fourier Transform(FFT). And then, the nonlinear characteristis were analyzed by cepstrum analysis through FFT of autospectrun.

  • PDF

Real-time implementation and performance evaluation of speech classifiers in speech analysis-synthesis

  • Kumar, Sandeep
    • ETRI Journal
    • /
    • 제43권1호
    • /
    • pp.82-94
    • /
    • 2021
  • In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR-E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear-predictive-coding-based speech analysis-synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN-based speech classifier performs better than the ACF-, AMDF-, cepstrum-, WACF- and ZCR-E-based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF-based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN-based speech classifier is greater compared with other classifiers.

접착층에서 반사된 초음파 신호의 가시도 개선 (Visibility Enhancement of the Ultrasonic Signal Reflected from Adhesive Layers)

  • 신진섭;이정일
    • 한국인터넷방송통신학회논문지
    • /
    • 제8권6호
    • /
    • pp.153-157
    • /
    • 2008
  • 최근 산업사회에서 널리 쓰이는 전자소자들은 다층구조로 제작되고 있는 실정이며 이러한 소자의 보이지 않는 층에 대한 해석은 비파괴 검사에서 중요한 일이다. 따라서 본 논문에서는 접착층이 존재하는 다층구조물에 초음파를 입사시켰을 때 반사되는 신호를 디지털 신호처리하여 가시도를 개선하였다. 이를 위하여 다층구조물에서 반사된 신호를 전력 켑스트럼 처리하여 각층에서 나타난 첫 번째 피크와 두 번째 피크를 구할 수 있었다. 실험을 위하여 일정한 두께를 갖는 에폭시층이 존재하는 다층구조물을 형성하였고 초음파 펄스-에코 방법에 의하여 얻어진 반사신호의 가시도를 개선하기 위해 전력 켑스트럼 처리하였다.

  • PDF

성도 면적 함수를 이용한 음성 인식에 관한 연구 (A Study on Speech Recognition using Vocal Tract Area Function)

  • 송제혁;김동준
    • 대한의용생체공학회:의공학회지
    • /
    • 제16권3호
    • /
    • pp.345-352
    • /
    • 1995
  • The LPC cepstrum coefficients, which are an acoustic features of speech signal, have been widely used as the feature parameter for various speech recognition systems and showed good performance. The vocal tract area function is a kind of articulatory feature, which is related with the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. The linear predictive analysis using Burg algorithm and the vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are executed using the conventional LPC cepstrum coefficients and the vocal tract area function. The recognitions using the area function showed the slightly better results than those using the conventional LPC cepstrum coefficients.

  • PDF

Spectral Characteristics and Nasalance Scores of Hypernasality in Patient with Cleft Palate

  • Soh, Byung-Soo;Shin, Hyo-Keun;Kim, Hyun-Gi
    • 음성과학
    • /
    • 제12권1호
    • /
    • pp.27-35
    • /
    • 2005
  • Differential instrumentation for the diagnoses of individuals with Cleft palate has been used to objectively measure speech problems. The Cepstrum Method was used to study the vocal tract transfer function. The vocal tract transfer function and the source spectrum should be considered in the evaluation of nasal resonance. The aim of this study was to collect quantitative data on the acoustic Instrumentation used for evaluating hypernasality. Normal subjects (9 male, 21 female; 37 male children, 20 female children) and individuals with VPI (13 male, 8 female; 16 male children, 9 female) participated in this study. The vowel /i/ was selected to gauge the severances of hypernasality Spectral and Cepstral studies using CSL was used to identify the acoustic characteristics. Cepstrum analysis shows significant differences in quefrency and amplitude. The quefrency of normal groups was shorter than that of the VPI groups, while the amplitude of normal groups was lower than that of the VPI groups. This may have significance in the evaluation 'of nasal resonance.

  • PDF