• 제목/요약/키워드: Cepstral Analysis

검색결과 80건 처리시간 0.02초

Condition Monitoring기법에 의한 베어링의 이상진단 (Bearing Fault Diagnosis by Condition Monitoring Method)

  • 이정철;오재응;염성하;권오관
    • Tribology and Lubricants
    • /
    • 제3권1호
    • /
    • pp.52-60
    • /
    • 1987
  • Many kinds of condition monitoring technique as the preventive maintenance technique have been studied, so this study has investigated the possibility of chbcking the trend in the fault diagnosis of ball bearing, one of the important elements of rotating machine, by applying the cepstral analysis method. And computer simulation is conducted in order to identify obviously the physical meaning of cepstral analysis. It is identified that cepstral analysis is effective method to distinguish between the basic and reflected wave by computer simulation, and we know that it is possible to apply the cepstral analysis to the arbitrary elements of rotating machine which are different in fundamental frequency. It is verified that cepstral analysis method is more effective than the other conventional method in bearing fault diganosis.

기식 등급에 따른 CPP (Cepstral Peak Prominence) 분석 비교 (A comparison of CPP analysis among breathiness ranks)

  • 강영애;구본석;조철우
    • 말소리와 음성과학
    • /
    • 제7권1호
    • /
    • pp.21-26
    • /
    • 2015
  • The aim of this study is to synthesize pathological breathy voice and to make a cepstral peak prominence (CPP) table following breathiness ranks by cepstral analysis to supplement reliability of the perceptual auditory judgment task. KlattGrid synthesizer included in Praat was used. Synthesis parameters consist of two groups, i.e., constants and variables. Constant parameters are pitch, amplitude, flutter, open phase, oral formant and bandwidth. Variable parameters are breathiness (BR), aspiration amplitude (AH), and spectral tilt (TL). Five hundred sixty samples of synthetic breathy vowel /a/ for male were created. Three raters participated in ranking of the breathiness. 217 were proved to be inadequate samples from perceptual judgment and cepstral analysis. Finally, 343 samples were selected. These CPP values and other related parameters from cepstral analysis are classified under four breathiness ranks (B0~B3). The mean and standard deviation of CPP is $16.10{\pm}1.15$ dB(B0), $13.68{\pm}1.34$ dB(B1), $10.97{\pm}1.41$ dB(B2), and $3.03{\pm}4.07$ dB(B3). The value of CPP decreases toward the severe group of breathiness because there is a lot of noise and a small quantity of harmonics.

식도발성화자 음성의 spectral & cepstral 분석 (Spectral and Cepstral Analyses of Esophageal Speakers)

  • 심희정;장효령;신희백;고도흥
    • 말소리와 음성과학
    • /
    • 제6권2호
    • /
    • pp.47-54
    • /
    • 2014
  • The purpose of this study was to analyze spectral versus cepstral measurements in esophageal speakers. The comparison between the measurements in thirteen male esophageal speakers was compared with the control group of thirteen normal speakers using the sustained vowel /a/. The main results can be summarized as below: (a) the CPP and L/H ratio of the esophageal group were significantly lower than those of the control group (b) the CPP was significantly correlated with the spectral parameters such as jitter, shimmer, NHR and VTI, and (c) the ROC analysis showed that the threshold of 10.25dB for the CPP achieved a good classification for esophageal speakers, with 100% perfect sensitivity and specificity. Thus, it was known that cepstral-based acoustic measures such as CPP, may be more reliable predictors than other spectral-based acoustic measures such as jitter and shimmer. And it was found that cepstral-based acoustic measures were effective in distinguishing esophageal voice quality from normal voice quality. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation with laryngectomees.

캡스트럼 분석을 이용한 해금의 스펙트럼 모델링 (Spectral Modeling of Haegeum Using Cepstral Analysis)

  • 홍연우;강명수;조상진;김종면;이정철;정의필
    • 한국음향학회지
    • /
    • 제29권4호
    • /
    • pp.243-250
    • /
    • 2010
  • 본 논문에서는 해금 소리의 시간에 따른 변화를 사실적으로 묘사하기 위해 캡스트럼 분석을 이용한 전통 악기 해금의 스펙트럼 모델링을 제안한다. 정확한 캡스트럼 분석 결과를 얻기 위해 프레임 사이즈는 입력 신호의 3주기로 하였고 포만트 추출에 더 많은 캡스트럼 계수를 활용하였다. 정현파 성분 합성 과정에서 대역통과 필터의 차단주파수를 공명점 별로 유동적으로 조절하고 노이즈 성분에 남아있는 피크 성분들을 제거하는 과정을 추가하여 성능을 향상시켰다. 음 높이의 변화를 판단하기 위해 입력 프레임을 묵음구간, 어택구간, 지속구간으로 분류하였고 기본주파수에 따라 프레임 사이즈를 가변적으로 조절하였으며 지속구간에서의 기본주파수 검출 오류를 수정함으로써 정확도를 향상시켰다. 해금 연주 전문가의 청취테스트를 통해 원음과 합성음이 96~100 % 유사하다는 평가 결과를 얻었다.

화자인식을 위한 주파수 워핑 기반 특징 및 주파수-시간 특징 평가 (Evaluation of Frequency Warping Based Features and Spectro-Temporal Features for Speaker Recognition)

  • 최영호;반성민;김경화;김형순
    • 말소리와 음성과학
    • /
    • 제7권1호
    • /
    • pp.3-10
    • /
    • 2015
  • In this paper, different frequency scales in cepstral feature extraction are evaluated for the text-independent speaker recognition. To this end, mel-frequency cepstral coefficients (MFCCs), linear frequency cepstral coefficients (LFCCs), and bilinear warped frequency cepstral coefficients (BWFCCs) are applied to the speaker recognition experiment. In addition, the spectro-temporal features extracted by the cepstral-time matrix (CTM) are examined as an alternative to the delta and delta-delta features. Experiments on the NIST speaker recognition evaluation (SRE) 2004 task are carried out using the Gaussian mixture model-universal background model (GMM-UBM) method and the joint factor analysis (JFA) method, both based on the ALIZE 3.0 toolkit. Experimental results using both the methods show that BWFCC with appropriate warping factor yields better performance than MFCC and LFCC. It is also shown that the feature set including the spectro-temporal information based on the CTM outperforms the conventional feature set including the delta and delta-delta features.

내전형연축성 발성장애 음성에 대한 켑스트럼과 스펙트럼 분석 (Cepstral and spectral analysis of voices with adductor spasmodic dysphonia)

  • 심희정;정훈;;최병흔;허정화;고도흥
    • 말소리와 음성과학
    • /
    • 제8권2호
    • /
    • pp.73-80
    • /
    • 2016
  • The purpose of this study was to analyze perceptual and spectral/cepstral measurements in patients with adductor spasmodic dysphonia(ADSD). Sixty participants with gender and age matched individuals(30 ADSD and 30 controls) were recorded in reading a sentence and sustained the vowel /a/. Acoustic data were analyzed acoustically by measuring CPP, L/H ratio, mean CPP F0 and CSID, and auditory-perceptual ratings were measured using GRBAS. The main results can be summarized as below: (a) the CSID for the connected speech was significantly higher than for the sustained vowel (b) the G, R and S for the connected speech were significantly higher than for the sustained vowel (c) Spectral/cepstral parameters were significantly correlated with the perceptual parameters, and (d) the ROC analysis showed that the threshold of 13.491 for the CSID achieved a good classification for ADSD, with 86.7% sensitivity and 96.7% specificity. Spectral and cepstral analysis for the connected speech is especially meaningful on cases where perceptual analysis and clinical evaluation alone are insufficient.

성대마비로 인한 기식 음성에 대한 Cepstral 분석 (A Cepstral Analysis of Breathy Voice with Vocal Fold Paralysis)

  • 강영애;성철재
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.89-94
    • /
    • 2012
  • The aim of this study is to investigate the usefulness of the parameter CPP (cepstral peak prominence) and LTAS (long term average spectrum) band energy for an analysis of breathy voice with vocal fold paralysis. Thirty-four female subjects who have vocal paralysis after thyroidectomy participated in this study. According to the perceptual judgements by three speech pathologists and one phonetic scholar, subjects were divided into two groups: breathy voice group (n = 21) and non-breathy voice group (n = 13). Maximum sustained phonation task was measured for acoustic analysis. CPP-related (i.e. mean F0, mean CPP, and mean CPPs) and LTAS-related (i.e. minimum, maximum, and mean) parameters were used. Independent samples t-test was conducted. Regarding CPP, there are significant differences in mean CPP and mean CPPs between groups. The values of mean CPP and CPPs in the non-breathy voice group are higher than those in the breathy voice group. The CPP could be regarded as the useful parameter for breathy voice analysis in the clinic. When it comes to LTAS, energy from 0 to 2 kHz are significantly different between groups. The minimum value of non-breathy group is lower than that of breathy group, whereas the maximum value of non-breathy group is higher. The frequency band below 2 kHz seems to be related to breathy voice.

Development of Software For Machinery Diagnostics by Adaptive Noise Cancelling Method (1St: Cepstrum Analysis)

  • Lee, Jung-Chul;Oh, Jae-Eung;Yum, Sung-Ha
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1987년도 한국자동제어학술회의논문집(한일합동학술편); 한국과학기술대학, 충남; 16-17 Oct. 1987
    • /
    • pp.836-841
    • /
    • 1987
  • Many kinds of conditioning monitoring technique have been studied, so this study has investigated the possibility of checking the trend in the fault diagnosis of ball bearing, one of the elements of rotating machine, by applying the cepstral analysis method using the adaptive noise cancelling (ANC) method. And computer simulation is conducted in oder to identify obviously the physical meaning of ANC. The optimal adaptation gain in adaptive filter is estimated, the performance of ANC according to the change of the signal to noise ratio and convergence of LMS algorithm is considered by simulation. It is verified that cepstral analysis using ANC method is more effective than the conventional cepstral analysis method in bearing fault diagnosis.

  • PDF

영교차점과 켑스트럼 전처리 기술을 이용한 반향환경에서의 음원방향 추정 (Zero-Crossing-Based Source Direction Estimation Using a Cepstral Prefiltering Technique)

  • 박용진;이수연;박형민
    • 대한음성학회지:말소리
    • /
    • 제67호
    • /
    • pp.121-133
    • /
    • 2008
  • To estimate directions of multi-sound sources, we consider an approach based on zero crossings which provided more robust results to diffuse noise than the conventional cross-correlation-based method [6][7]. In reverberant environments, the performance of source direction estimation can be improved by using signal components through direct paths from sources to microphones. Since a cepstral prefiltering technique [8] removes the effect of reverberation, we propose a source direction estimation method which can find out intervals of the direct-path components by comparing original and cepstral-prefiltered envelopes. Simulations demonstrate that the proposed method can improve the performance of source direction estimation in reverberant environments.

  • PDF

심리 음향 겝스트럼 평균 차감법을 이용한 이동 전화망에서의 음질 평가 (Speech Quality Measure in a Mobile Communication System using PLP Cepstral Distance with CMS)

  • 윤종진;박상욱;박영철;안동순;윤대희
    • 한국통신학회논문지
    • /
    • 제25권12B호
    • /
    • pp.2046-2051
    • /
    • 2000
  • 본 논문에서는 기존의 음질 평가 방법들보다 우수할 뿐 아니라 다양한 채널 경로의 음성 신호에 대해서도 일관된 성능을 갖는 새로운 음질 평가 방법 PLP-CMS(Perceptual Linear Predictive-Cepstral Mean Subtraction)를 제안한다. CDMA PCS 이동 전화 환경에서 음성 신호의 주관적 음질을 효과적으로 예측할 수 있는 PLP-CMS는 심리 음향 선형 예측 분석(PLP Analysis: Perceptual Linear Predictive Analysis)을 이용하여 주관적 음질과의 상관 관계를 높였으며, 겝스트럼 평균 차감(CMS: Cepstral Mean Subtraction) 과정을 통하여 PSTN 경로에 무관하게 일관된 성능을 갖음을 확인하였다.

  • PDF