• Title/Summary/Keyword: Cepstrum Analysis

Search Result 91, Processing Time 0.029 seconds

Usefulness of Cepstral Peak Prominence (CPP) in Unilateral Vocal Fold Paralysis Dysphonia Evaluation (일측성 성대마비 환자 평가에서 Cepstral Peak Prominence의 유용성)

  • Lee, Chang-Yoon;Jeong, Hee Seok;Son, Hee Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.84-88
    • /
    • 2017
  • Background and Objectives : The purpose of this study was to compare the usefulness of Cepstral peak prominence (CPP) with parameter of Multiple Dimensional Voice Program (MDVP) in evaluating unilateral vocal fold paraylsis patients with subjective voice impairment. Materials and Methods : From July 2014 to August 2016, 37 patients with unilateral vocal fold paralysis who had been diagnosed with unilateral vocal fold paralysis and had received two or more voice tests before and after the diagnosis were evaluated for maximum phonation time (MPT), MDVP and CPP. Respectively. Voice tests were performed with short vowel /a/ and paragraph reading. Results : The CPP-a (CPP with vowel /a/) and CPP-s (CPP with paragraph reading) of the Cepstrum were statistically negatively correlated with G, R, B, and A before the voice therapy. Jitter, Shimmer, and NHR of MDVP were positively correlated with G, R, B. Jitter, Shimmer, and NHR of the MDVP were significantly correlated with the Cepstrum index. G, B, A and CPP-a and CPP-s showed a statistically significant negative correlation and a somewhat higher correlation coefficient between 0.5 and 0.78. On the other hand, in MDVP index, there was a positive correlation with G and B only with Jitter of 0.4. Conclusion : CPP can be an important evaluation tool in the evaluation of speech in the unilateral vocal cord paralysis when speech energy changes or the cycle is not constant during speech.

  • PDF

Analysis of Speech Signals Depending on the Microphone and Micorphone Distance

  • Son, Jong-Mok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.4E
    • /
    • pp.41-47
    • /
    • 1998
  • Microphone is the first link in the speech recognition system. Depending on its type and mounting position, the microphone can significantly distort the spectrum and affect the performance of the speech recognition system. In this paper, characteristics of the speech signal for different microphones and microphone distances are investigated both in time and frequency domains. In the time domain analysis, the average signal-to-noise ration is measure ration is measured for the database we collected depending on the microphones and microphone distances. Mel-frequency spectral coefficients and mel-frequency cepstrum are computed to examine the spectral characteristics. Analysis results are discussed with our findings, and the result of recognition experiments is given.

  • PDF

Cepstral Analysis of the Ultrasonic Signal from the liver tissue (간 조직 초음파 신호의 cepstrum 분석)

  • Kim, Jong-Won;Kwark, Cheol-Eun;Seo, Bo-Suk;Min, Byoun-Goo
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1247-1251
    • /
    • 1987
  • Cepstral analysis was performed on the ultrasonic echo signal from the tissue to achieve improvement on the estmation of the attenuation coefficient. In this paper, the feasibility of the acquiring the structural information of the tissue was also included by same method with band pass lifter.

  • PDF

Comparison of MEL-LPC and LPC-MEL Analysis Method for the Korean Speech Recognition Systems. (한국어 음성 인식 시스템을 위한 MEL-LPC 분석 방법과 LPC-MEL 분석 방법의 비교)

  • 김주곤;김범국;정호열;정현열
    • Proceedings of the IEEK Conference
    • /
    • 2001.09a
    • /
    • pp.833-836
    • /
    • 2001
  • 본 논문에서는 한국어 음성인식 시스템의 성능 향상을 위해 청각 주파수 분해능을 가진 MEL-LPC Cepstrum을 음소단위의 HMM(Hidden Markov Model)을 기반으로 하는 인식 시스템에 적용하여 그 결과를 비교 검토하였다. 선형예측(LP) 분석 후에 후처리로서 주파수를 왜곡시킨 LPC-MEL 분석이 계산량이 적고 효과적이라 일반적으로 많이 사용되고 있으나 주파수 분해능은 많이 개선되지 않는다. 따라서 본 논문에서는 주파수 분해능을 개선하기 위해, 원 음성신호로부터 직접적으로 멜주파수로 왜곡시킨 후 선형 예측 분석을 수행하는 MEL-LPC 분석방법을 이용한 음소기반의 화자 독립 음성인식 시스템을 구성하여 기존의 LPC-MEL 분석방법과 비교실험을 통하여 MEL-LPC 분석방법의 유효성을 검토하였다. 실험에 사용한 음성 데이터베이스는 음소 및 단어 인식실험에서는 ETRI 445단어 DB, 연속 숫자음인식 실험에서는 KLE 4연속 숫자음 DB를 사용하였다. 화자 독립 음소인식 실험의 경우, 묵음을 제외한 47개의 유사 음소에 대하여 4상태 3출력의 Left-to-Right 모델을이용하였다. 단어 및 연속 숫자음 인식 실험의 경우, 유한상태 네트워크에 의한 OPDP법을 이용하였다. 화자 독립 음소, 단어 및 4연속 숫자음 인식 실험결과, 기존의 LPC-MEL Cepstrum을 사용한 경우보다 MEL-LPC Cepstum을 사용한 경우가 더 높은 인식률을 나타내어 한국어 음성인식 시스템에서 MEL-LPC 분석방법의 유효성을 확인할 수 있었다.

  • PDF

On a Processing Time Reduction of Cepstrum-Based Pitch Alteration in Time-Frequency Hybrid Domain (켑스트럼 기반 혼성영역 피치변경법의 처리시간 단축에 관한 연구)

  • Jo, Wang-Rae;Kim, Jong-Kuk;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.41-47
    • /
    • 2010
  • The pitch alteration technique for voice conversion is classified in time domain, frequency domain and hybrid domain. The Hybrid domain method has a merit of clearness and natural-ness of pitch altered speech but has the major drawback of long processing time. In this paper, we proposed a new method that can reduce the processing time of pitch alteration in time-frequency hybrid domain. We omitted the bit-reversing process of FFT and IFFT in changing the processing domain. Therefore we can reduce the processing time by 86.26% to the conventional method with same quality.

Comparison of Initial Therapeutic Effects of Voice Therapy and Injection Laryngoplasty for Unilateral Vocal Cord Paralysis Patients (일측 성대마비 환자에 대해 음성치료와 성대주입술의 초기 치료 효과 비교 연구)

  • Lee, Chang-Yoon;An, Soo-Youn;Chang, Hyun;Son, Hee Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.112-117
    • /
    • 2017
  • Background and Objectives : The purpose of this study was to classify patients with unilateral vocal fold paralysis according to their fixed location and to analysis the effects of two treatment methods by early voice therapy and injection laryngoplasty. Materials and Methods : Twenty patients who were classified as full abduction and slight abduction according to the position of paralysis were treated injection laryngoplasy, and 23 patients were treated by voice therapy. Twenty patients were treated injection laryngoplasy and 23 patients were treated voice therapy. Results were evaluated by acoustic analysis, electroglottography, cepstrum analysis before and after therapy. The voice therapy was conducted by improving the larynx movement and glottal contact, whilst removing hypertension of the supraglottic and use the breathing. Results : Significant improvement was found in the acoustic parameter, cepstrum parameter, and EGG before and after treatment in both groups. There was no significant difference between the two groups when compared before and after treatment to compare the effects of injection laryngoplasty and voice therapy. Conclusion : The initial treatments for unilateral vocal cord paralysis are injection laryngoplasty and voice therapy. however, there is no precise standard about which method should be applied first. Therefore, in this study, we tried to classify patients according to their paralysis position and then apply two methods. The results of this study suggest that voice therapy and Injection laryngoplasty at the initial stage is a very useful method to improve voice quality of vocal fold paralysis and improve laryngeal function.

  • PDF

A Signal Processing Technique for Predictive Fault Detection based on Vibration Data (진동 데이터 기반 설비고장예지를 위한 신호처리기법)

  • Song, Ye Won;Lee, Hong Seong;Park, Hoonseok;Kim, Young Jin;Jung, Jae-Yoon
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.2
    • /
    • pp.111-121
    • /
    • 2018
  • Many problems in rotating machinery such as aircraft engines, wind turbines and motors are caused by bearing defects. The abnormalities of the bearing can be detected by analyzing signal data such as vibration or noise, proper pre-processing through a few signal processing techniques is required to analyze their frequencies. In this paper, we introduce the condition monitoring method for diagnosing the failure of the rotating machines by analyzing the vibration signal of the bearing. From the collected signal data, the normal states are trained, and then normal or abnormal state data are classified based on the trained normal state. For preprocessing, a Hamming window is applied to eliminate leakage generated in this process, and the cepstrum analysis is performed to obtain the original signal of the signal data, called the formant. From the vibration data of the IMS bearing dataset, we have extracted 6 statistic indicators using the cepstral coefficients and showed that the application of the Mahalanobis distance classifier can monitor the bearing status and detect the failure in advance.

AN ACOUSTIC STUDY IN RELATION TO THE SOUND DISTORTION BY THE ALTERATION OF PALATAL PLATE -FOCUSSED ON/ㅅ(s)/. BY COMPUTER ANALYSIS- (구개상의 형태 변화가 발음에 미치는 영향에 관한 음향학적 연구 -/ㅅ/을 중심으로한 컴퓨터 분석-)

  • Choi, Chang-Kyu;Woo, Y.H.;Park, Nam-Soo
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.27 no.1
    • /
    • pp.83-102
    • /
    • 1989
  • This study was done to analyze the sound distortion, before and after insertion of the palatal palates. For this study, 4 healthy subjects (3 males and 1 female, each 24-year-old), who were born in Seoul were recruited from K university, and 3 type palatal plates were fabricated, each palatal thickness being 1.0mm, 2.5mm, dentoalveolar portion 2.5mm and elsewhere 1.0mm, named B,C,D-type repectively, and informants's sounds of /사(sa), 서(se), 소(so), 수(su), 스($s\.{+}$), 시(si)/ were recorded, without plate, and with palatal plates of different types, in succession. A series of analysis were adminstered through a 16 Bit IBM PC/AT using linear combination methods. These experiments were analyzed by the Cepstrum (Weighted and Euclidian), Log Area Ratio, Linear prediction correlation methods The findings led to the following conclusions : 1. It was confirmed that the same consonant, /ㅅ(s)/, variously distorted by the following vowel. 2. By and large, 시($s\.{+}$) was the most distorted in all conditions, and (sa), 소(so) were the least distorted in each condition. 3. There were no persistant correlation of the palatal plate types, and sound distortions of each informant were diverse with no regularities. 4. There were persistent correaltion to the Cepstrum (Weighted, Euclidian), Log Area Ratio. However, Linear prediction correlation has a different alteration pattern.

  • PDF

Development of Software For Machinery Diagnostics by Adaptive Noise Cancelling Method (1St: Cepstrum Analysis)

  • Lee, Jung-Chul;Oh, Jae-Eung;Yum, Sung-Ha
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1987.10a
    • /
    • pp.836-841
    • /
    • 1987
  • Many kinds of conditioning monitoring technique have been studied, so this study has investigated the possibility of checking the trend in the fault diagnosis of ball bearing, one of the elements of rotating machine, by applying the cepstral analysis method using the adaptive noise cancelling (ANC) method. And computer simulation is conducted in oder to identify obviously the physical meaning of ANC. The optimal adaptation gain in adaptive filter is estimated, the performance of ANC according to the change of the signal to noise ratio and convergence of LMS algorithm is considered by simulation. It is verified that cepstral analysis using ANC method is more effective than the conventional cepstral analysis method in bearing fault diagnosis.

  • PDF

A Study on Spoken Digits Analysis and Recognition (숫자음 분석과 인식에 관한 연구)

  • 김득수;황철준
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.3
    • /
    • pp.107-114
    • /
    • 2001
  • This paper describes Connected Digit Recognition with Considering Acoustic Feature in Korea. The recognition rate of connected digit is usually lower than word recognition. Therefore, speech feature parameter and acoustic feature are employed to make robust model for digit, and we could confirm the effect of Considering. Acoustic Feature throughout the experience of recognition. We used KLE 4 connected digit as database and 19 continuous distributed HMM as PLUs(Phoneme Like Units) using phonetical rules. For recognition experience, we have tested two cases. The first case, we used usual method like using Mel-Cepstrum and Regressive Coefficient for constructing phoneme model. The second case, we used expanded feature parameter and acoustic feature for constructing phoneme model. In both case, we employed OPDP(One Pass Dynamic Programming) and FSA(Finite State Automata) for recognition tests. When appling FSN for recognition, we applied various acoustic features. As the result, we could get 55.4% recognition rate for Mel-Cepstrum, and 67.4% for Mel-Cepstrum and Regressive Coefficient. Also, we could get 74.3% recognition rate for expanded feature parameter, and 75.4% for applying acoustic feature. Since, the case of applying acoustic feature got better result than former method, we could make certain that suggested method is effective for connected digit recognition in korean.

  • PDF