• Title/Summary/Keyword: LPC cepstrum coefficients

Search Result 33, Processing Time 0.025 seconds

Speaker Recognition using LPC cepstrum Coefficients and Neural Network (LPC 켑스트럼 계수와 신경회로망을 사용한 화자인식)

  • Choi, Jae-Seung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.12
    • /
    • pp.2521-2526
    • /
    • 2011
  • This paper proposes a speaker recognition algorithm using a perceptron neural network and LPC (Linear Predictive Coding) cepstrum coefficients. The proposed algorithm first detects the voiced sections at each frame. Then, the LPC cepstrum coefficients which have speaker characteristics are obtained by the linear predictive analysis for the detected voiced sections. To classify the obtained LPC cepstrum coefficients, a neural network is trained using the LPC cepstrum coefficients. In this experiment, the performance of the proposed algorithm was evaluated using the speech recognition rates based on the LPC cepstrum coefficients and the neural network.

Comparison of Characteristic Vector of Speech for Gender Recognition of Male and Female (남녀 성별인식을 위한 음성 특징벡터의 비교)

  • Jeong, Byeong-Goo;Choi, Jae-Seung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.7
    • /
    • pp.1370-1376
    • /
    • 2012
  • This paper proposes a gender recognition algorithm which classifies a male or female speaker. In this paper, characteristic vectors for the male and female speaker are analyzed, and recognition experiments for the proposed gender recognition by a neural network are performed using these characteristic vectors for the male and female. Input characteristic vectors of the proposed neural network are 10 LPC (Linear Predictive Coding) cepstrum coefficients, 12 LPC cepstrum coefficients, 12 FFT (Fast Fourier Transform) cepstrum coefficients and 1 RMS (Root Mean Square), and 12 LPC cepstrum coefficients and 8 FFT spectrum. The proposed neural network trained by 20-20-2 network are especially used in this experiment, using 12 LPC cepstrum coefficients and 8 FFT spectrum. From the experiment results, the average recognition rates obtained by the gender recognition algorithm is 99.8% for the male speaker and 96.5% for the female speaker.

A Study on Speech Recognition using Vocal Tract Area Function (성도 면적 함수를 이용한 음성 인식에 관한 연구)

  • 송제혁;김동준
    • Journal of Biomedical Engineering Research
    • /
    • v.16 no.3
    • /
    • pp.345-352
    • /
    • 1995
  • The LPC cepstrum coefficients, which are an acoustic features of speech signal, have been widely used as the feature parameter for various speech recognition systems and showed good performance. The vocal tract area function is a kind of articulatory feature, which is related with the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. The linear predictive analysis using Burg algorithm and the vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are executed using the conventional LPC cepstrum coefficients and the vocal tract area function. The recognitions using the area function showed the slightly better results than those using the conventional LPC cepstrum coefficients.

  • PDF

EMG signal identification using LPC cepstrum coefficients (LPC cepstrum 계수를 이용한 근전도 신호의 동작판별)

  • Chung, T.Y.;Park, S.H.;Kim, H.R.;Wang, M.S.;Choi, Y.H.;Byun, Y.S.
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.738-741
    • /
    • 1988
  • In this paper, we deal with the movements identification of EMG signals by LPC cepstrum coefficients. Movements were identified by extration of characteristics of similar patterns in Euclid distance measurement method for EMG signals generated by voluntary contractions of subject's musculature. As number of coefficients is larger, we obtain the better rate of movements identification. By exact extraction of signals and decision of optimal coefficient, it is expected that these results will apply to prosthesis control in real-time.

  • PDF

A Study on Function Recognition of EMG Signal Using LPC Cepstrum Coefficients (LPC 켑스트럼 계수를 이용한 EMG 신호의 기능 인식에 관한 연구)

  • Wang, Sung-Moon;Chung, Tae-Yun;Choi, Yun-Ho;Byun, Youn-Shik;Park, Sang-Hui
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.27 no.2
    • /
    • pp.126-134
    • /
    • 1990
  • In this study, eight function discrimination and recognition of the EMG signal from the biceps and triceps of 4 subjects were executed, using the Euclidean and weighted cepstral distance measure with LPC cepstrum coefficients. In case of Euclidean cepstral distance measure, as the number of LPC cepstrum coefficients was increased in 8, 10, 12, 14 the recognition rates of functions are 94.69, 95.63, 96.56, and 96.88[%], respectively, but increasing rates of recognition were inclined to decrease. In case of weighted cepstral distance measure, when the number of LPC cepstrum coefficients was 8, 10, 12 and 14, the recognition rates of functions were 91.88, 95, 99.69, and 96.63[%], respectively.

  • PDF

Performance Analysis of Speech Parameters and a New Decision Logic for Speaker Recognition (화자인식을 위한 음성 요소들의 성능분석 및 새로운 판단 논리)

  • Lee, Hyuk-Jae;Lee, Byeong-Gi
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.7
    • /
    • pp.146-156
    • /
    • 1989
  • This paper discusses how to choose speech parameters and decision logics to improve the performance of speaker recognition systems. It also considers the influence of the reference patterns on the speaker recognition. It is observed from the performance analysis based on LPSs, PARCOR coefficients and LPC-cepstrum coefficients that LPC-cepstrum coefficients are superior to the others in speaker recognition without regard to the reference patterns. In order to improve the recognition performance, a new decision logic is proposed based on a generalized-distance concept. It differs from the existing methods in that it considers the statistics of customer and impostors at the same time. It turns out from a speaker verification test that the proposed decision logic ferforms better than the existing ones.

  • PDF

A Study on the CEPSTRUM Method for the Function Classification of EMG Signal (EMG 신호의 기능 분류에 적용되는 CEPSTRUM 기법에 관한 연구)

  • Wang, Moon-Sung;Byun, Yoon-Shik;Park, Sang-Hui
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1992 no.11
    • /
    • pp.79-82
    • /
    • 1992
  • Under the assumption that the EMG signal was used as the reference signal for driving a prosthetic arm, function discrimination of EMG signal from the biceps and triceps of subject was achived with LPC CEPSTRUM coefficients. By varying the number of coefficients, the types of windows, window size, and window overlaping rates, the best conditions for the function discrimination of EMG signal were obtained.

  • PDF

Voice personality transformation using an orthogonal vector space conversion (직교 벡터 공간 변환을 이용한 음성 개성 변환)

  • Lee, Ki-Seung;Park, Kun-Jong;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.1
    • /
    • pp.96-107
    • /
    • 1996
  • A voice personality transformation algorithm using orthogonal vector space conversion is proposed in this paper. Voice personality transformation is the process of changing one person's acoustic features (source) to those of another person (target). In this paper, personality transformation is achieved by changing the LPC cepstrum coefficients, excitation spectrum and pitch contour. An orthogonal vector space conversion technique is proposed to transform the LPC cepstrum coefficients. The LPC cepstrum transformation is implemented by principle component decomposition by applying the Karhunen-Loeve transformation and minimum mean-square error coordinate transformation(MSECT). Additionally, we propose a pitch contour modification method to transform the prosodic characteristics of any speaker. To do this, reference pitch patterns for source and target speaker are firstly built up, and speaker's one. The experimental results show the effectiveness of the proposed algorithm in both subjective and objective evaluations.

  • PDF

Spectrum Representation Based on LPC Cepstral VQ for Low Bit Rate CELP Coder (LPC Cepstral 벡터 양자화에 의한 저 전송율 CELP 음성부호기의 스펙트럼 표기)

  • 정재호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.4
    • /
    • pp.761-771
    • /
    • 1994
  • This paper focuses on how spectrum information can be represented efficiently in a very low bit rate CELP speech coder. To achieve the goal, an LPC cepstral coefficients VQ scheme representing the spectrum information in a CELP coder is proposed. To represent the spectrum information using LPC cepstrums, three different cepstral distance measures having different spectral meanings in the frequency domain are considered, and their performances are compared and analyzed. The experimental results show that spectrum information in low bit rate CELP coders can be represented very efficiently using the proposed LPC cepstral vector quantization scheme.

  • PDF

Speaker-dependent Speech Recognition Algorithm for Male and Female Classification (남녀성별 분류를 위한 화자종속 음성인식 알고리즘)

  • Choi, Jae-Seung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.4
    • /
    • pp.775-780
    • /
    • 2013
  • This paper proposes a speaker-dependent speech recognition algorithm which can classify the gender for male and female speakers in white noise and car noise, using a neural network. The proposed speech recognition algorithm is trained by the neural network to recognize the gender for male and female speakers, using LPC (Linear Predictive Coding) cepstrum coefficients. In the experiment results, the maximal improvement of total speech recognition rate is 96% for white noise and 88% for car noise, respectively, after trained a total of six neural networks. Finally, the proposed speech recognition algorithm is compared with the results of a conventional speech recognition algorithm in the background noisy environment.