• Title/Summary/Keyword: Cepstrum Analysis

Search Result 91, Processing Time 0.028 seconds

A Study on the Phonemic Analysis for Korean Speech Segmentation (한국어 음소분리에 관한 연구)

  • Lee, Sou-Kil;Song, Jeong-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.4E
    • /
    • pp.134-139
    • /
    • 2004
  • It is generally known that accurate segmentation is very necessary for both an individual word and continuous utterances in speech recognition. It is also commonly known that techniques are now being developed to classify the voiced and the unvoiced, also classifying the plosives and the fricatives. The method for accurate recognition of the phonemes isn't yet scientifically established. Therefore, in this study we analyze the Korean language, using the classification of 'Hunminjeongeum' and contemporary phonetics, with the frequency band, Mel band and Mel Cepstrum, we extract notable features of the phonemes from Korean speech and segment speech by the unit of the phonemes to normalize them. Finally, through the analysis and verification, we intend to set up Phonemic Segmentation System that will make us able to adapt it to both an individual word and continuous utterances.

A NEW METHOD FOR NORTH-SOUTH ASYMMETRY OF SUN SPOT AREA ANALYSIS

  • Chang, Heon-Young
    • Journal of Astronomy and Space Sciences
    • /
    • v.24 no.4
    • /
    • pp.261-268
    • /
    • 2007
  • We have studied the temporal variation in the North-South asymmetry of the sunspot area during the period from 1874 to 2007. Though the 9-year periodicity is commonly reported, shorter periodicities is still under study. We employ the cepstrum analysis method to analyze the noisy power spectrum of the North-South asymmetry. We demonstrate that the cleaned power spectrum shows reduction of the spurious back-ground noise level. Some of short period peaks in the power spectrum disappear after deconvolution. It should be, however, pointed out that power spectrum might look less noisy because of a filtering process during deconvolution. We conclude by pointing out that a more sophisticate filtering algorithm is required to produce a precise and reliable periodicity estimate.

The Voice Dialing System Using Dynamic Hidden Markov Models and Lexical Analysis (DHMM과 어휘해석을 이용한 Voice dialing 시스템)

  • 최성호;이강성;김순협
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.28B no.7
    • /
    • pp.548-556
    • /
    • 1991
  • In this paper, Korean spoken continuous digits are ercognized using DHMM(Dynamic Hidden Markov Model) and lexical analysis to provide the base of developing voice dialing system. After segmentation by phoneme unit, it is recognized. This system can be divided into the segmentation section, the design of standard speech section, the recognition section, and the lexical analysis section. In the segmentation section, it is segmented using the ZCR, O order LPC cepstrum, and Ai, parameter of voice speech dectaction, which is changed according to time. In the standard speech design section, 19 phonemes or syllables are trained by DHMM and designed as a standard speech. In the recognition section, phomeme stream are recognized by the Viterbi algorithm.In the lexical decoder section, finally recognized continuous digits are outputed. This experiment shiwed the recognition rate of 85.1% using data spoken 7 times of 21 classes of 7 continuous digits which are combinated all of the occurence, spoken by 10 man.

  • PDF

Speech Modification and Concatenative Speech Synthesis by using Analysis-By-Synthesis/OverLap-Add(ABS/OLA) Sinusoidal Model (Analysis- By-Synthesis/OverLap- Add( ABS/OLA) Sinusoidal Model 을 이용한 음성변환과 연결음성합성)

  • 구자형
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.08a
    • /
    • pp.339-343
    • /
    • 1998
  • Sinusoidal model 은 음성신호처리의 넓은 분야에 적용되고 있는 방법으로 고음질의 합성음을 생성해 낼 수 있고, 조작이 용이하다는 장점을 가지고 있다. 본 논문에서는 Analysis-by-synthesis/Overlap-add Sinusoidal model 이라는 방법을 이용하여 시간축 변환과 dam성 변환을 수행하였다. 특히 본 논문에서는 음질향상을 위하여 시간축 변환시에는 정적인 구간과 변화하는 구간을 구별하여 서로 다른 시간축 변환비를 이용하였고, 기존의 LPC 방법에 비해 스펙트럼 포락선을 보다 잘 추정하는 Improved Cepstrum을 이용하여 음정변환에 적용하였다. 또 서로 다른 문맥에서 얻어진 음성단위들을 결합할 때 생기는 위상차이를 극복하기 위하여, 기본주파수 성분이 일치하도록 시간축을 이동하여 합성하였다. 실험결과 본 논문에서 적용한 방법들을 통해 기존 방식에 비해 개선된 음질을 얻을 수 있었다.

  • PDF

Analysis and parameter extraction of motion blurred image (움직임 열화 현상이 발생한 영상의 분석과 파라메터 추출)

  • 최지웅;최병철;강문기
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.10B
    • /
    • pp.1953-1962
    • /
    • 1999
  • While acquiring the image, the shaking of the image capturing equipment or the object seriously damages the image quality. This phenomenon, which degrades the clarity and the resolution of the image is called motion blur. In this paper, a newly defined function is introduced for finding the degree and the length of the motion blur. The domain of this function defined as Peak-trace domain. In The Peak-trace domain, the noise dominant region for calculating the noise variance and the signal dominant region for extracting the degree and the length of the motion blur are defined and analyzed. Using the information of the Peak-trace in the signal dominant region, we can find the direction of the motion regardless of the noise corruption. Weighted least mean square method helps extracting the Peak-trace more precisely. After getting the direction of the motion blur, we can find the length of the motion blur based on one dimensional Cepstrum. In the experiment, we could efficiently restore the degraded image using the information obtained by the proposed algorithm.

  • PDF

Voice Recognition Performance Improvement using the Convergence of Voice signal Feature and Silence Feature Normalization in Cepstrum Feature Distribution (음성 신호 특징과 셉스트럽 특징 분포에서 묵음 특징 정규화를 융합한 음성 인식 성능 향상)

  • Hwang, Jae-Cheon
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.5
    • /
    • pp.13-17
    • /
    • 2017
  • Existing Speech feature extracting method in speech Signal, there are incorrect recognition rates due to incorrect speech which is not clear threshold value. In this article, the modeling method for improving speech recognition performance that combines the feature extraction for speech and silence characteristics normalized to the non-speech. The proposed method is minimized the noise affect, and speech recognition model are convergence of speech signal feature extraction to each speech frame and the silence feature normalization. Also, this method create the original speech signal with energy spectrum similar to entropy, therefore speech noise effects are to receive less of the noise. the performance values are improved in signal to noise ration by the silence feature normalization. We fixed speech and non speech classification standard value in cepstrum For th Performance analysis of the method presented in this paper is showed by comparing the results with CHMM HMM, the recognition rate was improved 2.7%p in the speech dependent and advanced 0.7%p in the speech independent.

Phoneme Segmentation in Consideration of Speech feature in Korean Speech Recognition (한국어 음성인식에서 음성의 특성을 고려한 음소 경계 검출)

  • 서영완;송점동;이정현
    • Journal of Internet Computing and Services
    • /
    • v.2 no.1
    • /
    • pp.31-38
    • /
    • 2001
  • Speech database built of phonemes is significant in the studies of speech recognition, speech synthesis and analysis, Phoneme, consist of voiced sounds and unvoiced ones, Though there are many feature differences in voiced and unvoiced sounds, the traditional algorithms for detecting the boundary between phonemes do not reflect on them and determine the boundary between phonemes by comparing parameters of current frame with those of previous frame in time domain, In this paper, we propose the assort algorithm, which is based on a block and reflecting upon the feature differences between voiced and unvoiced sounds for phoneme segmentation, The assort algorithm uses the distance measure based upon MFCC(Mel-Frequency Cepstrum Coefficient) as a comparing spectrum measure, and uses the energy, zero crossing rate, spectral energy ratio, the formant frequency to separate voiced sounds from unvoiced sounds, N, the result of out experiment, the proposed system showed about 79 percents precision subject to the 3 or 4 syllables isolated words, and improved about 8 percents in the precision over the existing phonemes segmentation system.

  • PDF

An Accuracy Improvement Method on Acoustic Source Localization Using Ground Reflection Effect (지면반사효과를 이용한 폭발 소음원의 위치 추정 정밀도 향상법)

  • Go, Yeong-Ju;Choi, Donghun;Lee, Jaehyung;Choi, Jong-Soo;Ha, Jae-Hyoun;Na, Taeheum
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.26 no.1
    • /
    • pp.69-74
    • /
    • 2016
  • A technique for improving estimation accuracy is introduced in order to locate the impact position of artillery shell during the weapon scoring test. Study on localization of impacts using acoustic measurement has been conducted and the usability of sensor array is verified with experiments. When the blast occurs above the ground in the firing range, the acoustic sensor above the ground can measure the directly propagated sound with the ground-reflected one. In this study, a method for reducing estimation error by using the reflection signal measurements based on the time difference of arrival method. Considering the reflection sound works as same as placing a virtual sensor symmetrically through the ground. This idea enables a virtual three-dimensional array configuration with a two-dimensional plane array above the ground as such. The time difference between the direct and the reflected propagations can be estimated using cepstrum analysis. Performance test has been made in the simulation experiment in the football size area.

A Study on the Thickness Measurement of Thin Film by Ultrasonic Wave (초음파(超音波)를 이용(利用)한 박막(薄膜)두께 측정(測定)에 관(關)한 연구(硏究))

  • Han, Eung-Kyo;Lee, Jae-Joon;Kim, Jae-Yeol
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.7 no.2
    • /
    • pp.27-34
    • /
    • 1988
  • Recently, it is gradually raised necessity that thickness of thin film is measured accurately and managed in industrial circles and medical world. In this study, regarding to the thickness of film which is in opaque object and is beyond distance resolution capacity, thickness measurement was done by MEM-cepstrum analysis of received ultrasonic wave. In measurement results, film thickness which is beyond distance resolution capacity was measured accurately. And within thickness range that don't exist interference, thickness measurement by MEM-ceptrum analysis was impossible.

  • PDF

A Study on the Diagnosis of Laryngeal Diseases by Acoustic Signal Analysis (음향신호의 분석에 의한 후두질환의 진단에 관한 연구)

  • Jo, Cheol-Woo;Yang, Byong-Gon;Wang, Soo-Geon
    • Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.151-165
    • /
    • 1999
  • This paper describes a series of researches to diagnose vocal diseases using the statistical method and the acoustic signal analysis method. Speech materials are collected at the hospital. Using the pathological database, the basic parameters for the diagnosis are obtained. Based on the statistical characteristics of the parameters, valid parameters are chosen and those are used to diagnose the pathological speech signal. Cepstrum is used to extract parameters which represents characteristics of pathological speech. 3 layered neural network is used to train and classify pathological speech into normal, benign and malignant case.

  • PDF