Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information

;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 20 Issue 8
/
Pages.24-30
/
2001
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information

음소 음향학적 변화 정보를 이용한 한국어 음성신호의 자동 음소 분할

박창목 (아주대학교 대학원 산업공학과) ;
왕지남 (아주대학교 기계 및 산업공학부)

Published : 2001.11.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This article is concerned with automatic segmentation for Korean speech signals. All kinds of transition cases of phonetic units are classified into 3 types and different strategies for each type are applied. The type 1 is the discrimination of silence, voiced-speech and unvoiced-speech. The histogram analysis of each indicators which consists of wavelet coefficients and SVF (Spectral Variation Function) in wavelet coefficients are used for type 1 segmentation. The type 2 is the discrimination of adjacent vowels. The vowel transition cases can be characterized by spectrogram. Given phonetic transcription and transition pattern spectrogram, the speech signal, having consecutive vowels, are automatically segmented by the template matching. The type 3 is the discrimination of vowel and voiced-consonants. The smoothed short-time RMS energy of Wavelet low pass component and SVF in cepstral coefficients are adopted for type 3 segmentation. The experiment is performed for 342 words utterance set. The speech data are gathered from 6 speakers. The result shows the validity of the method.

본 논문에서는 발음표기가 주어진 상황에서 음성 신호의 자동 음소 분할에 관한 것이며 음소의 경계를 음소 음향학적인 변화특성에 따라 3가지 형태로 분류하여 각각에 적합한 분할 알고리즘을 개발하였다. 형태 1은 묵음·유성음·무성음간의 분할이며 히스토그램분석으로 구한 문턱 값으로 초기 분할 후, 웨이블릿 계수의 SVF (Spectral Variation Function)를 이용하여 분할하였다. 형태 2는 연속적인 모음의 분할이며 각 모음변화특성을 템플릿으로 구성하여 분할에 활용하였다. 형태 3은 모음과 유성자음 혹은 유성화 자음의 분할이며 특성주파수대역의 진폭변화를 이용하여 후보구간을 정한 후, 캡스트럼 계수의 SVF를 이용하여 최종적인 분할을 수행하였다. 본 실험에서는 분할 성능을 테스트하기 위하여 한국어 PBWSpeech DB에서 342개의 단어를 자동으로 분할한 후, 수작업으로 분할한 결과와 비교하였다. 전체적인 자동 분할 성능은 20 msec내에서 81.5%의 분할성능을 보였다.

Keywords

References

Speech Communication v.12 no.4 Automatic segmentation and labeling of speech based on Hidden Markov Models F. Brugnara;D. Falavigna;M. Omologo
Wavelet Applications, Proc. SPIE 2242 v.2 Applying wavelet analysis to speech segmentation and classification B. T. Tan;R. Lang;H. Schroder;A. Spray;P. Dermody
Fundamentals of speech Recognition L. R. Rabiner;B. H. Juang
Speech Analysis T. Robinson
음성 신호처리 기술 v.1 한국어의 조음 및 음향 음성화 지민제
말소리 no.35;36 자동 음성분할 및 레이블링 시스템의 성능향상 홍성태;김제우;김형순

The Journal of the Acoustical Society of Korea (한국음향학회지)

Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information

음소 음향학적 변화 정보를 이용한 한국어 음성신호의 자동 음소 분할

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)