A Segmentation Algorithm of the Connected Word Speech by Statistical Method

Cho, Jeong-Ho;Hong, Jae-Keun;Kim, Soo-Joong;

Journal of the Korean Institute of Telematics and Electronics (대한전자공학회논문지)

Volume 26 Issue 4
/
Pages.151-163
/
1989
/
1016-135X(pISSN)

The Institute of Electronics and Information Engineers (대한전자공학회)

A Segmentation Algorithm of the Connected Word Speech by Statistical Method

統計的인 方法에 依한 連結音의 音素分割 알고리듬

Cho, Jeong-Ho (Dept. of Elec. Eng., Kyungbook Nat'l Univ.) ;
Hong, Jae-Keun (Dept. of Elec. Eng., Kyungbook Nat'l Univ.) ;
Kim, Soo-Joong (Dept. of Elec. Eng., Kyungbook Nat'l Univ.)

조정호 (慶北大學校電子工學科) ;
홍재근 (慶北大學校電子工學科) ;
김수중 (慶北大學校電子工學科)

Published : 1989.04.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

A statistical approach for the segmentation of speed signals is described in this paper. The main idea of this algorithm is the use of three AR models. Two fixed models are identified at the stationary parts of the signal before and after the spectral change. Changes are detected when the distance between these two models is high. Another model is located between two fixed models and is used to estimate spectral change time. This segmentation algorithm has been tested with connected words and compared to classical methods. The results showed that it can provide more accurate locations of boundaries of segments and can reduce the amount of oversegmentation.

本論文에서는 音聲信號의 音素分割을 위한 統計的인 方法을 硏究하였다. 이 方法은 3個의 AR 모델을 使用하여, 이 中 2個의 모델은 音聲의 스펙트럼 變化前 및 變化後의 安定된 部分에서 求해지고 이들 間의 距離가 커지면 音素가 바뀐 것으로 간주된다. 다른 한 모델은 두 固定 모델 사이에 位置하며 音素間의 境界를 推定하는데 使用된다. 이 音素分割 알고리듬을 連結音에 對해 試驗해 본 結果, 從來의 方法에 비해 音素의 境界點을 좀더 正確히 찾을 수 있고, 또한 過度分割誤謬도 줄일 수 있었다.

Journal of the Korean Institute of Telematics and Electronics (대한전자공학회논문지)

A Segmentation Algorithm of the Connected Word Speech by Statistical Method

統計的인 方法에 依한 連結音의 音素分割 알고리듬

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)