• Title/Abstract/Keywords: Phonetic segmentation


Automatic Word Spacing for Korean Using CRFs with Korean Features (한국어 특성과 CRFs를 이용한 자동 띄어쓰기 시스템)

  • 이현우;차정원
    • 대한음성학회지:말소리 / No. 65 / pp. 125-141 / 2008
  • In this work, we propose an automatic word spacing system for Korean that uses conditional random fields (CRFs) with Korean-specific features. We cast word spacing as a classification problem. We first build a baseline system that uses CRFs with Eumjeol (syllable) bigrams and analyze its results on an inner test. We then extend the baseline with Korean features: Josa (particles), Eomi (endings), and the two head Eumjeols of words extracted from a lexicon. Experimental results show that the proposed method outperforms previous methods. In addition, because the model is very small, the proposed method can be used in mobile and speech applications.

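The classification framing in the abstract above (one spacing decision per Eumjeol, with Eumjeol bigrams as features) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, feature keys, and boundary markers are hypothetical, and no CRF is trained here. The sketch only shows how a spaced sentence might be turned into per-syllable labels and bigram features that a sequence classifier could consume.

```python
# Illustrative sketch only; names and feature keys are assumptions, not from the paper.

def to_labeled_syllables(spaced_text):
    """Split a correctly spaced sentence into syllables and spacing labels.

    Label 1 means "a space follows this syllable", label 0 means "no space".
    """
    syllables, labels = [], []
    for ch in spaced_text.strip():
        if ch == " ":
            if labels:
                labels[-1] = 1  # mark the syllable just before the space
        else:
            syllables.append(ch)
            labels.append(0)
    return syllables, labels


def bigram_features(syllables, i):
    """Eumjeol unigram and bigram features around position i."""
    prev_s = syllables[i - 1] if i > 0 else "<BOS>"
    next_s = syllables[i + 1] if i + 1 < len(syllables) else "<EOS>"
    cur = syllables[i]
    return {"cur": cur, "prev+cur": prev_s + cur, "cur+next": cur + next_s}


syllables, labels = to_labeled_syllables("나는 학교에 간다")
features = [bigram_features(syllables, i) for i in range(len(syllables))]
```

At prediction time, the same features would be computed on unspaced input and the classifier would output the 0/1 label for each syllable.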

Corpus Based Unrestricted vocabulary Mandarin TTS (코퍼스 기반 무제한 단어 중국어 TTS)

  • ;하주홍;김병창;이근배
    • 대한음성학회:학술대회논문집 / 대한음성학회 2003년도 10월 학술대회지 / pp. 175-179 / 2003
  • To produce high-quality (intelligible and natural) synthesized speech, accurate grapheme-to-phoneme conversion and an accurate prosody model are essential. In this paper, we analyze Chinese text using word segmentation, POS tagging, and unknown-word recognition. We present a grapheme-to-phoneme conversion that combines dictionary-based and rule-based methods. We construct a prosody model using a probabilistic method together with decision-tree-based error correction. Based on the results of this analysis, we can successfully select and concatenate the appropriate syllable synthesis units from the Chinese Synthesis DB.


A Computation Study of Prosodic Structures of Korean for Speech Recognition and Synthesis: Predicting Phonological Boundaries (음성인식.합성을 위한 한국어 운율단위 음운론의 계산적 연구:음운단위에 따른 경계의 발견)

  • 이찬도
    • 한국정보처리학회논문지 / Vol. 4, No. 1 / pp. 280-287 / 1997
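  • Building a successful speech recognition and synthesis system requires phonological knowledge, and prosodic information in particular. This study first surveys research trends in prosodic phonology for speech recognition and synthesis, then summarizes theoretical and experimental work on establishing the phonological units of Korean and their boundaries. To detect the boundaries between phonological units automatically, we collected data, implemented a system, and ran experiments. A simple recurrent network trained on about 12,000 phonological words from about 2,200 sentences, with no external information at all, achieved a prediction rate of about 70%. Combined with other sources of information, the method used in this study is expected to enable accurate detection of phonological boundaries and the segmentation that follows from them.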


Design of a variable rate speech codec for the W-CDMA system (W-CDMA 시스템을 위한 가변율 음성코덱 설계)

  • 정우성
    • 한국음향학회:학술대회논문집 / 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호) / pp. 142-147 / 1998
  • Recently, the 8 kb/s CS-ACELP coder G.729 was standardized by ITU-T SG15, and its speech quality has been reported to be better than or equal to that of 32 kb/s ADPCM. However, G.729 is a fixed rate speech coder and does not take into account voice activity in two-way conversation. By exploiting voice activity, the average bit rate can be cut in half without degrading speech quality. In this paper, we propose an efficient variable rate algorithm for G.729. The variable rate algorithm consists of two main subjects, the rate determination algorithm and algorithm, we combine the energy-thresholding method, the phonetic segmentation method based on integrating various feature parameters obtained through the analysis procedure, and the variable hangover period method. Through an analysis of noise features, a 1 kb/s sub-rate coder is designed for coding the background noise signal. We also design a 4 kb/s sub-rate coder for the unvoiced parts. The performance of the variable rate algorithm is evaluated by comparing its speech quality and average bit rate with those of G.729. Subjective quality is also assessed with an MOS test. In conclusion, we verify that the proposed variable rate CS-ACELP coder produces the same speech quality as G.729 at an average bit rate of 4.4 kb/s.

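The energy-thresholding and hangover ideas named in the abstract above can be illustrated with a minimal sketch. It is a simplification under stated assumptions, not the paper's algorithm: the threshold and hangover length are made-up values, the hangover is fixed rather than variable, and the phonetic segmentation step and the 4 kb/s unvoiced rate are omitted. Only the 8 kb/s and 1 kb/s rates from the abstract are used.

```python
# Illustrative sketch only; threshold and hangover values are assumptions.

def determine_rates(frame_energies, threshold=1.0, hangover_frames=2):
    """Pick a bit rate (kb/s) per frame from its energy.

    Frames at or above the threshold are coded at the full rate (8 kb/s).
    After speech ends, the full rate is held for `hangover_frames` more
    frames so that weak word endings are not clipped. Remaining frames
    are treated as background noise and coded at 1 kb/s.
    """
    rates = []
    hangover = 0
    for energy in frame_energies:
        if energy >= threshold:
            hangover = hangover_frames  # re-arm the hangover counter
            rates.append(8)
        elif hangover > 0:
            hangover -= 1               # still inside the hangover period
            rates.append(8)
        else:
            rates.append(1)             # background noise
    return rates


rates = determine_rates([0.1, 2.0, 3.0, 0.5, 0.4, 0.3, 0.2])
average_rate = sum(rates) / len(rates)
```

For this toy input the two low-energy frames right after the speech burst stay at the full rate because of the hangover, and only the later frames drop to the noise rate.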