• Title/Summary/Keyword: Pronunciation modeling

Search Result 25, Processing Time 0.03 seconds

Chinese Pronunciation Correction System for Korean learners (한국인을 위한 중국어 발음 교정 시스템)

  • Kim, Hyo-Sook;Kim, Sun-Ju;Kang, Hyo-Won;Kim, Mu-Jung;Ha, Jin-Young
    • Proceedings of the KSPS conference
    • /
    • 2005.04a
    • /
    • pp.45-48
    • /
    • 2005
  • This study is about constructing L2 pronunciation correction system for L1 speakers using speech technology. Chinese pronunciation system consists of initials, finals and tones. Initials/finals are in segmental level and tones are in suprasegmental level. So different method could be used assessing Korean users' Chinese. The recognition rate of initials is 81.9% and that of finals is 68.7% in the standard acoustic model. Differ from native speech recognition, nonnative speech recognition could be promoted by additional modeling using L2 speakers' speech. As a first step for the those task we analysed nonnative speech and then set a strategy for modeling Korean speakers'.

  • PDF

Statistical Analysis of Korean Phonological Rules Using a Automatic Phonetic Transcription (발음열 자동 변환을 이용한 한국어 음운 변화 규칙의 통계적 분석)

  • Lee Kyong-Nim;Chung Minhwa
    • Proceedings of the KSPS conference
    • /
    • 2002.11a
    • /
    • pp.81-85
    • /
    • 2002
  • We present a statistical analysis of Korean phonological variations using automatic generation of phonetic transcription. We have constructed the automatic generation system of Korean pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived from knowledge-based morphophonological analysis and government standard pronunciation rules. This system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS(Phonetic Balanced Sentence) Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonants. These statistics can be used for improving the performance of speech recognition systems.

  • PDF

A Study on Phonetic Value - Transcription Look-Up Table Generation for Postprocessing of Voice Recognition (음성인식 후처리를 위한 음가-표기 변환표 생성에 관한 연구)

  • 김경징;최영규;이상범
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.5
    • /
    • pp.585-594
    • /
    • 2002
  • This paper, describes about creation and implementation of phonetic value- transcription conversion table for postprocessing of the voice recognition. Transcription set generator, which produces transcription set that is pronounced as recognized phonetic value, is designed and implemented to postprocess for the voice recognition system which recognizes syllable unit phonetic value Phonetic value-transcription conversion table is produced with transcription-phonetic value conversion table produced by modeling standard pronunciation on petrinet. To show that phonetic value-transcription conversion table produces correct transcription set, transcription set generator is designed and implemented. This paper proves that correct transcription set is produced, which is including pre-vocalization transcription as a result of experimenting standard pronunciation examples and the words randomly sampled from pronunciation dictionary.

  • PDF

Development of English Speech Recognizer for Pronunciation Evaluation (발성 평가를 위한 영어 음성인식기의 개발)

  • Park Jeon Gue;Lee June-Jo;Kim Young-Chang;Hur Yongsoo;Rhee Seok-Chae;Lee Jong-Hyun
    • Proceedings of the KSPS conference
    • /
    • 2003.10a
    • /
    • pp.37-40
    • /
    • 2003
  • This paper presents the preliminary result of the automatic pronunciation scoring for non-native English speakers, and shows the developmental process for an English speech recognizer for the educational and evaluational purposes. The proposed speech recognizer, featuring two refined acoustic model sets, implements the noise-robust data compensation, phonetic alignment, highly reliable rejection, key-word and phrase detection, easy-to-use language modeling toolkit, etc., The developed speech recognizer achieves 0.725 as the average correlation between the human raters and the machine scores, based on the speech database YOUTH for training and K-SEC for test.

  • PDF

Performance Evaluation of English word Pronunciation Correction system (한국인을 위한 영어 발음 교정 시스템에 대한 성능 평가)

  • Kim Mujung;Kim Hyosook;Kim Byunggi
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.71-74
    • /
    • 2003
  • In this paper, we present some of experimental results developed in computer-based English Pronunciation Correction System for Korean speakers. The aim of the system is to detect incorrectly pronounced phonemes in spoken words and to give correction comment to users. Speech data were collected from 254 native speakers and 411 Koreans, then used for phoneme modeling and test. We built two types of acoustic phoneme models: native speaker model and Korean speaker model. We also built langugage models to reflect Koreans' commonly occurred mispronunications. The detection rate was over 90% in insertion/deletion/replacement of phonemes, but we got under 75% detection rate in diphthong split and accents.

  • PDF

Palatal obturator restoration of a cleft palate patient with velopharyngeal insufficiency: a clinical report (구개인두 기능부전을 갖는 구개열 환자에서 폐쇄장치를 이용한 보철 치료 증례)

  • Heo, Yu-Ri;Kim, Jong-Wook;Lee, Gyeong-Je;Chung, Chae-Heon
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.51 no.4
    • /
    • pp.353-360
    • /
    • 2013
  • Cleft lip and palate is congenital deformity in oral and maxillofacial area. Normal soft palate has velopharyngeal closure action by connecting oral cavity and nasal cavity at rest and moving upward at swallowing and specific pronunciation. Cleft palate patients with velopharyngeal insufficiency have difficulty in mastication, swallowing and pronunciation because velopharyngeal closure is incomplete. At this time, a prosthetic device used to cover palate defects is called a palatal obturator. A palatal obturator separates oral cavity and nasal cavity and recovers pronunciation, mastication, swallowing and esthetic function. The purpose of this case study is to report the results because it reaches a satisfactory result in functional and esthetic aspects through functional impression procedures using modeling compound and tissue conditioner for restoration of a cleft palate patient with velopharyngeal insufficiency.

Stochastic Pronunciation Lexicon Modeling for Large Vocabulary Continous Speech Recognition (확률 발음사전을 이용한 대어휘 연속음성인식)

  • Yun, Seong-Jin;Choi, Hwan-Jin;Oh, Yung-Hwan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.49-57
    • /
    • 1997
  • In this paper, we propose the stochastic pronunciation lexicon model for large vocabulary continuous speech recognition system. We can regard stochastic lexicon as HMM. This HMM is a stochastic finite state automata consisting of a Markov chain of subword states and each subword state in the baseform has a probability distribution of subword units. In this method, an acoustic representation of a word can be derived automatically from sample sentence utterances and subword unit models. Additionally, the stochastic lexicon is further optimized to the subword model and recognizer. From the experimental result on 3000 word continuous speech recognition, the proposed method reduces word error rate by 23.6% and sentence error rate by 10% compare to methods based on standard phonetic representations of words.

  • PDF

A Study on Creation of Hangeu-Romanization Conversion Table Using Petri-Nets (페트리넷을 이용한 한글-로마자 표기 변환표 생성에 관한 연구)

  • Kim, Kyung-Jing;Choi, Young-Kyoo;Rhee, Sang-Burm
    • The KIPS Transactions:PartB
    • /
    • v.9B no.6
    • /
    • pp.827-834
    • /
    • 2002
  • In this paper, we proposed the formation of Korean-Roman alphabet notation conversion table for the generation of Korean-Roman alphabet notation that also meets revised Roman alphabet notation. Introduced a mathematical analyzing method of the natural language which used a petrinet model so that a base of Roman alphabet notation analyzed standard pronunciation and Roman alphabet notation to work mathematically. It display the practical example through a petrinet modeling of a plan and Roman alphabet notation to create a Korean Roman alphabet notation conversion table with the method of the analysis that used a petrinet model, and present a mathematical modeling plan and application method of Korean. We developed application program based on window in order to verify a created Korean-Roman alphabet notation conversion table, and compared the result of an application program with Roman alphabet notation of an Roman alphabet notation example dictionary.

Statistical Analysis of Korean Phonological Variations Using a Grapheme-to-phoneme System (발음열 자동 생성기를 이용한 한국어 음운 변화 현상의 통계적 분석)

  • 이경님;정민화
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.7
    • /
    • pp.656-664
    • /
    • 2002
  • We present a statistical analysis of Korean phonological variations using a Grapheme-to-Phoneme (GPT) system. The GTP system used for experiments generates pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived form morphophonological analysis and government standard pronunciation rules. The GTP system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonant's, These statistics can be used for improving the performance of speech recognition systems.

Computer-Based Fluency Evaluation of English Speaking Tests for Koreans (한국인을 위한 영어 말하기 시험의 컴퓨터 기반 유창성 평가)

  • Jang, Byeong-Yong;Kwon, Oh-Wook
    • Phonetics and Speech Sciences
    • /
    • v.6 no.2
    • /
    • pp.9-20
    • /
    • 2014
  • In this paper, we propose an automatic fluency evaluation algorithm for English speaking tests. In the proposed algorithm, acoustic features are extracted from an input spoken utterance and then fluency score is computed by using support vector regression (SVR). We estimate the parameters of feature modeling and SVR using the speech signals and the corresponding scores by human raters. From the correlation analysis results, it is shown that speech rate, articulation rate, and mean length of runs are best for fluency evaluation. Experimental results show that the correlation between the human score and the SVR score is 0.87 for 3 speaking tests, which suggests the possibility of the proposed algorithm as a secondary fluency evaluation tool.