• Title/Summary/Keyword: phoneme

A Study on the Analysis and Recognition of Korean Speech Signal using the Phoneme (음소를 이용한 한국어 음성 신호의 분석과 인식에 관한 연구)

  • Kim Y. I.; Hwang Y. S.; Youn D. H.; Cha I. W.
    • The Journal of the Acoustical Society of Korea / v.8 no.5 / pp.70-77 / 1989
  • In this paper, Korean speech recognition using phonemes is studied. The experiment is carried out by dividing 545 isolated words into phonemes. Using linear prediction coefficients, the recognition rates of consonants, vowels, and end-consonants are 87.3%, 91.0%, and 91.7%, respectively. The recognition rate of isolated words formed by combining the phonemes is 71.4%. The Itakura-Saito distortion measure is used for phoneme segmentation and phoneme recognition.
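
As a rough illustration of the distortion measure named in the abstract above, the sketch below computes the Itakura-Saito distortion between two sampled power spectra and picks the closest reference template. The abstract does not give the paper's LPC-based formulation, so the discrete spectral form, the function names, and the template dictionary are assumptions made for illustration only.

```python
import math


def itakura_saito(p, q):
    """Itakura-Saito distortion between two power spectra of equal length.

    Uses the discrete form mean(p/q - log(p/q) - 1). All bins must be
    strictly positive. The measure is not symmetric in p and q.
    """
    if len(p) != len(q):
        raise ValueError("spectra must have the same number of bins")
    total = 0.0
    for pk, qk in zip(p, q):
        ratio = pk / qk
        total += ratio - math.log(ratio) - 1.0
    return total / len(p)


def nearest_template(spectrum, templates):
    """Return the label whose (hypothetical) template spectrum is closest."""
    return min(templates, key=lambda label: itakura_saito(spectrum, templates[label]))
```

The distortion is zero only when the two spectra are identical, and swapping the arguments generally changes the value, which is why the order of test spectrum and reference spectrum matters.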

Large Scale Voice Dialling using Speaker Adaptation (화자 적응을 이용한 대용량 음성 다이얼링)

  • Kim, Weon-Goo
    • Journal of Institute of Control, Robotics and Systems / v.16 no.4 / pp.335-338 / 2010
  • A new method that improves the performance of a large-scale voice dialling system using speaker adaptation is presented. Since an SI (speaker-independent) speech recognition system based on phoneme HMMs uses only the phoneme string of the input sentence, the storage space can be reduced greatly. However, the performance of the system is worse than that of a speaker-dependent system due to the mismatch between the input utterance and the SI models. A new method that estimates the phonetic string and adaptation vectors iteratively is presented to reduce the mismatch between the training utterances and a set of SI models using speaker adaptation techniques. For speaker adaptation, stochastic matching methods are used to estimate the adaptation vectors. Experiments performed over actual telephone lines show that the proposed method performs better than the conventional method with the SI phonetic recognizer.
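
The alternating estimation described in the abstract above can be pictured with a heavily simplified stand-in. The sketch below is not the paper's algorithm: it replaces HMM decoding with a nearest-mean assignment per frame, assumes equal variances, and estimates a single additive bias vector. The function name, the `means` list, and the convergence tolerance are all hypothetical.

```python
def estimate_bias(frames, means, iterations=10, tol=1e-9):
    """Alternate between labeling frames and re-estimating one bias vector.

    frames: list of feature vectors (lists of floats) from one utterance.
    means:  list of model mean vectors (a toy stand-in for SI models).
    Returns (bias, labels), where labels[i] indexes into means.
    """
    dim = len(frames[0])
    bias = [0.0] * dim
    labels = []

    def sq_dist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    for _ in range(iterations):
        # Step 1: label each frame with the nearest mean after removing the bias.
        labels = []
        for x in frames:
            compensated = [xi - bi for xi, bi in zip(x, bias)]
            best = min(range(len(means)), key=lambda k: sq_dist(compensated, means[k]))
            labels.append(best)
        # Step 2: with equal variances, the maximum-likelihood bias is the
        # average residual between each frame and its assigned mean.
        new_bias = [
            sum(x[d] - means[k][d] for x, k in zip(frames, labels)) / len(frames)
            for d in range(dim)
        ]
        converged = all(abs(n - b) < tol for n, b in zip(new_bias, bias))
        bias = new_bias
        if converged:
            break
    return bias, labels
```

In a real system the labeling step would be a decoding pass over phoneme HMMs and the bias update would be weighted by state variances; the toy version only shows why the two estimates are refined together.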

Development of a Phoneme and Tone Labeling Program (음소 및 성조 레이블링 프로그램 개발)

  • Lee, Yun-Kyung; Kwak, Chul; Kwon, Oh-Wook
    • Proceedings of the KIEE Conference / 2007.10a / pp.435-436 / 2007
  • Although previous speech analysis programs usually provide speech analysis and phoneme labeling functionalities, they require much time for manual labeling and support only the English alphabet. To solve these problems, we develop a new Windows-based program with an improved phoneme and tone labeling method as well as the conventional speech analysis functionalities. The developed program has the unique feature of semi-automatic phoneme and tone labeling based on hidden Markov models.

The Usage of Phoneme Duration Information for Rejecting Garbage Sentences (소음문장 제거를 위한 음소지속시간 사용)

  • Koo Myoung-Wan; Kim Ho-Kyoung; Park Sung-Joon; Kim Jae-In
    • Proceedings of the KSPS conference / 2003.05a / pp.219-222 / 2003
  • In this paper, we study the usage of phoneme duration information for rejecting garbage sentences. First, we build a phoneme duration model in a speech recognition system based on decision-tree state tying. We assume that phone duration has a Gamma distribution. Next, we build a verification module in which a word-level confidence measure is used. Finally, we make a comparative study of phoneme duration with a speech DB obtained from the live system. This DB consists of OOT (out-of-task) and ING (in-grammar) utterances. The usage of phone duration information improves the OOT recognition rate by 46% and reduces the error rate by another 8.4% when combined with the utterance verification module.
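
To make the Gamma duration assumption in the abstract above concrete, here is a minimal sketch that fits a Gamma distribution to observed durations and scores a new duration. The abstract does not say how the parameters are estimated, so the method-of-moments fit, the function names, and the duration units are assumptions.

```python
import math


def fit_gamma_moments(durations):
    """Method-of-moments Gamma fit: returns (shape, scale).

    shape = mean^2 / variance, scale = variance / mean.
    """
    n = len(durations)
    mean = sum(durations) / n
    var = sum((d - mean) ** 2 for d in durations) / n
    if var <= 0.0:
        raise ValueError("durations must not all be identical")
    return mean * mean / var, var / mean


def gamma_log_pdf(x, shape, scale):
    """Log density of Gamma(shape, scale) at x; -inf for non-positive x."""
    if x <= 0.0:
        return float("-inf")
    return (
        (shape - 1.0) * math.log(x)
        - x / scale
        - math.lgamma(shape)
        - shape * math.log(scale)
    )
```

A duration that is far from what the fitted model expects gets a much lower log density, which is the kind of evidence a rejection module could combine with a confidence measure.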

Character Recognition in Video Using Projection (비디오에서 프로젝션을 이용한 문자 인식)

  • Baek, Jeong-Uk; Shin, Seong-Yoon; Rhee, Yang-Won
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2009.10a / pp.196-197 / 2009
  • In video, key frames generated by scene change detection are used for character recognition through projections. Characters in the text are separated from one another by a vertical projection. The phonemes of each character are separated into Cho-sung (initial), Jung-sung (medial), and Jong-sung (final) and are divided into 6 types. Each phoneme pattern is assigned to the appropriate one of the 6 types through the horizontal projection. Phonemes are projected in the horizontal, vertical, diagonal, and reverse-diagonal directions. Phonemes are recognized using the 4-direction projections and location information.
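
The projection step in the abstract above can be sketched as follows, assuming a binarized frame stored as a list of rows with 1 for text pixels and 0 for background. The function names and the gap-based splitting rule are illustrative assumptions; the abstract does not specify thresholds or the diagonal projections, which are omitted here.

```python
def vertical_projection(image):
    """Count foreground pixels in each column of a binary image."""
    return [sum(column) for column in zip(*image)]


def horizontal_projection(image):
    """Count foreground pixels in each row of a binary image."""
    return [sum(row) for row in image]


def split_on_gaps(profile):
    """Return (start, end) pairs, end exclusive, for runs of nonzero values."""
    segments = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(profile)))
    return segments
```

Splitting the vertical profile on empty columns yields candidate character boundaries; applying the same split to the horizontal profile of one character gives row bands that can help tell stacked components apart.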

A longitudinal study on the development of English phonological awareness in preschool children (어린이집 유아의 영어 음운 인식 발달 종단 연구)

  • Chung, Hyunsong
    • Phonetics and Speech Sciences / v.10 no.4 / pp.53-66 / 2018
  • This study investigated the development of English phonological awareness in preschool children based on a longitudinal study. It carried out a phonological matching task, mispronunciation task, articulation test, explicit phoneme awareness task, rhyme matching task, and initial-phoneme matching task for three-, four-, and five-year-old children. A letter knowledge test was also added to the tests for the five-year-old children. The results revealed that the development of phonological awareness follows a progression of syllable, then onset and rhyme, then phoneme. It was also revealed that language skills such as vocabulary, detection of mispronunciations, and articulation were partially related to the development of phoneme awareness. Finally, we also found that letter knowledge partially affected the children's development of phonological awareness.

A Study of the Speaking-Centered Chinese Pronunciation Teaching Method for Basic Chinese Learners (초급 중국어 학습자를 위한 발음교육 개선방안 - 말하기 중심 발음 교수법 -)

  • Lim, Seung Kyu
    • Cross-Cultural Studies / v.35 / pp.339-368 / 2014
  • In teaching Chinese as a foreign language, phoneme-based pronunciation teaching covering tones, consonants, and vowels is the most common teaching method. Based on the main characteristic of Chinese grammar, the 'lack of morphological change' in a narrow sense proposed by Lv Shuxiang and Zhu Dexi, I designed a 'communication-oriented Chinese pronunciation teaching method'. This teaching method is composed of seven elements: the 'structural elements' (phoneme, word, phrase, and sentence) and the 'functional elements' (listening, speaking, and translation). It has four kinds of practice methods: 1) phoneme learning; 2) word-based pronunciation practice; 3) phrase-based pronunciation practice; 4) sentence-based pronunciation practice. When teachers use these practice methods, they can use dialogue and Korean-Chinese translation. In particular, when teachers use the phoneme learning method, they must use the results of Korean-Chinese phonetic comparison. When teachers correct learners' errors, they must first consider spoken communication.

A Study on the Pitch Contour Generator with Neural Network in the Isolated Words (신경망을 이용한 고립단어에서의 피치변화곡선 발생기에 관한 연구)

  • Lim Unchun; Kwak Jingu; Chang Sokwang
    • Proceedings of the KSPS conference / 1996.02a / pp.137-155 / 1996
  • The purpose of this paper is to generate a pitch contour, which is affected by the phonetic environment and the number of syllables in each Korean isolated word, using a neural network. To do this, we analyzed a set of 513 Korean isolated words consisting of 1-4 syllables and extracted the pitch contour and the duration of each phoneme in all the words. The total number of phonemes analyzed is about 3,800. We then approximated the pitch contour of each phoneme with a first-order polynomial by regression analysis, which gave the slope, the initial pitch, and the duration of each phoneme. We used these 3 parameters as the target pattern of the neural network and let the network learn the rule of variation of pitch and duration, which is affected by the phonetic environment of each phoneme. We used strings of 7 consecutive phonemes as the input pattern so that the network could learn the effect of the phonetic environment around the center phoneme. In the learning phase, we used 3,545 items (463 words) as target patterns, each containing the phonetic environment of the 3 preceding and 3 following phonemes, and the neural network showed correctness rates of 98.43%, 98.59%, and 97.7% in the estimation of the duration, the slope, and the initial pitch, respectively. In the recall phase, we tested the performance of the neural network with 251 items (50 words) that were not used as learning data and obtained good correctness rates of 97.34%, 95.45%, and 96.3% in the generation of the duration, the slope, and the initial pitch of each phoneme, respectively.
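
The first-order approximation described in the abstract above reduces each phoneme's pitch contour to three numbers. A minimal sketch of that reduction, assuming pitch samples with timestamps inside one phoneme, is given below; the function name, the units, and the choice to measure time from the phoneme's first sample are assumptions, not details from the paper.

```python
def fit_pitch_line(times, pitches):
    """Least-squares line through (time, pitch) samples of one phoneme.

    Returns (slope, initial_pitch, duration), where initial_pitch is the
    fitted value at the first sample time and duration is the time span
    covered by the samples.
    """
    n = len(times)
    if n < 2 or n != len(pitches):
        raise ValueError("need at least two samples and matching lengths")
    rel = [t - times[0] for t in times]  # time measured from phoneme onset
    mean_t = sum(rel) / n
    mean_p = sum(pitches) / n
    sxx = sum((t - mean_t) ** 2 for t in rel)
    if sxx == 0.0:
        raise ValueError("sample times must not all be identical")
    sxy = sum((t - mean_t) * (p - mean_p) for t, p in zip(rel, pitches))
    slope = sxy / sxx
    initial_pitch = mean_p - slope * mean_t
    duration = times[-1] - times[0]
    return slope, initial_pitch, duration
```

For example, samples at 0.10, 0.11, 0.12, and 0.13 s with pitches of 120, 122, 124, and 126 Hz lie on a line rising at about 200 Hz/s from 120 Hz over 0.03 s.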

Cross-language Transfer of Phonological Awareness and Its Relations with Reading and Writing in Korean and English (음운인식의 언어 간 전이와 한글 및 영어의 읽기 쓰기와의 관계)

  • Kim, Sangmi; Cho, Jeung-Ryeul; Kim, Ji-Youn
    • Korean Journal of Cognitive Science / v.26 no.2 / pp.125-146 / 2015
  • This study investigated the contribution of Korean phonological awareness to English phonological awareness and the relations of phonological awareness with reading and writing in Korean Hangul and English among Korean 5th graders. With age and vocabulary knowledge statistically controlled, Korean phonological awareness was transferred to English phonological awareness. Specifically, syllable and phoneme awareness in Korean transferred to syllable awareness in English, and Korean phoneme awareness transferred to English phoneme awareness. In addition, English phoneme awareness independently explained significant variance of reading and writing in Korean and English after controlling for age and vocabulary. Syllable awareness in Korean and English explained Hangul reading and writing, respectively. The results suggest cross-language transfer of phonological awareness, which is a metalinguistic skill. Phoneme awareness is important in reading and writing in English, whereas both syllable and phoneme awareness are important in Korean literacy.

On-Line Korean Character Recognition by the Stroke Information of Korean Phoneme in Multimedia Terminal (한글 자소의 획 정보에 의한 멀티미디어 단말기에서의 온라인 한글 문자 인식)

  • Oh Juntaek; Jung Momoon; Lee Woobeom; Kim Wookhyun
    • Journal of the Institute of Convergence Signal Processing / v.1 no.1 / pp.64-73 / 2000
  • Korean character recognition technology for the user interface of a multimedia terminal requires fast processing and a high recognition rate. In this paper, we propose a phoneme and character recognition technology that uses characteristic information of Korean and features of input strokes, i.e., feature points, feature vectors, virtual vectors, and positional relations between strokes. To recognize both phonemes and characters across users' various writing styles, a Korean database is used. The Korean database has been constructed from the characteristic information of Korean and phoneme models that carry various stroke information. We also use successive processing based on the positional relations between strokes, and backtracking processing that modifies the number of strokes composing each phoneme. This method reduces the complex processing of phoneme separation. The proposed on-line Korean character recognition system obtained an average character processing time of 13 ms and a correct recognition rate of more than 95% in a recognition experiment in which we tested 600 characters written by 10 people among 1,200 words.