• 제목/요약/키워드: 음소

검색결과 529건 처리시간 0.024초

Facial Animation Generation by Korean Text Input (한글 문자 입력에 따른 얼굴 에니메이션)

  • Kim, Tae-Eun;Park, You-Shin
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • 제4권2호
    • /
    • pp.116-122
    • /
    • 2009
  • In this paper, we propose a new method which generates the trajectory of the mouth shape for the characters by the user inputs. It is based on the character at a basis syllable and can be suitable to the mouth shape generation. In this paper, we understand the principle of the Korean language creation and find the similarity for the form of the mouth shape and select it as a basic syllable. We also consider the articulation of this phoneme for it and create a new mouth shape trajectory and apply at face of an 3D avatar.

  • PDF

Textbook vocabulary analysis for Korean phonics program of 1st and 2nd graders (한글 파닉스 교육을 위한 초등 1-2학년 교과서 어휘 자소분석)

  • Lee, Daeun;Kim, Hyeji;Shin, Gayoung;Seol, Ahyoung;Pae, Soyeong;Kim, Mibae
    • Annual Conference on Human and Language Technology
    • /
    • 한국정보과학회언어공학연구회 2016년도 제28회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.226-230
    • /
    • 2016
  • 본 연구는 초등 저학년 읽기부진아동을 위한 한글 파닉스 교육의 기반을 확립하고자 1-2학년 교과서 고빈도 어절 531개를 기반으로 자소 및 음운규칙을 분석하였다. 연구결과, 자소-음소 일치 어절을 기반으로 하였을 때 초성에서 50번 이상 나타난 자소는 /ㄱ/, /ㄹ/, /ㄴ/, /ㅅ/, /ㅎ/, /ㅈ/이다. 중성에서 50번 이상 나타난 자소는 /ㅏ/, /ㅣ/, /ㅗ/, /ㅡ/, /ㅜ/이다. 종성에서 50번 이상 나타난 자소는 /ㄹ/, /ㄴ/, /ㅇ/이다. 자소와 음소가 불일치 된 어절을 기반으로 하였을 때 가장 많이 출현하는 음운규칙은 연음화 규칙이었다. 본 연구결과를 바탕으로 교과서를 기반으로 한 한글 파닉스 교육에 유용하게 사용될 수 있을 것이다.

  • PDF

In Out-of Vocabulary Rejection Algorithm by Measure of Normalized improvement using Optimization of Gaussian Model Confidence (미등록어 거절 알고리즘에서 가우시안 모델 최적화를 이용한 신뢰도 정규화 향상)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of the Korea Society of Computer and Information
    • /
    • 제15권12호
    • /
    • pp.125-132
    • /
    • 2010
  • In vocabulary recognition has unseen tri-phone appeared when recognition training. This system has not been created beginning estimation figure of model parameter. It's bad points could not be created that model for phoneme data. Therefore it's could not be secured accuracy of Gaussian model. To improve suggested Gaussian model to optimized method of model parameter using probability distribution. To improved of confidence that Gaussian model to optimized of probability distribution to offer by accuracy and to support searching of phoneme data. This paper suggested system performance comparison as a result of recognition improve represent 1.7% by out-of vocabulary rejection algorithm using normalization confidence.

A Study on Performance Evaluation of Hidden Markov Network Speech Recognition System (Hidden Markov Network 음성인식 시스템의 성능평가에 관한 연구)

  • 오세진;김광동;노덕규;위석오;송민규;정현열
    • Journal of the Institute of Convergence Signal Processing
    • /
    • 제4권4호
    • /
    • pp.30-39
    • /
    • 2003
  • In this paper, we carried out the performance evaluation of HM-Net(Hidden Markov Network) speech recognition system for Korean speech databases. We adopted to construct acoustic models using the HM-Nets modified by HMMs(Hidden Markov Models), which are widely used as the statistical modeling methods. HM-Nets are carried out the state splitting for contextual and temporal domain by PDT-SSS(Phonetic Decision Tree-based Successive State Splitting) algorithm, which is modified the original SSS algorithm. Especially it adopted the phonetic decision tree to effectively express the context information not appear in training speech data on contextual domain state splitting. In case of temporal domain state splitting, to effectively represent information of each phoneme maintenance in the state splitting is carried out, and then the optimal model network of triphone types are constructed by in the parameter. Speech recognition was performed using the one-pass Viterbi beam search algorithm with phone-pair/word-pair grammar for phoneme/word recognition, respectively and using the multi-pass search algorithm with n-gram language models for sentence recognition. The tree-structured lexicon was used in order to decrease the number of nodes by sharing the same prefixes among words. In this paper, the performance evaluation of HM-Net speech recognition system is carried out for various recognition conditions. Through the experiments, we verified that it has very superior recognition performance compared with the previous introduced recognition system.

  • PDF

The Structure of Korean Consonants as Perceived by the Japanese (일본인이 지각하는 한국어 자음의 구조)

  • Bae, Moon-Jung;Kim, Jung-Oh
    • Korean Journal of Cognitive Science
    • /
    • 제19권2호
    • /
    • pp.163-175
    • /
    • 2008
  • Twelve Japanese students living in South Korea have been examined for their perceptual identification of an initial consonant in Korean syllables with or without a white noise. A confusion matrix was then subject to analyses of additive clustering, individual difference scaling, and probability of information transmission, the results of which were also compared to those of South Koreans. The Japanese in the present experiment confused /다/and/타/ most frequently, followed by /가/ and /카/, /자, 차, 짜/, /타/ and /따/, and so on. The results of additive clustering analysis of the Japanese significantly differed from those of the South Koreans. Individual difference scaling revealed dimensions of sonorant, aspiration and coronal. While South Koreans showed binary values on aspiration and tenseness dimensions, the Japanese did continuous values on such dimensions. An information transmission probability analysis revealed that the Japanese participants could not perceive very well such larynx features as tenseness and aspiration compared to the South Korean participants. The former group, however, perceived very well place of articulation features such as labial and coronal. The present results suggest that an approach dealing with structures of base representations is important in understanding the phonological categories of languages.

  • PDF

Improvement of Naturalness for a HMM-based Korean TTS using the prosodic boundary information (운율경계정보를 이용한 HMM기반 한국어 TTS 자연성 향상 연구)

  • Lim, Gi-Jeong;Lee, Jung-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • 제17권9호
    • /
    • pp.75-84
    • /
    • 2012
  • HMM-based Text-to-Speech systems generally utilize context dependent tri-phone units from a large corpus speech DB to enhance the synthetic speech. To downsize a large corpus speech DB, acoustically similar tri-phone units are clustered based on the decision tree using context dependent information. Context dependent information includes phoneme sequence as well as prosodic information because the naturalness of synthetic speech highly depends on the prosody such as pause, intonation pattern, and segmental duration. However, if the prosodic information was complicated, many context dependent phonemes would have no examples in the training data, and clustering would provide a smoothed feature which will generate unnatural synthetic speech. In this paper, instead of complicate prosodic information we propose a simple three prosodic boundary types and decision tree questions that use rising tone, falling tone, and monotonic tone to improve naturalness. Experimental results show that our proposed method can improve naturalness of a HMM-based Korean TTS and get high MOS in the perception test.

Knowledge based Text to Facial Sequence Image System for Interaction of Lecturer and Learner in Cyber Universities (가상대학에서 교수자와 학습자간 상호작용을 위한 지식기반형 문자-얼굴동영상 변환 시스템)

  • Kim, Hyoung-Geun;Park, Chul-Ha
    • The KIPS Transactions:PartB
    • /
    • 제15B권3호
    • /
    • pp.179-188
    • /
    • 2008
  • In this paper, knowledge based text to facial sequence image system for interaction of lecturer and learner in cyber universities is studied. The system is defined by the synthesis of facial sequence image which is synchronized the lip according to the text information based on grammatical characteristic of hangul. For the implementation of the system, the transformation method that the text information is transformed into the phoneme code, the deformation rules of mouse shape which can be changed according to the code of phonemes, and the synthesis method of facial sequence image by using deformation rules of mouse shape are proposed. In the proposed method, all syllables of hangul are represented 10 principal mouse shape and 78 compound mouse shape according to the pronunciation characteristics of the basic consonants and vowels, and the characteristics of the articulation rules, respectively. To synthesize the real time facial sequence image able to realize the PC, the 88 mouth shape stored data base are used without the synthesis of mouse shape in each frame. To verify the validity of the proposed method the various synthesis of facial sequence image transformed from the text information is accomplished, and the system that can be applied the PC is implemented using the proposed method.

A study on the duration of Korean fricatives /s, s'/ and factors that Influence their duration (한국어 마찰음 /ㅅ,ㅆ/의 지속시간에 영향을 미치는 요인에 관한 연구)

  • Song YoonGyoung
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 한국음향학회 1999년도 학술발표대회 논문집 제18권 2호
    • /
    • pp.333-336
    • /
    • 1999
  • 본 연구의 목적은 한국어 마찰음 /ㅅ, ㅆ/가 지속시간에 있어서 유의미한 차이를 가지고 있는가를 관찰하고, 나아가 지속시간에 영향을 미치는 요인에 어떠한 것이 있는가를 기술하는 데에 있다. 이러한 결과는 음성합성을 위한 기초자료로 이용될 수 있을 것이다. 분석 결과, /ㅅ/보다 /ㅆ/가 더 긴 지속시간을 가졌으며 마찰음을 선행하는 음소의 성질, 단어에서 마찰음이 가지는 음절 위치, 그리고 마찰음 앞에서 끊어읽기가 이루어졌는가의 여부가 지속시간에 영향을 미치는 요인으로 작용하였다.

  • PDF

An Experimental Field Trial of Stock Information Retrieval System Based on Speech Recognition (음성인식기술을 이용한 증권정보 안내 시스템의 실험적 실용시험)

  • 도삼주
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.241-244
    • /
    • 1994
  • 이 논문은 대어휘, 화자독립 음성인식 시스템인 KT-STOCK과 이 시스템에 대한 전화망을 통한 실험적 실용시험에 대해 기술하였다. KT-STOCK은 현재 주식시장에 상장된 712개 회사의 현재주가를 음성을 이용하여 검색할 수 있는 시스템이다. 이 시스템은 hidden markov model 기술에 기반을 둔 고립단어 인식 시스템이며 유사음소를 기본 인식단위로 사용한다. KT-STOCK은 1994년 6월 24일부터 실험적 실용시험 중에 있다. 중간 결과에 따르면 모의 실험 결과는 실제 환경에서의 시험과 차이가 있는 거승로 나타났다. 실제 환경에서 이 시스템의 인식률은 현재 61.9%이다.

  • PDF

Analysis of Speech Signals by linear prediction and It's Application (선형 예측법에 의한 음성신호의 분석과 그 응용 방안)

  • 김명규
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • 제18권4호
    • /
    • pp.27-33
    • /
    • 1981
  • In this paper, the effect of tone variation of speech signals is discussedty showing the variations of the linear prediction model spectra and the estimated vocal tract shape for Korean vowels. As an application of the analysis results a speech spenthesis scheme by combination of phonemes is also discussed based on experimental results.

  • PDF