• Title/Abstract/Keywords: phoneme

457 results (processing time: 0.026 s)

음소길이를 고려한 3-State Hidden Markov Model 에 의한 한국어 음소인식 (Korean Phoneme Recognition Using duration-dependent 3-State Hidden Markov Model)

  • 유현창;이희정;박병철
    • 한국음향학회지 / Vol. 8, No. 1 / pp. 81-87 / 1989
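  • This paper describes an effective method for building Korean phoneme models with Markov models and for performing recognition with them. A hidden Markov model can effectively model the inherent nonstationarity of speech signals. To represent the sequence of changing characteristics within a phoneme, namely the transition-steady-transition pattern, we propose a 3-state phoneme model. We also show that phoneme duration is an important factor affecting recognition performance, and that the recognition rate can be improved by using a duration-dependent 3-state hidden Markov model.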

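As a rough illustration of the 3-state (transition-steady-transition) phoneme model described in this abstract, the sketch below scores an observation sequence with the forward algorithm on a left-to-right HMM. All probabilities and the two-symbol observation alphabet are hypothetical, and the duration dependence studied in the paper is not modeled here.

```python
def forward_likelihood(A, B, pi, obs):
    """Forward algorithm: P(obs | model) for a discrete-observation HMM.

    A[i][j]: transition probability from state i to state j
    B[i][o]: probability of emitting symbol o in state i
    pi[i]:   initial state probability
    """
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[i] * A[i][j] for i in range(n)) * B[j][o]
            for j in range(n)
        ]
    return sum(alpha)

# Hypothetical left-to-right model: onset transition -> steady part -> offset transition
A = [
    [0.6, 0.4, 0.0],
    [0.0, 0.7, 0.3],
    [0.0, 0.0, 1.0],
]
B = [
    [0.8, 0.2],
    [0.3, 0.7],
    [0.6, 0.4],
]
pi = [1.0, 0.0, 0.0]

likelihood = forward_likelihood(A, B, pi, [0, 1])
```

In a recognizer of this kind, one such model would typically be trained per phoneme and the phoneme whose model gives the highest likelihood would be chosen.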

악리론으로 본 정음창제와 정음소 분절 알고리즘 (Ortho-phonic Alphabet Creation by the Musical Theory and its Segmental Algorithm)

  • 진용옥;안정근
    • 음성과학 / Vol. 8, No. 2 / pp. 49-59 / 2001
  • Phoneme segmentation is a very difficult problem in speech processing because a segmentation algorithm has to cope with many kinds of allophones and coarticulation. As a result, systems for speech recognition and voice retrieval have a complex structure. To address this, we discuss the possibility of a new segmentation algorithm based on subtracting or adding one third in tripartitioning (삼분손익) of the twelve-tone temperament (12 율려), first proposed by Prof. T. S. Han. It is close to both oriental and western musical theory. He has also suggested 3 consonant and 3 vowel phonemes in Hunminjungum (훈민정음), invented by King Sejong in the 15th century. In this paper, we propose naming this unit the ortho-phonic phoneme (OPP/정음소), which carries the meaning of 'absoluteness and independence'. OPP is also applicable to other languages, for example IPA. Finally, we find that this algorithm is consistently applicable to languages worldwide and is very useful for building voice recognition and retrieval systems.

Effects of Inter-phoneme Probabilities on the Acceptability Judgment of Korean CVC Nonwords

  • Lee, Yong-Eun
    • 음성과학 / Vol. 14, No. 4 / pp. 41-52 / 2007
  • Recent experimental studies have shown that language users' knowledge of the statistical characteristics of their native language plays a key role in their task performance. One specific instance of this, on which the current study focuses, is the effect of phonotactic probabilities on speakers' wordlikeness judgments of nonwords. In this paper, I explore whether Korean-speaking subjects' judgments of the wordlikeness of Korean nonsense words are influenced by the degree of association between two-phoneme sequences in Korean. The current results suggest that the objective measure of correlation (expressed by $r_{\phi}$ values) between an onset consonant and a vowel inside Korean syllables plays an important role in Korean speakers' nonword processing. The results additionally indicate an effect of the correlations of two-phoneme sequences consisting of vowels and coda consonants on nonword processing. Implications of these findings for Korean speakers' learning of the correlations between adjacent segments inside the syllable are discussed.

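The $r_{\phi}$ value mentioned in this abstract is conventionally the phi coefficient of a 2x2 contingency table. The sketch below computes it for one onset-vowel pair; the counts are hypothetical and are not taken from the paper, and the paper's exact counting procedure may differ.

```python
import math

def phi_coefficient(n11, n10, n01, n00):
    """Phi coefficient of a 2x2 contingency table.

    n11: syllables containing both the onset and the vowel
    n10: syllables with the onset but a different vowel
    n01: syllables with the vowel but a different onset
    n00: syllables with neither
    """
    row1, row0 = n11 + n10, n01 + n00
    col1, col0 = n11 + n01, n10 + n00
    denom = math.sqrt(row1 * row0 * col1 * col0)
    if denom == 0:
        # Undefined when any margin is empty; 0.0 is returned here by convention.
        return 0.0
    return (n11 * n00 - n10 * n01) / denom

# Hypothetical counts, for illustration only
r_phi = phi_coefficient(30, 10, 20, 40)
```

A positive value means the two segments co-occur more often than chance would predict, and a negative value means they co-occur less often.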

음소기반 인식 네트워크에서의 단어 검출률을 이용한 문장거부 (Sentence Rejection using Word Spotting Ratio in the Phoneme-based Recognition Network)

  • 김형태;하진영
    • 대한음성학회: Conference Proceedings / Proceedings of the 대한음성학회 2005 Spring Conference / pp. 99-102 / 2005
  • Research efforts have been made on out-of-vocabulary word rejection to improve the confidence of speech recognition systems. However, little attention has been paid to the rejection of non-recognition sentences. With the emergence of pronunciation correction systems that use speech recognition technology, it has become necessary to reject non-recognition sentences in order to provide users with more accurate and robust results. In this paper, we introduce a standard phoneme-based sentence rejection system that needs no special filler models. Instead, we use the word spotting ratio to determine whether an input sentence is accepted or rejected. Experimental results show that comparable performance, in terms of the average of FRR and FAR, can be achieved using only a standard phoneme-based recognition network.

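The accept/reject decision described in this abstract can be sketched as a threshold on a word spotting ratio, evaluated by the false rejection rate (FRR) and false acceptance rate (FAR). The ratio definition, the threshold value, and all function names below are assumptions for illustration; the abstract does not give the paper's exact formulation.

```python
def word_spotting_ratio(expected_words, spotted_words):
    """Fraction of the expected words that were spotted (assumed definition)."""
    if not expected_words:
        return 0.0
    spotted = set(spotted_words)
    hits = sum(1 for w in expected_words if w in spotted)
    return hits / len(expected_words)

def accept_sentence(expected_words, spotted_words, threshold=0.5):
    """Accept the input when enough expected words were spotted.

    The threshold of 0.5 is a hypothetical value.
    """
    return word_spotting_ratio(expected_words, spotted_words) >= threshold

def frr_far(trials):
    """Compute (FRR, FAR) from (is_target_sentence, was_accepted) pairs."""
    targets = [accepted for is_target, accepted in trials if is_target]
    others = [accepted for is_target, accepted in trials if not is_target]
    frr = sum(1 for accepted in targets if not accepted) / len(targets)
    far = sum(1 for accepted in others if accepted) / len(others)
    return frr, far

expected = ["i", "like", "green", "tea"]
ratio = word_spotting_ratio(expected, ["i", "like", "tea"])
```

Raising the threshold trades a lower FAR for a higher FRR, which is why the average of the two is a convenient single figure for comparison.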

제한된 한국어 연속음성에 나타난 음소인식에 관한 연구 (A Study on the Phoneme Recognition in the Restricted Continuously Spoken Korean)

  • 심성룡;김선일;이행세
    • 전자공학회논문지B / Vol. 32B, No. 12 / pp. 1635-1643 / 1995
  • This paper proposes an algorithm for machine recognition of phonemes in continuously spoken Korean. The proposed algorithm is a static strategy neural network. At the neuron training stage, the algorithm uses features such as the zero-crossing rate, short-term energy, and either PARCOR or auditory-like perceptual linear prediction (PLP), but not both, covering a span of 171 ms. Numerical results show that the algorithm with PLP achieves a frame-based phoneme recognition rate of approximately 99% in small-vocabulary recognition experiments. Based on this, it is concluded that the proposed algorithm with PLP analysis is effective for phoneme recognition.

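Two of the features named in this abstract, the zero-crossing rate and short-term energy, have simple textbook definitions. The sketch below computes them for a single frame of samples; the frame values are hypothetical, and the paper's framing and normalization details are not stated in the abstract.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)

def short_term_energy(frame):
    """Mean squared amplitude of the frame."""
    if not frame:
        return 0.0
    return sum(x * x for x in frame) / len(frame)

# Hypothetical frame that changes sign at every sample
frame = [0.5, -0.5, 0.5, -0.5, 0.5]
zcr = zero_crossing_rate(frame)
energy = short_term_energy(frame)
```

Voiced sounds tend to show high energy and a low zero-crossing rate, while unvoiced fricatives tend to show the opposite, which is why the two features are often used together.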

Selective Adaptation of Speaker Characteristics within a Subcluster Neural Network

  • Haskey, S.J.;Datta, S.
    • 대한음성학회: Conference Proceedings / Proceedings of the 대한음성학회 October 1996 Conference / pp. 464-467 / 1996
  • This paper aims to exploit inter/intra-speaker phoneme sub-class variations as criteria for adaptation in a phoneme recognition system based on a novel neural network architecture. The design is a subcluster neural network built from One-Class-in-One-Network (OCON) feed-forward subnets, similar to those proposed by Kung (2) and Jou (1), joined by a common front-end layer. The idea is to adapt only the neurons within the common front-end layer of the network, so that the adaptation concentrates primarily on the speaker's vocal characteristics. Since the adaptation occurs in an area common to all classes, convergence on a single class will improve the recognition of the remaining classes in the network. Results show that adaptation towards one phoneme in the vowel sub-class, for speakers MDABO and MWBTO, improves the recognition of the remaining vowel sub-class phonemes from the same speaker.

음소경계 정보를 이용한 한국어 숫자음 인식에 관한 연구 (A Study on Korean Digit Recognition by Using Phoneme Boundary Information)

  • 최관묵;임동철;이행세
    • 한국음향학회: Conference Proceedings / Proceedings of the 한국음향학회 2001 Fall Conference, Vol. 20, No. 2 / pp. 117-120 / 2001
  • The recognition rate of Korean digits is lower than that of other words because the digits are composed of similar phonemes. In this paper, a new method is proposed that improves the recognition rate by using phoneme boundary information. The proposed method adds little cost because the phoneme boundaries are found with a simple method. We experimented with speech data from one male speaker and obtained an improved recognition rate.

신경회로망 이용한 한국어 음소 인식 (Korean Phoneme Recognition Using Neural Networks)

  • 김동국;정차균;정홍
    • 대한전기학회논문지 / Vol. 40, No. 4 / pp. 360-373 / 1991
  • Since the 1970s, efficient speech recognition methods such as HMM and DTW have been introduced, primarily for speaker-dependent isolated words. These methods, however, have faced difficulties in recognizing continuous speech. Since the early 1980s, there has been a growing awareness that neural networks might be more appropriate for English and Japanese phoneme recognition. Korean phoneme recognition, which has dealt with only part of the vowel or consonant set, still remains at an elementary level. In this light, we develop a neural-network-based system that can recognize the major Korean phonemes. Through experiments using two neural networks, SOFM and TDNN, we obtained remarkable results. In particular, with TDNN the recognition rate was about 93.78% for training data and 89.83% for test data.

음성인식 시스템에서의 음소분할기의 성능 (Performance of the Phoneme Segmenter in Speech Recognition System)

  • 이광석
    • 한국정보통신학회: Conference Proceedings / Proceedings of the 한국해양정보통신학회 2009 Fall Conference / pp. 705-708 / 2009
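  • This study describes a neural-network-based phoneme segmenter for the recognition of natural speech. The inputs to the segmenter are a 16th-order mel-scaled FFT, the normalized frame energy, and the ratio of the energy in the 0-3 kHz band to the energy above that band. All features are differences between two consecutive 10 ms frames. The segmenter is a single-hidden-layer perceptron with 72 inputs, 20 hidden nodes, and one output node. On natural speech, the phoneme segmentation accuracy was 78% with 7.8% insertions.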

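The network shape given in this abstract (72 inputs, 20 hidden nodes, one output node) can be sketched as a plain forward pass. The sigmoid activation, the random initial weights, and the function names are assumptions; the abstract does not state the activation function or the training procedure.

```python
import math
import random

N_INPUTS, N_HIDDEN = 72, 20  # sizes taken from the abstract

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def init_params(seed=0):
    """Small random weights (hypothetical initialization)."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-0.1, 0.1) for _ in range(N_INPUTS)] for _ in range(N_HIDDEN)]
    b1 = [0.0] * N_HIDDEN
    w2 = [rng.uniform(-0.1, 0.1) for _ in range(N_HIDDEN)]
    b2 = 0.0
    return w1, b1, w2, b2

def boundary_score(features, params):
    """Forward pass: 72 frame-difference features -> score in (0, 1)."""
    w1, b1, w2, b2 = params
    hidden = [
        sigmoid(sum(w * x for w, x in zip(row, features)) + b)
        for row, b in zip(w1, b1)
    ]
    return sigmoid(sum(w * h for w, h in zip(w2, hidden)) + b2)

params = init_params()
score = boundary_score([0.0] * N_INPUTS, params)
```

After training, a score above some chosen threshold would be read as a phoneme boundary between the two frames whose differences form the input.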

한글 음절 분류를 통한 입 모양 궤적 생성 (Mouth Shape Trajectory Generation Using Hangul Phoneme Analysis)

  • 박유신;김종수;김태용;최종수
    • 대한전자공학회: Conference Proceedings / Proceedings of the 대한전자공학회 Signal Processing Society 2003 Fall Conference / pp. 53-56 / 2003
  • In this paper, we propose a new method that generates the mouth shape trajectory for characters input by the user. It is based on a basic syllable chosen for each character and is suited to mouth shape generation. We draw on the principles behind the creation of the Korean script, find similarities among mouth shapes, and select basic syllables accordingly. We also consider the articulation of each phoneme, create a new mouth shape trajectory, and apply it to the face of a 3D avatar.