• Title/Abstract/Keywords: phonemes

226 search results (processing time: 0.018 s)

A Study on the Consonant Classification Using Fuzzy Inference

  • 박경식
    • Acoustical Society of Korea: Conference Proceedings / Proceedings of the 1992 Acoustical Society of Korea Conference, Vol. 11, No. 1 / pp. 71-75 / 1992
  • This paper proposes an algorithm for classifying Korean consonant phonemes such as plosives, fricatives, and affricates into lax, glottalized, and aspirated sounds. This three-way distinction is a characteristic feature of Korean that does not exist in languages such as English. The work addresses the classification of 14 Korean consonants (k, t, p, s, c, k', t', p', s', c', kh, ph, ch) as a preliminary stage for Korean phone recognition. LPC cepstral analysis provides the feature sets for classification. The experiments proceed in two stages: first, consonant segments are detected in the original speech signal using short-time speech signal analysis and the Mahalanobis distance; then the consonants are classified by fuzzy inference. In computer simulations, the classification rate on the speech data reached 93.75%.

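The Mahalanobis-distance step named in the abstract above can be illustrated with a minimal sketch. This is not the authors' detector: the two-dimensional feature vectors, the class names, and the means and covariances below are all hypothetical, chosen only to show how a frame is assigned to the class with the smallest Mahalanobis distance.

```python
import math

def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance of a 2-D point from a class with the given mean and 2x2 covariance."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    # Inverse of a 2x2 matrix, written out explicitly.
    inv = ((d / det, -b / det), (-c / det, a / det))
    dx = (x[0] - mean[0], x[1] - mean[1])
    # Quadratic form dx^T * inv * dx.
    q = (dx[0] * (inv[0][0] * dx[0] + inv[0][1] * dx[1])
         + dx[1] * (inv[1][0] * dx[0] + inv[1][1] * dx[1]))
    return math.sqrt(q)

def nearest_class(x, classes):
    """Return the name of the class whose (mean, cov) gives the smallest distance to x."""
    return min(classes, key=lambda name: mahalanobis_2d(x, *classes[name]))

# Hypothetical class statistics for two frame types (illustration only).
classes = {
    "consonant": ((0.8, 0.2), ((0.04, 0.0), (0.0, 0.04))),
    "vowel": ((0.2, 0.8), ((0.04, 0.0), (0.0, 0.04))),
}
print(nearest_class((0.7, 0.3), classes))
```

With an identity covariance the distance reduces to the ordinary Euclidean distance, which is a convenient sanity check.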

The Korean Word Length Effect on Auditory Word Recognition

  • 최원일;남기춘
    • Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori / No. 44 / pp. 33-46 / 2002
  • This study examined the effect of word length on auditory word recognition. Word length can be defined in terms of several sublexical units, such as letters, phonemes, and syllables. To find out which sublexical units are influential in auditory word recognition, an auditory lexical decision task was used. In Experiment 1, we examined the partial correlation between reaction time and the number of sublexical units, and in Experiment 2, we ran an ANOVA to determine which sublexical length variable was influential. From these two experiments, we concluded that syllable length was the most important variable in auditory word recognition.

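For readers unfamiliar with the partial correlation mentioned in the abstract above, the standard first-order formula can be sketched as follows. The variable roles (x as reaction time, y as one length measure, z as a second length measure being controlled for) and the input values are assumptions for illustration; the abstract does not report the underlying correlations.

```python
import math

def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z."""
    denom = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    if denom == 0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denom

# Hypothetical pairwise correlations (not taken from the study).
print(round(partial_correlation(0.6, 0.5, 0.5), 4))
```

When z is uncorrelated with both x and y, the partial correlation equals the plain correlation r_xy.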

The Statistical Study on the Patients with Functional Articulation Disorders - Centering on the Background Information and Phonological Processes of Errors -

  • 표화영
    • Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori / No. 39 / pp. 53-71 / 2000
  • A statistical study was performed on 130 patients diagnosed with functional articulation disorders and no physical problems, in order to investigate their background information and the phonological processes of their errors. The results are as follows: (1) Males showed a higher prevalence than females, and 5-year-old patients were the most prevalent age group. (2) Most patients showed errors in 2 to 5 phonemes. (3) The most frequent errors were found in plosives and alveolar sounds, and the most frequent phonological processes in terms of manner and place of articulation were stop assimilation and alveolar assimilation, respectively.


Modeling Cross-morpheme Pronunciation Variations for Korean Large Vocabulary Continuous Speech Recognition

  • 정민화;이경님
    • Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori / No. 49 / pp. 107-121 / 2004
  • In this paper, we describe a cross-morpheme pronunciation variation model that is especially useful for constructing a morpheme-based pronunciation lexicon to improve the performance of Korean LVCSR. Many pronunciation variations occur at morpheme boundaries in continuous speech. Since phonemic context, together with morphological category and morpheme boundary information, affects Korean pronunciation variation, we distinguish phonological rules that apply to phonemes within a morpheme from those that apply across morphemes. The results of 33K-morpheme Korean CSR experiments show that an absolute reduction of 1.45% in WER from the baseline performance of 18.42% WER was achieved by modeling the proposed pronunciation variations with a multiple context-dependent pronunciation lexicon.

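The WER figures quoted in the abstract above are given as an absolute reduction. As a small worked example, the resulting WER and the corresponding relative reduction can be derived from the two reported numbers; the function name is hypothetical and the derived values are simple arithmetic, not results stated by the authors.

```python
def wer_reduction(baseline_wer, absolute_reduction):
    """Return (new WER, relative reduction in percent) from a baseline and an absolute drop."""
    new_wer = baseline_wer - absolute_reduction
    relative = absolute_reduction / baseline_wer * 100.0
    return new_wer, relative

# Figures reported in the abstract: 18.42% baseline WER, 1.45% absolute reduction.
new_wer, rel = wer_reduction(18.42, 1.45)
print(f"{new_wer:.2f}% WER, {rel:.2f}% relative reduction")
# prints "16.97% WER, 7.87% relative reduction"
```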

On Detecting the Transition Regions of Phonemes by Using the Asymmetrical Rate of Speech Waveforms

  • 배명진;이을재;안수길
    • The Journal of the Acoustical Society of Korea / Vol. 9, No. 4 / pp. 55-65 / 1990
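  • Continuous speech recognition requires a segmentation step that determines the phonetic boundaries of the speech signal. This paper proposes the asymmetry rate within a frame as a parameter for determining the transition regions of the speech signal. The proposed parameter broadly characterizes the rate of change of the speech amplitude within that frame, and comparing it with the asymmetry rates of neighboring frames makes it possible to tell whether the current frame is in a steady state or in a transition region.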


The Voice Dialing System Using Dynamic Hidden Markov Models and Lexical Analysis

  • 최성호;이강성;김순협
    • Journal of the Korean Institute of Telematics and Electronics B / Vol. 28B, No. 7 / pp. 548-556 / 1991
  • In this paper, spoken Korean continuous digits are recognized using a DHMM (Dynamic Hidden Markov Model) and lexical analysis, as a basis for developing a voice dialing system. The speech is segmented into phoneme units and then recognized. The system can be divided into a segmentation section, a standard-speech design section, a recognition section, and a lexical analysis section. In the segmentation section, the speech is segmented using the ZCR, the 0th-order LPC cepstrum, and Ai, a voiced-speech detection parameter, all of which change over time. In the standard-speech design section, 19 phonemes or syllables are trained by DHMM and stored as standard speech. In the recognition section, the phoneme stream is recognized by the Viterbi algorithm. In the lexical decoder section, the finally recognized continuous digits are output. The experiment showed a recognition rate of 85.1% on data consisting of 21 classes of 7 continuous digits, combined to cover all occurrences, each spoken 7 times by 10 men.

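The Viterbi decoding step named in the abstract above can be sketched on a toy discrete HMM. This is a generic textbook formulation, not the paper's DHMM: the state names, observation symbols, and all probabilities below are made up for illustration, and the function assumes a non-empty observation sequence.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (most likely state path, its probability) for a discrete HMM."""
    # best[t][s]: probability of the most likely path that ends in state s at time t.
    best = [{s: start_p[s] * emit_p[s][observations[0]] for s in states}]
    back = [{}]
    for obs in observations[1:]:
        scores, pointers = {}, {}
        for s in states:
            # Pick the predecessor that maximizes the path probability into s.
            prev = max(states, key=lambda p: best[-1][p] * trans_p[p][s])
            scores[s] = best[-1][prev] * trans_p[prev][s] * emit_p[s][obs]
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Backtrack from the best final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for pointers in reversed(back[1:]):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, best[-1][last]

# Hypothetical two-state model with three observation symbols.
states = ["A", "B"]
start_p = {"A": 0.6, "B": 0.4}
trans_p = {"A": {"A": 0.7, "B": 0.3}, "B": {"A": 0.4, "B": 0.6}}
emit_p = {"A": {"x": 0.5, "y": 0.4, "z": 0.1}, "B": {"x": 0.1, "y": 0.3, "z": 0.6}}
path, prob = viterbi(["x", "y", "z"], states, start_p, trans_p, emit_p)
print(path, prob)
```

In a real recognizer the products would normally be replaced by sums of log probabilities to avoid numerical underflow on long sequences.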

Ortho-phonic Alphabet Creation by the Musical Theory and its Segmental Algorithm

  • 진용옥;안정근
    • Speech Sciences / Vol. 8, No. 2 / pp. 49-59 / 2001
  • Phoneme segmentation is a very difficult problem in speech processing because a segmentation algorithm has to cope with many kinds of allophones and coarticulation. As a result, systems for speech recognition and voice retrieval have complex structures. To address this, we discuss the possibility of a new segmentation algorithm based on the method of subtracting or adding one third in tripartition (삼분손익) of the twelve-tone temperament (12 율려), first proposed by Prof. T. S. Han. The method is close to both Eastern and Western musical theory. He has also suggested a set of 3 consonant and 3 vowel phonemes in Hunminjungum (훈민정음), invented by King Sejong in the 15th century. In this paper, we propose naming this unit the ortho-phonic phoneme (OPP, 정음소), which carries the meaning of 'absoluteness and independence'. OPP is also applicable to other languages and systems, for example the IPA. Finally, we find that this algorithm can be applied consistently across languages and is very useful for building speech recognition and retrieval systems.


An Acoustic and Aerodynamic Study of Korean Fricatives and Affricates

  • 표화영;이주환;최성희;심현섭;최홍식
    • Speech Sciences / Vol. 6 / pp. 145-161 / 1999
  • Twenty-one normal native speakers of Korean participated as subjects in an acoustic and aerodynamic study of Korean fricatives and affricates, with the aim of applying the results to patients with articulation problems. Their productions of [sa], [s'a], [ca], [cʰa], [c'a], [asa], [as'a], [aca], [acʰa], and [ac'a] were analyzed with CSL and AP II instruments. The results are as follows: (1) Fricatives showed higher minimum and maximum frequencies and longer duration than affricates. (2) Fricatives showed higher peak flow rate and longer rise time than affricates. (3) When the different phonemes were compared with each other, their differences were usually statistically significant, but CV and VCV syllables did not differ significantly, even though VCV syllables showed higher and longer values than CV syllables. (4) Generally, lax fricatives and affricates showed lower frequency, higher peak flow rate, shorter frication duration, and longer rise time.


The Statistical Study on the Patients with Functional Articulation Disorders - Centering on the Background Information and Phonological Processes of Errors -

  • 표화영
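    • Korean Society of Phonetic Sciences and Speech Technology: Conference Proceedings / Proceedings of the Korean Society of Phonetic Sciences and Speech Technology Conference, March 2000 / pp. 141-155 / 2000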
  • A statistical study was performed on 130 patients diagnosed with functional articulation disorders and no physical problems, in order to investigate their background information and the phonological processes of their errors. The results are as follows: (1) Males showed a higher prevalence than females, and 5-year-old patients were the most prevalent age group. (2) Most patients showed errors in 2 to 5 phonemes. (3) The most frequent errors were found in plosives and alveolars, and the most frequent phonological processes in terms of manner and place of articulation were stop assimilation and alveolar assimilation, respectively.


N-gram Based Robust Spoken Document Retrievals for Phoneme Recognition Errors

  • 이수장;박경미;오영환
    • Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori / No. 67 / pp. 149-166 / 2008
  • In spoken document retrieval (SDR), subword units (typically phonemes) are used as indexing terms to avoid the out-of-vocabulary (OOV) problem. This makes the indexing and retrieval process independent of any vocabulary, and it requires only a small corpus to train the acoustic model. However, the subword indexing approach has a major drawback: it shows higher word error rates than a large vocabulary continuous speech recognition (LVCSR) system. In this paper, we propose a probabilistic slot detection and n-gram based string matching method for phone-based spoken document retrieval to overcome the high error rates of the phone recognizer. Experimental results show a 9.25% relative improvement in mean average precision (mAP) with a 1.7-fold speed-up in comparison with the baseline system.

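To give a feel for why n-gram matching tolerates phone recognition errors, as the abstract above suggests, here is a minimal sketch that scores two phone sequences by the Dice coefficient over their bigram counts. This is a generic similarity measure swapped in for illustration, not the paper's probabilistic slot detection or its matching method; the function names and the example phone sequences are hypothetical.

```python
from collections import Counter

def ngrams(phones, n):
    """Return all contiguous n-grams of a phone sequence as tuples."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]

def ngram_dice(query, document, n=2):
    """Dice coefficient over n-gram counts; 1.0 means identical n-gram multisets."""
    q, d = Counter(ngrams(query, n)), Counter(ngrams(document, n))
    total = sum(q.values()) + sum(d.values())
    if total == 0:
        return 0.0
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((q & d).values())
    return 2.0 * overlap / total

# One substituted phone ("m" recognized as "n") still leaves half the bigrams intact.
query = ["k", "a", "m", "s", "a"]
recognized = ["k", "a", "n", "s", "a"]
print(ngram_dice(query, recognized))
```

An exact string match would fail outright on the substituted phone, whereas the n-gram score degrades gradually.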