• Title/Summary/Keyword: 한국어 단모음

Search Result 47, Processing Time 0.024 seconds

Speech Recognition Based on VQ/NN using Fuzzy (Fuzzy를 이용한 VQ/NN에 기초를 둔 음성 인식)

  • Ann, Tae-Ock
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.6
    • /
    • pp.5-11
    • /
    • 1996
  • This paper is the study for recognizing single vowels of speaker-independent, and we suppose a method of speech recognition using VQ(Vector Quantization)/NN(Neural Network). This method makes a VQ codebook, which is used for obtaining the observation sequence, and then claculates the probability value by comparing each codeword with the data, finally uses these probability values for the input value of the neural network. Korean signle vowels are selected for our recognition experiment, and ten male speakers pronounced eight single vowels ten times. We compare the performance of our method with those of fuzzy VQ/HMM and conventional VQ/NN According to the experiment result, the recognition rate by VQ/NN is 92.3%, by VQ/HMM using fuzzy is 93.8% and by VQ/NN using fuzzy is 95.7%. Therefore, it is shown that recognition rate of speech recognition by fuzzy VQ/NN is better than those of fuzzy VQ/HMM and conventional VQ/HMM because of its excellent learning ability.

  • PDF

The Recognition of Korean Single vowels by Use of the Diffusion Filter Bank as a Pre-processor (확산필터뱅크를 전처리기로 사용한 한국어 단모음인식)

  • Huh, Man-Tak;Kim, Jae-Chang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1
    • /
    • pp.81-87
    • /
    • 1997
  • In this paper, a new pre-processing method for the recognition of single vowels by use of spectrum envelope is presented. We use new extraction method of a spectrum envelope using the diffusion filter bank. By dividing analysis band of a diffusion filter bank into subbands, we decreased the number of diffusion process. And, by increasing the number of difference, we got higher selectivity. As a result of them, we reduced the total processing time, and got higher enhancement of discrimination. By getting 88.3% of average recognition rate for single vowels of natural voice through computer simulation. We confirmed it to be useful for speech recognition which use spectrum analysis of the voice signal to have many frequency components.

  • PDF

A Fundamental Phonetic Investigation of Korean Monophthongs (한국어 단모음의 음성학적 기반연구)

  • Moon, Seung-Jae
    • MALSORI
    • /
    • no.62
    • /
    • pp.1-17
    • /
    • 2007
  • The purpose of this study was to investigate and quantitatively describe the acoustic characteristics of current Korean monophthongs. Recordings were made of 33 men and 27 women producing the vowels /i, e, ${\epsilon}$, a, ${\partial}$, o, u, i/ in a carrier phrase "This character is ___." A listening test was conducted in which 19 participants judged each vowel. F1, F2, and F3 were measured from the vowels judged as intended vowels by more than 17 people from the listening test. Analysis of formant data shows some interesting results including the undeniable confirmation of the 7-vowel system in modern Korean. It turns out that quite different sounding Korean vowels and English vowels happen to have very similar formant measurements. Also the difference between "citation-form reading" vs. "natural utterance reading" is discussed.

  • PDF

A study on the automatic recognition of Korean vowel (한국어 단모음 자동 인식에 관한 연구)

  • 안동순
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1984.12a
    • /
    • pp.57-61
    • /
    • 1984
  • In this study, the system is proposed which can be used for recognition of Koean single vowles "ㅏ, ㅓ, ㅗ, ㅜ, ㅡ, ㅣ, ㅐ, ㅔ, ㅚ,", and automatic recognition is processed using $\mu$-computer. 3 men of not-being-studied are participated in this experiment. Using the period of vowels, one part of the steady state is selected for high speed recognition, and amplitude comparison method, LPC, PARCOR, and Formant are used for parameter of recognition. Formant is obtained by peak picking method using LPC, and then vowels are recognized by amplitude comparison method, LPC, PARCOR, and Formant. As a result, Recognition rates are 90.1% for amplitude comparison method, 93.1% for LPC, 100% for PARCOR, 88.8% for using formant.

  • PDF

An Analysis of Korean Monophthongs Produced by Korean Native Speakers and Adult Learners of Korean (한국인과 한국어 학습자의 단모음 발화)

  • Kim, Jeong-Ah;Kim, Da-Hee;Rhee, Seok-Chae
    • MALSORI
    • /
    • no.65
    • /
    • pp.13-36
    • /
    • 2008
  • This paper attempts to analyze the characteristics of Korean vowel production by 12 Korean native speakers and 36 adult learners. The analyses have been performed with investigations of F1and F2 values. Results showed that there's no significant difference between /ㅔ/ and /H/ and between /ㅗ/ and /ㅜ/ in Korean native speakers' pronunciations. The distinguishing tendencies found in the analyses of foreign learners' pronunciations are fronting and lowering of /ㅗ/ by English speakers, backing and heightening of /ㅓ/ by Japanese speakers and backing and lowering of /ㅏ/ by Chinese speakers. For the limitations of this paper, it has a meaning of a preliminary study and could be developed into further research to show the order of acquisition and L1 transference.

  • PDF

A Vowel Discrimination of Korean Monophthongs [i, e, a, o, u, ${\omega}$] Using Vocal Tract Magnetic Resonance Image and F1/F2 (성도 자기공명 영상과 음향정보(F1/F2)를 이용한 한국어 단모음 [이, 에, 아, 오, 우, 으] 판별)

  • Seong, Cheol-Jae;Park, Jong-Won;Kim, Gui-Ryong
    • MALSORI
    • /
    • no.56
    • /
    • pp.103-125
    • /
    • 2005
  • We present a new method of measuring the volume and cross-sectional area of the vocal tract from magnetic resonance images. The vocal tract was divided by the 2 constriction points on the horizontal and vertical planes. The ratios of the volumes of the segment vocal tracts to that of the entire vocal tract play a crucial role in discriminating Korean monophthongs in that vowels were successfully discriminated by the ratios. The discriminant analysis also demonstrated that the acoustic parameters F1 and F2, in addition to the segment volumes, serve as significant parameters in discriminating Korean monophthongs.

  • PDF

A Longitudinal Study of Korean Vowel Production by Chinese Learners of Korean (중국인 학습자가 발음한 한국어 단모음에 대한 종단 연구)

  • Kim, Jooyeon
    • Phonetics and Speech Sciences
    • /
    • v.5 no.2
    • /
    • pp.71-79
    • /
    • 2013
  • This study provided longitudinal examination of the Chinese learners' acquisition of the Korean vowels. Specifically the author examined whether Korean monophthongs are acquired rapidly in early stages of learning (Flege, Munro and Skelton, 1992; Munro and Derwing, 2008) or they develop rather gradually in proportion to the learners' experience (Byee, 2001; Ellis, 2006). This study collected the Korean vowel production by 23 Chinese learners for a year, and then analysed F1 and F2 of each Korean vowel. The results showed that 1) Most of the second language (L2) vowels were rapidly improved during the first six or nine months of Korean learning before reaching the constant stage; and 2) The exact acquisition trajectories varied across the seven vowels. Specifically the vowels which were acquired in the early stage of learning were /i, e, ɨ/ for F1 and /ʌ, e, o, u/ for F2. Thus this study supports the hypothesis of Flege et al. (1992) and Munro and Derwing (2008) except the fact that each vowel showed the different learning route.

A Study on Phoneme-Based PSOLA Speech Synthesis Using LSP (LSP를 이용한 음소단위 PSOLA 음성합성에 관한 연구)

  • 권혁제;조순계;김종교
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.2
    • /
    • pp.3-10
    • /
    • 1998
  • 본 논문에서는 음소단위 PSOLA 한국어 합성을 LSP line의 조절과 자모음 분석을 통해서 실시하였다. 음성합성에서 많이 사용하는 triphone, diphone, demisyllable등과 같은 합성단위들은 자연스러운 합성음을 위해 다양한 음운환경에서 수집된다. 그러나, 이런 방법 은 많은 시간과 메모리가 요구된다. 본 논문에서는 합성단위로서 자음17개, 모음 16개로 총 33개의 음소를 이용하였다. 자음은 후위모음/이/인 CV에서 segment되고, 모음은 단음절의 단모음과 이중모음을 1인의 화자로부터 합성데이터를 수집하였다. 또한, 10명의 화자가 발성 한 CV에서 각 모음에 따라 변하는 자음의 주파수를 분석하였고, CV+VC 또는 CV+CV에서 각 자음에 따라 변하는 모음의 포먼트변화를 분석하였다. 분석결과를 토대로 모음은 LSP line을 조절해서 PSOLA합성을 하고, 자음은 합성하려는 모음과 결합하였다. 그 결과 6개의 합성단어에 대한 청취율은 65%를 보였다.

  • PDF

Monophthong Analysis on a Large-scale Speech Corpus of Read-Style Korean (한국어 대용량발화말뭉치의 단모음분석)

  • Yoon, Tae-Jin;Kang, Yoonjung
    • Phonetics and Speech Sciences
    • /
    • v.6 no.3
    • /
    • pp.139-145
    • /
    • 2014
  • The paper describes methods of conducting vowel analysis from a large-scale corpus with the aids of forced alignment and optimal formant ceiling methods. 'Read Style Corpus of Standard Korean' is used for building the forced alignment system and a subset of the corpus for the processing and extraction of features for vowel analysis based on optimal formant ceiling. The results of the vowel analysis are reliable and comparable to the results obtained using traditional analytical methods. The findings indicate that the methods adopted for the analysis can be extended and be used for more fine-grained analysis without time-consuming manual labeling without losing accuracy and reliability.

Changes in Features of Korean Vowels with Age and Sex of Speakers and Their Recognition (한국어 단모음의 성별, 연령별 특징변화 및 인식)

  • 이용주;김경태;차균현
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.25 no.12
    • /
    • pp.1503-1512
    • /
    • 1988
  • As the basic analysis to solve the within-and cross-speaker variability in phoneme based speech recognition, changes in pitch and formant frequencies of 8 Korean vowels with age and sex of speaker has been investigated by analyzing a large number fo samples. Conclusions obtained are as follows: 1) Changes in pitch frequency with age and sex of speaker for children are hard to distinguish and the difference of before and after the voice change is analyzed approximately 0.2 oct. for female an 0.9 oct. for male. 2) While most of the formants of vowel considerably change with the age of speaker, the change becomes smaller as the age becomes older. 3) While there is an indirect correlation between pitch and formant with change in age, it is hard to see a direct correlation. 4) When the objects of the recognition experiment by pitch and formants are various speakers in each age and sex, pitch also works as an efficient recognition parameter.

  • PDF