• Title/Summary/Keyword: 단모음 (monophthong)

Korean Single-Vowel Recognition Using Cumulants in Color Noisy Environment (유색 잡음 환경하에서 Cumulant를 이용한 한국어 단모음 인식)

  • Lee, Hyung-Gun;Yang, Won-Young;Cho, Yong-Soo
    • The Journal of the Acoustical Society of Korea / v.13 no.2 / pp.50-59 / 1994
  • This paper presents a speech recognition method that uses third-order cumulants as the feature vector and a neural network as the recognizer. Higher-order cumulants decouple the Gaussian noise from the speech, which makes it possible to estimate the coefficients of the AR model without bias. Unlike the conventional method based on second-order statistics, the proposed method exhibits low bias even at SNRs as low as 0 dB, at the expense of higher variance. Computer simulation confirms that the recognition rate for Korean single vowels is much higher with the cumulant-based method than with the conventional method, even at low SNR.

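
The abstract above relies on the fact that the third-order cumulants of Gaussian noise vanish, while those of a skewed signal do not. A minimal sketch of a sample estimator follows, assuming a zero-mean stationary signal and a biased estimate over non-negative lags; the function name `third_order_cumulant` and the lag range are illustrative choices, not the authors' implementation.

```python
import numpy as np

def third_order_cumulant(x, max_lag):
    """Biased sample estimate of C3(t1, t2) = E[x(n) x(n+t1) x(n+t2)]
    for 0 <= t1, t2 <= max_lag, after removing the sample mean."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    n = len(x)
    c3 = np.zeros((max_lag + 1, max_lag + 1))
    for t1 in range(max_lag + 1):
        for t2 in range(max_lag + 1):
            m = n - max(t1, t2)  # number of usable sample triples
            c3[t1, t2] = np.sum(x[:m] * x[t1:t1 + m] * x[t2:t2 + m]) / n
    return c3
```

For a long Gaussian sequence the estimated values cluster near zero, whereas a skewed sequence (for example, exponentially distributed samples) gives a clearly non-zero value at lag (0, 0). The estimate is noisier than a second-order one, which matches the variance trade-off the abstract mentions.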

The Recognition of Korean Single vowels by Use of the Diffusion Filter Bank as a Pre-processor (확산필터뱅크를 전처리기로 사용한 한국어 단모음인식)

  • Huh, Man-Tak;Kim, Jae-Chang
    • The Journal of the Acoustical Society of Korea / v.16 no.1 / pp.81-87 / 1997
  • This paper presents a new pre-processing method for recognizing single vowels from the spectrum envelope. The spectrum envelope is extracted with a diffusion filter bank. Dividing the analysis band of the diffusion filter bank into subbands decreases the number of diffusion processes, and increasing the number of differences gives higher selectivity. Together, these changes reduce the total processing time and improve discrimination. In computer simulation, the method achieved an average recognition rate of 88.3% for single vowels of natural voice, which confirms that it is useful for speech recognition based on spectrum analysis of voice signals with many frequency components.


The Experimental Study on Korean Monophthong of Taiwanese Learners of Korean-Focusing on College Students Majoring in Korean (대만 한국어 학습자의 한국어 단모음에 대한 실험음성학적 연구 -한국어를 전공하는 대학생을 중심으로-)

  • Jung, Sunghoon
    • Journal of Korean language education / v.29 no.2 / pp.155-180 / 2018
  • The purpose of this study is to acoustically analyze the eight Korean monophthongs produced by 29 Taiwanese learners of Korean and 20 native speakers of Korean, and to compare their pronunciations from the standpoint of experimental phonetics. The first formant (F1) and second formant (F2) of the Korean monophthongs are used to estimate the tongue positions of the vowels produced by the participants. To compare them directly, the participants' F1 and F2 values were normalized. The results show that almost all vowels of the Taiwanese learners, except /ㅏ/, differ significantly from those of the native speakers in their F1 and F2 values. In particular, when pronouncing Korean monophthongs, the Taiwanese learners used a narrower area of the place of articulation than the native speakers, except for back vowels. Overall, the Taiwanese learners had a narrower range of articulation and articulated the vowels slightly further back than the native speakers.
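
The abstract does not say which normalization was applied to F1 and F2. As one common possibility, the sketch below uses Lobanov (per-speaker z-score) normalization; the choice of method, the function name `lobanov_normalize`, and the formant values in the usage lines are assumptions for illustration only, not data from the study.

```python
import numpy as np

def lobanov_normalize(formants):
    """Per-speaker z-score of formant values.

    formants: array of shape (n_tokens, n_formants), e.g. columns F1 and F2
    in Hz, all from ONE speaker. Each column is centered on that speaker's
    mean and scaled by that speaker's standard deviation.
    """
    f = np.asarray(formants, dtype=float)
    return (f - f.mean(axis=0)) / f.std(axis=0, ddof=1)

# Hypothetical F1/F2 values (Hz) for two speakers whose vocal tracts
# differ only by a uniform scale factor.
speaker_a = np.array([[300.0, 2300.0], [700.0, 1200.0],
                      [350.0, 800.0], [500.0, 1500.0]])
speaker_b = speaker_a * 1.2
print(np.allclose(lobanov_normalize(speaker_a), lobanov_normalize(speaker_b)))
```

After normalization the two hypothetical speakers occupy the same positions in the vowel space, which is what makes a direct comparison between speaker groups possible.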

Analysis of Speech Signals by Linear Prediction and Its Application (선형 예측법에 의한 음성신호의 분석과 그 응용 방안)

  • 김명규
    • Journal of the Korean Institute of Telematics and Electronics / v.18 no.4 / pp.27-33 / 1981
  • In this paper, the effect of tone variation of speech signals is discussed by showing the variations of the linear prediction model spectra and the estimated vocal tract shape for Korean vowels. As an application of the analysis results, a speech synthesis scheme based on the combination of phonemes is also discussed using experimental results.


A Study on Unspecified Speaker Recognition by Selective Pattern-Block Neural Network (선택적 패턴블럭 신경회로망을 이용한 불특정 화자 인식)

  • 강명광
    • Proceedings of the Acoustical Society of Korea Conference / 1995.06a / pp.96-99 / 1995
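  • This study concerns neural networks that take the characteristics of the feature parameters into account. It proposes a pattern-block selective neural network and, to evaluate its performance, carries out speaker-independent recognition experiments on Korean monophthongs. Unlike conventional pattern-matching algorithms, which do not consider how the feature parameters vary from pattern to pattern, the proposed network divides the applied pattern into several sub-patterns according to the characteristics of the parameters, then selects the most suitable sub-pattern for learning and recognition.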


A Study on the Vowel Recognition of Korean Speech using Spatio-temporal Method (Spatio-temporal 방법을 이용한 우리말 모음 인식에 관한 연구)

  • 송도선;김선일;김석동;이행세
    • The Journal of the Acoustical Society of Korea / v.12 no.4 / pp.57-62 / 1993
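  • This paper studies the recognition of Korean vowels using a neural network. Vowels were recognized without segmenting the speech, without phoneme-by-phoneme recognition, and without time warping. Changes in the speech over time were treated as static speech. Each utterance was divided evenly into 10 frames, and 10th-order PARCOR coefficients were extracted from each frame. To simplify the network structure, monophthongs and diphthongs were trained separately, and the outputs were arranged as a binary code to reduce the number of output nodes.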

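
The feature extraction described above (10 equal frames, 10th-order PARCOR coefficients per frame) can be sketched as follows. This is a minimal illustration that assumes the autocorrelation method with the Levinson-Durbin recursion and no windowing or pre-emphasis; the abstract does not give those details, and the function names `parcor` and `utterance_features` are hypothetical. Sign conventions for PARCOR coefficients vary between texts; here the prediction-error filter is 1 + a1 z^-1 + ... and the reflection coefficients are returned with that sign.

```python
import numpy as np

def parcor(frame, order=10):
    """Reflection (PARCOR) coefficients of one frame via Levinson-Durbin."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order (unnormalized, biased form).
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)   # prediction-error filter coefficients
    a[0] = 1.0
    err = r[0]                # prediction error energy
    k_out = np.zeros(order)
    for i in range(1, order + 1):
        if err <= 0:          # silent or degenerate frame: stop early
            break
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        k_out[i - 1] = k
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return k_out

def utterance_features(signal, n_frames=10, order=10):
    """Split a signal into n_frames near-equal frames and stack their PARCORs."""
    frames = np.array_split(np.asarray(signal, dtype=float), n_frames)
    return np.concatenate([parcor(f, order) for f in frames])
```

With the defaults, every utterance yields a fixed-length vector of 100 values regardless of its duration, which is consistent with the abstract's point that no time warping is needed before the network input.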

A Realization of Tone in Modern Chinese by the Leverage Principle and Its Teaching Strategies (지렛대 원리에 따른 중국어 성조 실현과 교육 방법)

  • Chang, Ho-Deug
    • Cross-Cultural Studies / v.30 / pp.259-277 / 2013
  • This article covers the realization of tone in Modern Chinese by the leverage principle and then explores teaching strategies for it. The teaching strategies are as follows. First, pronouncing Chinese vowels always takes far longer than you anticipate. Second, pronounce and practice Chinese vowels with the leverage principle. Third, understand and practice the sound change rule of '?'.

Speaker Adapted Real-time Dialogue Speech Recognition Considering Korean Vocal Sound System (한국어 음운체계를 고려한 화자적응 실시간 단모음인식에 관한 연구)

  • Hwang, Seon-Min;Yun, Han-Kyung;Song, Bok-Hee
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.6 no.4 / pp.201-207 / 2013
  • Voice recognition techniques have been developed and actively applied to various information devices such as smartphones and car navigation systems. However, the basic research on speech recognition is based on results obtained for English. Lip sync production generally requires tedious manual work by animators, which seriously affects the production cost and development period needed for high-quality lip animation. In this research, a real-time automatic lip sync algorithm for virtual characters in digital content is studied with the Korean vocal sound system taken into account. The suggested algorithm helps produce natural lip animation at lower production cost and in a shorter development period.

Fundamental Acoustic Investigation of Korean Male 5 Monophthongs (한국 남성의 단모음 [아, 에, 이, 오, 우]에 대한 음향음성학적 기반연구)

  • Choi, Yae-Lin
    • The Journal of the Korea Contents Association / v.10 no.6 / pp.373-377 / 2010
  • Numerous quantitative and qualitative studies of English vowels have already been published. However, only a few studies based on the acoustic analysis of Korean vowels have been carried out. The purpose of this study is to obtain sufficient quantitative data on the acoustic aspects of Korean vowels produced by males in their 20s and 30s. A total of 31 males in their 20s and 30s produced the five fundamental vowels /a, e, i, o, u/, repeating each three times in the standard Korean dialect. The productions were recorded with 'Cool edit', and F1, F2, F3, and F4 were extracted with a MATLAB acoustic analysis program. The results indicated that the overall formant patterns were similar to those of previous studies, except that the F1 and F2 values of the vowels produced in this study were generally lower than those in previous studies. Future studies need to focus on obtaining vowel data that take into account other factors such as age and other speech materials.

Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms (스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화)

  • Byeon, Haewon
    • Journal of the Korea Convergence Society / v.10 no.9 / pp.193-198 / 2019
  • This study analyzed changes in the vowel formants, voice intensity, and fundamental frequency of vowels over 11 months using acoustochemical spectrogram analysis of women diagnosed with amyotrophic lateral sclerosis (ALS). The test words were the vowels /a, i, u/ and the diphthong words /h + ja + da/, /h + wi + da/, and /h + ɰi + da/. Speech data were collected through a word reading task presented on a monitor using the 'Alvin' program, and the recording environment was set to a sampling rate of 11,000 Hz, giving a Nyquist frequency of 5,500 Hz. The recordings were analyzed with spectrograms to obtain the vowel formants, voice intensity, and fundamental frequency. The analysis showed that the fundamental frequency and intensity decreased as ALS progressed, and that the formant slope of the diphthongs decreased, rather than the formants of the vowels changing. This result suggests that the vowel distortion in ALS due to disease progression is due to the decrease of tongue and jaw co morbidity.
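
The two recording figures quoted in the abstract are tied together by the sampling theorem: the Nyquist frequency is half the sampling rate, so only spectral content below it can be represented. A small check of that arithmetic follows; the function name `nyquist_frequency` is illustrative.

```python
def nyquist_frequency(sampling_rate_hz):
    """Highest frequency representable at the given sampling rate."""
    return sampling_rate_hz / 2.0

print(nyquist_frequency(11_000))  # 5500.0
```

At this setting, any formant or harmonic above 5,500 Hz falls outside the analyzable band of the spectrogram.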