• Title/Summary/Keyword: Korean consonants

Search Result 400, Processing Time 0.029 seconds

Lengthening and shortening processes in Korean

  • Kang, Hyunsook;Kim, Tae-kyung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.15-23
    • /
    • 2020
  • This study examines the duration of Korean lax and tense stops in the prosodic word-medial position, their interactions with nearby segments, and the phonological implications of these interactions. It first examines the lengthening of consonants at the function of the short lax stop. Experiment 1 shows that the sonorant C1 is significantly longer before a short lax stop C2 than before a long tense stop. Experiment 2 shows that the short lax stop C1 cancels the contrast between the lax and tense obstruent at C2, making them appear as long tense obstruents (Post-Stop Tensing Rule). We suggest that such lengthening phenomena occur in Korean to robustly preserve the contrastive length difference between C and CC. Second, this study examines the vowel shortening, known as Closed-Syllable Vowel Shortening, before a long tense stop or before the consonant sequence. Experiment 3 suggests that it be interpreted as temporal adjustment to make the interval from the onset of a vowel to the onset of the following vowel of near-equal length. Conclusively, we suggest that Korean speech be planned and controlled with two specific intervals. One is the duration of contrastive consonant intervals between vowels, and the other is the duration from the onset of a vowel to the onset of the following vowel.

A Study of Korean Soft-keyboard Layout for One Finger Text Entry (한 손가락 문자 입력을 위한 한글 Soft-keyboard 배열에 관한 연구)

  • Kong, Byung-Don;Hong, Seung-Kweon;Jo, Seong-Sik;Myung, Ro-Hae
    • IE interfaces
    • /
    • v.22 no.4
    • /
    • pp.329-335
    • /
    • 2009
  • Recently, the use of soft-keyboard is widespread and increases, because various handheld devices were developed such as PDA, navigation, mobile phones with enhanced competence of touchscreen. The use of soft-keyboard requires different characteristics compared to traditional hard-keyboard like QWERTY keyboard: no standard character layout, one finger entry, and cognitive processing time. In this study, therefore, the optimal soft-keyboard layout for one finger text entry in touchscreen environment was investigated among 6 keyboard layouts which were developed based on traditional characteristic of Korean text and the usage frequency of both vowels and consonants. As a result, the interface with Korean text invention order like 'ㄱㄴㄷㄹ' or 'ㅏㅑㅓㅕㅕ' was found to be better than the interface with usage frequency-based arrangement. Especially the vowels were most efficient when separated into two parts; located at the right-hand side and at right below the consonants. In conclusion, the keyboard layout with regard to the Korean text characteristic and the invention order was a more effective layout resulted from the minimum cognitive processing time.

Statistical Analysis of Korean Phonological Rules Using a Automatic Phonetic Transcription (발음열 자동 변환을 이용한 한국어 음운 변화 규칙의 통계적 분석)

  • Lee Kyong-Nim;Chung Minhwa
    • Proceedings of the KSPS conference
    • /
    • 2002.11a
    • /
    • pp.81-85
    • /
    • 2002
  • We present a statistical analysis of Korean phonological variations using automatic generation of phonetic transcription. We have constructed the automatic generation system of Korean pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived from knowledge-based morphophonological analysis and government standard pronunciation rules. This system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS(Phonetic Balanced Sentence) Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonants. These statistics can be used for improving the performance of speech recognition systems.

  • PDF

A Study on the Speech Recognition of Korean Phonemes Using Recurrent Neural Network Models (순환 신경망 모델을 이용한 한국어 음소의 음성인식에 대한 연구)

  • 김기석;황희영
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.40 no.8
    • /
    • pp.782-791
    • /
    • 1991
  • In the fields of pattern recognition such as speech recognition, several new techniques using Artifical Neural network Models have been proposed and implemented. In particular, the Multilayer Perception Model has been shown to be effective in static speech pattern recognition. But speech has dynamic or temporal characteristics and the most important point in implementing speech recognition systems using Artificial Neural Network Models for continuous speech is the learning of dynamic characteristics and the distributed cues and contextual effects that result from temporal characteristics. But Recurrent Multilayer Perceptron Model is known to be able to learn sequence of pattern. In this paper, the results of applying the Recurrent Model which has possibilities of learning tedmporal characteristics of speech to phoneme recognition is presented. The test data consist of 144 Vowel+ Consonant + Vowel speech chains made up of 4 Korean monothongs and 9 Korean plosive consonants. The input parameters of Artificial Neural Network model used are the FFT coefficients, residual error and zero crossing rates. The Baseline model showed a recognition rate of 91% for volwels and 71% for plosive consonants of one male speaker. We obtained better recognition rates from various other experiments compared to the existing multilayer perceptron model, thus showed the recurrent model to be better suited to speech recognition. And the possibility of using Recurrent Models for speech recognition was experimented by changing the configuration of this baseline model.

A SOUND SPECTROGRAPHICAL STUDY ON THE KOREAN VOWELS AND CONSONANTS PRONOUNCED BY OPENBITE PATIENTS - Frequency Analysis - (SOUND SPECTROGRAPH를 이용한 개교환자의 한국어 자${\cdot}$모음의 발성에 관한 연구 - 주파수 분석을 중심으로 -)

  • Kim, Ki-Dal;Yang, Won Sik
    • The korean journal of orthodontics
    • /
    • v.15 no.1
    • /
    • pp.55-66
    • /
    • 1985
  • The study was undertaken to ascertain the speech defect of patients with malocclusion, especially of openbite patients, by means of the spectral analysis method. The experimental group was composed of ten female openbite patients and their mean age was 13.8 yrs. The control group was also composed of ten female girls and their mean age was 13.7 yrs. As for the speech material, eight Korean monophthrongs and two Korean fricatives and two affricatives were used. Speeches were recorded and then analyzed by a Kay 7800 digital sonagraph. Formant frequency level or range was used as a phonemic parameter. The results were as follows: 1. Among Vowels /a:/ : $F_1,\;F_3\;and\;F_1/F_2$ showed abnormality. /o:/ and $/w:/:F_2,\;F_2-F_1\;and\;F_1/F_2$ showed abnormality. 2. Among Consonants /S/ and /h/ : The upper and lower borders of the frequency range showed abnormality. (equation omitted) : The lower border of the frequency range showed abnormality. $/C^{h}/$ : The upper and lower borders of the frequency range and concentration point showed abnormality.

  • PDF

Voicing and Tone Correlation in L2 English

  • Kim, Mi-Ryoung
    • Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.113-128
    • /
    • 2005
  • The underlying premise of this study was that L1 production is easily transferred into L2 production. In neutral intonation, there is a consonant-tone correlation in Korean: High tone patterns are correlated with voiceless aspirated and tense consonants and Low-High tone patterns are correlated with lax or other voiced consonants. The purpose of this study was to see whether the correlation in Korean (L1) is transferred into English (L2) production and whether the degree of transfer differs depending on the degree of proficiency. Eight Korean speakers and two American speakers participated in the experiment. F0 contours of words and sentences were collected and analyzed. The results of the present study showed that there is a strong correlation between voicing and tone in L2 utterances. When utterance-initial consonant types were voiceless, the word or the sentence began with the H pattern; otherwise it had the LH pattern. The degree of interference differed depending on the degree of proficiency: less proficient speakers showed a stronger correlation in terms of the magnitude (Hz) and size (ms) of the effects on F0. The results indicate that the consonant-tone correlation in L1 is strongly transferred into L2 production and the correlation transfer can be one of the actual aspects that cause L2 speakers to produce deviant L2 accents and intonation.

  • PDF

The Effect of Visual Cues in the Identification of the English Consonants /b/ and /v/ by Native Korean Speakers (한국어 화자의 영어 양순음 /b/와 순치음 /v/ 식별에서 시각 단서의 효과)

  • Kim, Yoon-Hyun;Koh, Sung-Ryong;Valerie, Hazan
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.25-30
    • /
    • 2012
  • This study investigated whether native Korean listeners could use visual cues for the identification of the English consonants /b/ and /v/. Both auditory and audiovisual tokens of word minimal pairs in which the target phonemes were located in word-initial or word-medial position were used. Participants were instructed to decide which consonant they heard in $2{\times}2$ conditions: cue (audio-only, audiovisual) and location (word-initial, word-medial). Mean identification scores were significantly higher for audiovisual than audio-only condition and for word-initial than word-medial condition. Also, according to signal detection theory, sensitivity, d', and response bias, c were calculated based on both hit rates and false alarm rates. The measures showed that the higher identification rate in the audiovisual condition was related with an increase in sensitivity. There were no significant differences in response bias measures across conditions. This result suggests that native Korean speakers can use visual cues while identifying confusing non-native phonemic contrasts. Visual cues can enhance non-native speech perception.

The Phoneme Synthesis of Korean CV Mono-Syllables (한국어 CV단음절의 음소합성)

  • 안점영;김명기
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.11 no.2
    • /
    • pp.93-100
    • /
    • 1986
  • We analyzed Korean CV mono-syllables consisted of concatenation of consonants/k, t, p, g/, their fortis and rough sound and vowels/a, e, o, u, I/by the PARCOR technique, and then we synthesized those speech by means of the phoneme synthesis controlling the analyzed data. In the speech analysis, the duration of consonants decreases in the rough sound, the lenis and the fortis in turns. And also the gain of them decreases in the same tendency. The pitch period increases more and more in vowels following the rough sound, the fortis and the lenis in turns. We synthesized the lenis and the fortis by controlling the duration and the gain of the rough sound, and vowels following the fortis and the rough sound by controlling the pitch period and the duration of vowels following the lenis. As the results, the synthesized speech quality is good and we make certain it is possible to make a rule to the phonome synthesis in Korea speech.

  • PDF

Development of Automatic Lip-sync MAYA Plug-in for 3D Characters (3D 캐릭터에서의 자동 립싱크 MAYA 플러그인 개발)

  • Lee, Sang-Woo;Shin, Sung-Wook;Chung, Sung-Taek
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.18 no.3
    • /
    • pp.127-134
    • /
    • 2018
  • In this paper, we have developed the Auto Lip-Sync Maya plug-in for extracting Korean phonemes from voice data and text information based on Korean and produce high quality 3D lip-sync animation using divided phonemes. In the developed system, phoneme separation was classified into 8 vowels and 13 consonants used in Korean, referring to 49 phonemes provided by Microsoft Speech API engine SAPI. In addition, the pronunciation of vowels and consonants has variety Mouth Shapes, but the same Viseme can be applied to some identical ones. Based on this, we have developed Auto Lip-sync Maya Plug-in based on Python to enable lip-sync animation to be implemented automatically at once.

A Study On the Proportional Difference of Segments in Imitating Voice (모방발화에 나타나는 분절음의 비율연구)

  • Park, Ji-Hye;Shin, Ji-Young;Kang, Sun-Mee
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.205-208
    • /
    • 2004
  • The aim of this study is to analyse the adjustment of the proportion of segment duration in imitating voice. When imitating others' voices, how far is his/her original proportion of segment duration adjusted, and what is this adjustment like under various segments? In this study, I classified segments into consonants and vowels and consonants classified into obstruents and sonorants. The result of the analysis is as follows. ; (1)Individual variation in the proportion of obstruent is not significant, and when imitating, and its distribution is not typicalized. (2) Vowels has individual variation in the proportion of segment duration even under imitating. (3) Nasal has the most distinct individual variation even under imitating, compared with vowel and obstruent. For the further study, I should examine the characteristics of quantitative and qualitative changes in liquid (among sonorant) to find out which segment can best describe personnel characteristics of the proportion of segment duration in imitating voice.

  • PDF