• Title/Summary/Keyword: High Vowel

Search Result 144, Processing Time 0.024 seconds

An Acoustic Study of the Stress and Intonational System in Lakhota: A Preliminary Report

  • Cho, Tae-Hong
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.23-42
    • /
    • 2006
  • This paper reports a preliminary result of an acoustic study on the stress and intonational system in Lakhota, a native American language. It investigates how the stress and intonation in Lakhota are phonetically manifested; and how the stress interacts with other prosodic factors. The results preliminarily obtained from one native Lakhota speaker suggest that the primary cue of the stress is relatively high F0 which is often accompanied by higher intensity (for the vowel) and longer VOT (for aspirated stops). The results also indicate that stress is not reliably marked by duration. The stress system, however, interacts with the intonational pattern, such that, for example, intonational peak falls on the stressed syllable with a general pattern of L+H* and that it interacts with the boundary tone L%, resulting in mid tone utterance-finally. This paper can be viewed largely as a qualitative study on an understudied native American language, Lakhota and as forming a basis for further development of its stress and intonation system whose acoustic properties of its prosodic system have not been investigated before.

  • PDF

Online korean character recognition using letter spotting method (자소 탐색 방법에 의한 온라인 한글 필기 인식)

  • 조범준
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.6
    • /
    • pp.1379-1389
    • /
    • 1996
  • Hangul character always consists of consonants-vowel-consonants in order. Using this point, this paper proposes an approach to design a model for spotting each letter in Hangul, and then recognize characters based on the spotting results. The network model consist of a set of HMMs. The letter search is carried out by Viterbi algorithm, while character recognition is performed by searching the lattice of letter hypotheses. Experimental results show that, in spite of simple architecture of recognition, the performance is quite high reaching 87.47% for discrete regular characters. In particular the approach shows highly plausible segmentation of letters in characters.

  • PDF

Target F2 Values of Coronal Stops in Korean, English, and. French (설단 폐쇄음의 목표 F2 값: 한국어, 영어, 불어의 비교)

  • Oh, Eun-Jin
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.81-91
    • /
    • 2003
  • The aim of this study was to estimate the target F2 values of the coronal plain stop in Korean and the degree of deviation from the target in the context of various vowels, and to compare the results of Korean regarding the coronal stop with those of English and French. An acoustic analysis showed that the mean F2 value of the Korean coronal stop produced by 10 male speakers was 1,855 Hz and the deviation from the target was 94 Hz in the context of [i], 204 Hz in the context of [u], and 407 Hz in the context of [o]. The target F2s of the coronal stop were the highest in English (1,929 Hz) and the lowest in French (1,662 Hz), and the deviation from the targets in the context of the high back vowel was the largest in French (257 Hz) and the smallest in English (73 Hz).

  • PDF

An Acoustic Analysis of Vowels for Severe-profound Hearing Impaired Children (최고도이상의 청력손실을 가진 아동의 모음음형대 분석)

  • Huh, Myung-Jin
    • Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.65-71
    • /
    • 2007
  • The severe-profound hearing impaired children have various disorders in everday communication due to the lack of hearing feedback. Especially, their speech produced unstable voice, omission and distortion of articulation, pitch break, cul-de-sac voice, and so on so that they were difficult to accurately deliver an intended message. This study attempts to analyze the acoustic characteristics of 4 vowel sounds produced by 35 severe-profound hearing impaired children using CSL(Computerized Speech Lab, Model 4300b). The formant data were obtained from the spectrogram and analyzed data by 12 formant filter and auto-correlation among the formants. Results showed that the hearing impaired children's formant values came out very high. They produced the vowels at the mode of hypertension with unstable voice. In order to improve their speech, they would need some adequate auditory feedback.

  • PDF

Handwritten Hangul Graphemes Classification Using Three Artificial Neural Networks

  • Aaron Daniel Snowberger;Choong Ho Lee
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.2
    • /
    • pp.167-173
    • /
    • 2023
  • Hangul is unique compared to other Asian languages because of its simple letter forms that combine to create syllabic shapes. There are 24 basic letters that can be combined to form 27 additional complex letters. This produces 51 graphemes. Hangul optical character recognition has been a research topic for some time; however, handwritten Hangul recognition continues to be challenging owing to the various writing styles, slants, and cursive-like nature of the handwriting. In this study, a dataset containing thousands of samples of 51 Hangul graphemes was gathered from 110 freshmen university students to create a robust dataset with high variance for training an artificial neural network. The collected dataset included 2200 samples for each consonant grapheme and 1100 samples for each vowel grapheme. The dataset was normalized to the MNIST digits dataset, trained in three neural networks, and the obtained results were compared.

Aerodynamic Characteristics, Vocal Efficiency, and Closed Quotient Differences according to Fundamental Frequency Fixation (음도 고정 유무에 따른 공기역학, 음성효율성 및 성대접촉률 차이)

  • Kim, Jaeock
    • Phonetics and Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.19-26
    • /
    • 2013
  • The aerodynamic characteristics (subglottal pressure (Ps) and mean airflow rate (MFR)), fundamental frequency (Fo), intensity (I), vocal efficiency (VE), and closed quotient (CQ) were compared during a sustained vowel /o/ sound under three conditions: in a comfortable loudness and pitch level (condition 1), in a maximum loudness level with a fixed pitch (condition 2), and in a maximum loudness level without a fixed pitch (condition 3). Also, multiple regression analyses were done to measure the aerodynamic characteristics affect on the VE and the CQ in each condition. The results showed the Fo, Ps, MFR, VE, and CQ increased as I increased with and without fixed pitch. Most notably, VE in condition 3 was the highest of all the conditions, but CQ was not very high. By the results of multiple regression analysis, VE was significantly affected by I and Ps in all conditions; Fo was the other main key for affecting VE in high pitch. However, none of the aerodynamic characteristics significantly affected CQ. As I increases, Fo should be increased by increasing Ps and VE. Therefore, researchers should consider and specify an a priori to Fo, Ps, and I when measuring VE to examine the complex and delicate vocal mechanism.

Effects of Prosodic Strengthening on the Production of English High Front Vowels /i, ɪ/ by Native vs. Non-Native Speakers (원어민과 비원어민의 영어 전설 고모음 /i, ɪ/ 발화에 나타나는 운율 강화 현상)

  • Kim, Sahyang;Hur, Yuna;Cho, Taehong
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.129-136
    • /
    • 2013
  • This study investigated how acoustic characteristics (i.e., duration, F1, F2) of English high front vowels /i, ɪ/ are modulated by boundary- and prominence-induced strengthening in native vs. non-native (Korean) speech production. The study also examined how the durational difference in vowels due to the voicing of a following consonant (i.e., voiced vs. voiceless) is modified by prosodic strengthening in two different (native vs. non-native) speaker groups. Five native speakers of Canadian English and eight Korean learners of English (intermediate-advanced level) produced 8 minimal pairs with the CVC sequence (e.g., 'beat'-'bit') in varying prosodic contexts. Native speakers distinguished the two vowels in terms of duration, F1, and F2, whereas non-native speakers only showed durational differences. The two groups were similar in that they maximally distinguished the two vowels when the vowels were accented (F2, duration), while neither group showed boundary-induced strengthening in any of the three measurements. The durational differences due to the voicing of the following consonant were also maximized when accented. The results are discussed further in terms of phonetics-prosody interface in L2 production.

Frequency of grammar items for Korean substitution of /u/ for /o/ in the word-final position (어말 위치 /ㅗ/의 /ㅜ/ 대체 현상에 대한 문법 항목별 출현빈도 연구)

  • Yoon, Eunkyung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.33-42
    • /
    • 2020
  • This study identified the substitution of /u/ for /o/ (e.g., pyəllo [pyəllu]) in Korean based on the speech corpus as a function of grammar items. Korean /o/ and /u/ share the vowel feature [+rounded], but are distinguished in terms of tongue height. However, researchers have reported that the merger of Korean /o/ and /u/ is in progress, making them indistinguishable. Thus, in this study, the frequency of the phonetic manifestation /u/ of the underlying form of /o/ for each grammar item was calculated in The Korean Corpus of Spontaneous Speech (Seoul Corpus 2015) which is a large corpus from a total of 40 speakers from Seoul or Gyeonggi-do. It was then confirmed that linking endings, particles, and adverbs ending with /o/ in the word-final position were substituted for /u/ approximately 50% of the stimuli, whereas, in nominal items, they were replaced at a frequency of less than 5%. The high rates of substitution were the special particle "-do[du]" (59.6%) and the linking ending "-go[gu]" (43.5%) among high-frequency items. Observing Korean pronunciation in real life provides deep insight into its theoretical implications in terms of speech recognition.

Reduction and Frequency Analyses of Vowels and Consonants in the Buckeye Speech Corpus

  • Yang, Byung-Gon
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.75-83
    • /
    • 2012
  • The aims of this study were three. First, to examine the degree of deviation from dictionary prescribed symbols and actual speech made by American English speakers. Second, to measure the frequency of vowel and consonant production of American English speakers. And third, to investigate gender differences in the segmental sounds in a speech corpus. The Buckeye Speech Corpus was recorded by forty American male and female subjects for one hour per subject. The vowels and consonants in both the phonemic and phonetic transcriptions were extracted from the original files of the corpus and their frequencies were obtained using codes of a free software R. Results were as follows: Firstly, the American English speakers produced a reduced number of vowels and consonants in daily conversation. The reduction rate from the dictionary transcriptions to the actual transcriptions was around 38.2%. Secondly, the American English speakers used more front high and back low vowels while three-fourths of the consonants accounted for stops, fricatives, and nasals. This indicates that the segmental inventory has nonlinear frequency distribution in the speech corpus. Thirdly, the two gender groups produced vowels and consonants similarly even though there were a few noticeable differences in their speech. From these results we propose that English teachers consider pronunciation education reflecting the actual speech sounds and that linguists find a way to establish unmarked segmentals from speech corpora.

An Analysis of Acoustic Features Caused by Articulatory Changes for Korean Distant-Talking Speech

  • Kim Sunhee;Park Soyoung;Yoo Chang D.
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.2E
    • /
    • pp.71-76
    • /
    • 2005
  • Compared to normal speech, distant-talking speech is characterized by the acoustic effect due to interfering sound and echoes as well as articulatory changes resulting from the speaker's effort to be more intelligible. In this paper, the acoustic features for distant-talking speech due to the articulatory changes will be analyzed and compared with those of the Lombard effect. In order to examine the effect of different distances and articulatory changes, speech recognition experiments were conducted for normal speech as well as distant-talking speech at different distances using HTK. The speech data used in this study consist of 4500 distant-talking utterances and 4500 normal utterances of 90 speakers (56 males and 34 females). Acoustic features selected for the analysis were duration, formants (F1 and F2), fundamental frequency, total energy and energy distribution. The results show that the acoustic-phonetic features for distant-talking speech correspond mostly to those of Lombard speech, in that the main resulting acoustic changes between normal and distant-talking speech are the increase in vowel duration, the shift in first and second formant, the increase in fundamental frequency, the increase in total energy and the shift in energy from low frequency band to middle or high bands.