A statistical analysis of vowel inventories of world languages

  • Byunggon Yang
    • Phonetics and Speech Sciences / v.16 no.3 / pp.1-6 / 2024
  • Vowels are fundamental elements of spoken language, providing insights into linguistic patterns and phonological systems. This study examines vowel inventories from 913 languages in a database, analyzing their statistical distributions. It investigates unique vowels within each language category, focusing on the two major dimensions of vowel height and backness, including diphthongs. The results show that vowel phonemes without diacritics constitute 30% of all distinct vowel types but account for 64% of the total vowel phonemes, highlighting the dominance of primary vowel articulations. The most frequent vowels are /i/, followed by /u/, /a/, /o/, /e/, /ɛ/, and /ɔ/. Multidimensional scaling of vowels, with or without diacritics, reveals distinctive clusters and co-occurrence patterns, necessitating more detailed analysis. Future research should consider the establishment of linguistic criteria on vowel representation and incorporate actual speech data for comprehensive linguistic studies.
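The contrast between the share of distinct vowel types and the share of total vowel phonemes can be illustrated with a small count. This is only a sketch under stated assumptions: the three toy inventories are invented for illustration and are not the study's data, and `has_diacritic` is a hypothetical helper that treats Unicode combining marks and modifier letters as diacritics, which may differ from the criteria used in the study.

```python
from collections import Counter
import unicodedata


def has_diacritic(vowel: str) -> bool:
    """Assumed criterion: any combining mark (Mn) or modifier letter (Lm)."""
    decomposed = unicodedata.normalize("NFD", vowel)
    return any(unicodedata.category(ch) in {"Mn", "Lm"} for ch in decomposed)


# Invented toy inventories (hypothetical, not taken from the database).
inventories = {
    "lang_a": ["i", "u", "a"],
    "lang_b": ["i", "u", "a", "e", "o"],
    # "a\u0303" is nasalized a; "i\u02d0" is long i (length mark).
    "lang_c": ["i", "u", "a", "a\u0303", "i\u02d0"],
}

# Count in how many languages each vowel occurs (one count per language).
counts = Counter(v for inv in inventories.values() for v in dict.fromkeys(inv))

plain_types = [v for v in counts if not has_diacritic(v)]

# Share of distinct vowel types that carry no diacritic.
type_share = len(plain_types) / len(counts)
# Share of all vowel phonemes (summed over languages) that carry no diacritic.
token_share = sum(counts[v] for v in plain_types) / sum(counts.values())

print(counts.most_common(3))
print(f"type share: {type_share:.2f}, token share: {token_share:.2f}")
```

In this toy data the plain vowels are a minority-leaning share of the types but a larger share of the phonemes, which is the same kind of asymmetry the abstract reports.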

Physiologic Phonetics for Korean Stop Production (한국어 자음생성의 생리음성학적 특성)

  • Hong, Ki-Hwan;Yang, Yoon-Soo
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.17 no.2 / pp.89-97 / 2006
  • Korean stop consonants are classified into three types according to manner of articulation: unaspirated (UA), slightly aspirated (SA), and heavily aspirated (HA). Both the UA and the HA types are always voiceless in any environment. Voice onset time (VOT) can generally be measured spectrographically from the release of the consonant burst to the onset of the following vowel. The VOT of the UA type is within 20 msec of the burst, about 40-50 msec for the SA type, and 50-70 msec for the HA type. There have been many efforts to clarify the properties that differentiate these manner categories. Umeda et al.$^{1)}$ found that the fundamental frequency at voice onset after both the UA and HA consonants was higher than that after the SA consonants, and that voice onset times were longest for the HA type, followed by the SA and UA types. Han et al.$^{2)}$ reported in their speech synthesis and perception studies that the SA and UA stops differed primarily in terms of a gradual versus a relatively rapid intensity build-up of the following vowel after the stop release. Lee et al.$^{3)}$ measured both intraoral and subglottal air pressure and found that the subglottal pressure was higher for the HA stop than for the other two stops. They also compared the dynamic pattern of the subglottal pressure slope for the three categories and found that the HA stop showed the most rapid increase in subglottal pressure in the period immediately before the stop release. Kagaya$^{4)}$ reported fiberscopic and acoustic studies of the Korean stops. He suggested that the UA type may be characterized by completely adducted and stiffened vocal folds with an abrupt decrease in stiffness near voice onset, while the HA type may be characterized by extensively abducted vocal folds and heightened subglottal pressure. On the other hand, none of these positive gestures is observed for the SA type.
Hong et al.$^{5)}$ studied electromyographic activity of the thyroarytenoid and posterior cricoarytenoid (PCA) muscles during stop production. They reported a marked and early activation of the PCA muscle associated with a steep reactivation of the thyroarytenoid muscle before voice onset in the production of the HA consonants. The production of the UA consonants was characterized by little or no activation of the PCA muscle and the earliest and most marked reactivation of the thyroarytenoid muscle. For the SA consonants, they reported a more moderate activation of the PCA muscle than for the UA consonant, and the least and latest reactivation of the thyroarytenoid muscle. Hong et al.$^{6)}$ observed the vibratory movements of the vocal fold edges in terms of laryngeal gestures for the different types of stop consonants. The movements of the vocal fold edges were evaluated using high-speed digital images. EGG signals and acoustic waveforms were also evaluated and related to the vibratory movements of the vocal fold edges during stop production.
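The VOT measurement described in this abstract, from burst release to the onset of the following vowel, reduces to a time difference. The sketch below is illustrative only: the function names are hypothetical, the lower bound of 0 msec for the UA type is an assumption (the abstract says only "within 20 msec"), and the ranges are descriptive values from the abstract, not a validated classifier.

```python
# VOT ranges in msec as stated in the abstract; the UA lower bound of 0
# is an assumption made for this sketch.
REPORTED_VOT_MS = {
    "UA": (0.0, 20.0),
    "SA": (40.0, 50.0),
    "HA": (50.0, 70.0),
}


def vot_ms(burst_release_s: float, voice_onset_s: float) -> float:
    """VOT in msec: time from burst release to onset of voicing."""
    return (voice_onset_s - burst_release_s) * 1000.0


def compatible_types(vot: float) -> list[str]:
    """Stop types whose reported range contains this VOT (may be empty)."""
    return [name for name, (lo, hi) in REPORTED_VOT_MS.items() if lo <= vot <= hi]


# Hypothetical time stamps read off a spectrogram, in seconds.
measured = vot_ms(burst_release_s=0.100, voice_onset_s=0.160)
print(round(measured, 1), compatible_types(measured))
```

Because the reported ranges leave a gap between 20 and 40 msec and meet at 50 msec, the helper returns every compatible type rather than forcing a single label.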

Speech Rates of Male Esophageal Speech (식도발성 남성 발화의 말 속도)

  • Park, Won-Kyoung;Shim, Hee-Jeong;Ko, Do-Heung
    • Phonetics and Speech Sciences / v.4 no.3 / pp.143-149 / 2012
  • The purpose of this study is to investigate the speech rate of esophageal speakers who are capable of vocalization after surgery. The subjects were 10 male esophageal speakers and 10 male laryngeal speakers. Each group read a passage that was recorded with a DAT recorder (Roland, EDIROL R-09). The recordings were analyzed using CSL (Computerized Speech Lab, model 4150). The results were as follows: (1) the overall speech rate of esophageal speech was 2.50 SPS (syllables per second), while that of laryngeal speech was 4.23 SPS. (2) The articulatory rate of esophageal speech was 3.14 SPS, while that of laryngeal speech was 4.75 SPS. Both the speech rates and the articulatory rates of esophageal speech were significantly lower than those of laryngeal speech. These differences between the two groups may be due to the reduced efficiency of airflow across the pharyngeal-esophageal segment in esophageal speakers compared with airflow through the glottis in laryngeal speakers. These results may provide a guideline for speech rates of esophageal speakers in clinical settings.
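The abstract reports two rate measures without defining them. The sketch below assumes the definitions commonly used in speech science: overall speech rate divides the syllable count by the total reading time including pauses, and articulatory rate divides it by the time remaining after pauses are removed. The function names and the sample numbers are hypothetical and are not the study's data.

```python
def speech_rate(n_syllables: int, total_sec: float) -> float:
    """Syllables per second over the whole reading, pauses included."""
    if total_sec <= 0:
        raise ValueError("total_sec must be positive")
    return n_syllables / total_sec


def articulatory_rate(n_syllables: int, total_sec: float, pause_sec: float) -> float:
    """Syllables per second over speaking time only, pauses excluded."""
    speaking_sec = total_sec - pause_sec
    if speaking_sec <= 0:
        raise ValueError("pause_sec must be smaller than total_sec")
    return n_syllables / speaking_sec


# Made-up reading: 100 syllables in 40 s, of which 8 s are pauses.
print(speech_rate(100, 40.0))             # 2.5
print(articulatory_rate(100, 40.0, 8.0))  # 3.125
```

Under these definitions the articulatory rate is always at least as high as the overall speech rate, which matches the direction of the values reported for both groups.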

Characteristics of Speech Intelligibility and the Vowel Space in Patients with Parkinson's disease (파킨슨병 환자의 말 명료도와 모음 공간 특성)

  • Shim, Hee-Jeong;Park, Won-Kyoung;Ko, Do-Heung
    • Phonetics and Speech Sciences / v.4 no.3 / pp.161-169 / 2012
  • The purpose of this study was to investigate the speech intelligibility of spontaneous speech and the vowel space parameters in patients with Parkinson's disease (PD). Ten PD patients (M=5, F=5) and a corresponding control group of ten normal adults participated in this study. First, subjects were asked to tell a story about their hometown and youth so that speech intelligibility could be analyzed. Second, the subjects were asked to repeat four vowels (/a/, /i/, /u/, /e/) five times so that their vowel spaces could be compared. The results were as follows: (1) the speech intelligibility of the PD group was lower than that of the control group. (2) Four parameters, namely vowel area, vowel articulatory index, formant centralization ratio, and F2i/F1u ratio, differed significantly between the groups. For instance, vowel area and F2 ratio were wider and higher, respectively. As a result, a decrease in speech intelligibility of patients with PD is likely to show different types of errors from the normal group. The results of this research are meaningful in the sense that they could provide an objective standard for speech intelligibility and vowel space parameters.
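Three of the vowel space parameters named in this abstract can be computed directly from mean formant values. The sketch below assumes the definitions usually given in the literature: the formant centralization ratio as (F2u + F2a + F1i + F1u) / (F2i + F1a), the vowel articulatory index as its inverse, and the vowel area as the area of the /i/-/e/-/a/-/u/ quadrilateral in the F2-F1 plane. The abstract does not state the formulas used in the study, and the formant values here are invented for illustration.

```python
# Hypothetical mean formants in Hz: vowel -> (F1, F2). Not the study's data.
formants = {
    "i": (300.0, 2300.0),
    "e": (500.0, 2000.0),
    "a": (800.0, 1300.0),
    "u": (350.0, 800.0),
}


def vowel_area(f: dict) -> float:
    """Area (Hz^2) of the i-e-a-u quadrilateral via the shoelace formula."""
    pts = [(f[v][1], f[v][0]) for v in ("i", "e", "a", "u")]  # (F2, F1)
    twice_area = 0.0
    for (x1, y1), (x2, y2) in zip(pts, pts[1:] + pts[:1]):
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def fcr(f: dict) -> float:
    """Formant centralization ratio; rises as vowels centralize."""
    return (f["u"][1] + f["a"][1] + f["i"][0] + f["u"][0]) / (f["i"][1] + f["a"][0])


def vai(f: dict) -> float:
    """Vowel articulatory index; the inverse of FCR."""
    return 1.0 / fcr(f)


print(vowel_area(formants), round(fcr(formants), 3), round(vai(formants), 3))
```

The vertex order i, e, a, u is chosen so that the polygon does not self-intersect for typical formant values; a different vowel set or ordering would need its own check.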

A Study on Vestibulosaccular Hearing (전정구형낭 청력에 관한 연구)

  • Heo, Seung-Deok
    • Phonetics and Speech Sciences / v.4 no.3 / pp.179-186 / 2012
  • The aims of this study are to consider auditory physiological characteristics and to confirm audiological evaluation and interpretation with regard to cases of sensorineural hearing loss that present an abnormal air-bone gap (AB gap). Vestibulosaccular hearing occurs when there is an abnormally large AB gap in sensorineural hearing loss, also known as pure cochlear conductive hearing loss. Generally, an AB gap is caused by damage to the external and/or middle ear. In conductive hearing loss, loss of air-conduction hearing occurs due to a loss of resonance in the outer ear and/or impedance mismatching in the middle ear. Most of these types of hearing loss can be treated medically and surgically. However, there is no medical treatment for an AB gap in sensorineural hearing loss, and hearing loss can worsen gradually or suddenly. In addition, many studies have reported that head trauma makes hearing loss even more serious. Therefore, in order to differentiate between conductive hearing losses, it is important to check whether or not there is an enlarged vestibular aqueduct by means of temporal bone computerized tomography and/or magnetic resonance imaging.

Phonological Status of Korean /w/: Based on the Perception Test

  • Kang, Hyun-Sook
    • Phonetics and Speech Sciences / v.4 no.3 / pp.13-23 / 2012
  • The sound /w/ has traditionally been regarded as an independent segment in Korean regardless of the phonological contexts in which it occurs. There have been, however, some questions regarding whether it is an independent phoneme in the /CwV/ context (cf. Kang 2006). The present pilot study examined how Korean /w/ is realized in the $/S^*wV/$ context by means of perception tests. Our assumption was that if Korean /w/ is a part of the preceding complex consonant like $/C^w/$, it should be more or less uniformly articulated and perceived as such. If /w/ is an independent segment, it will be realized with speaker variability. Experiments I and II examined the rates at which the spliced original stimuli $/S^*-V/$ and $/S^{w*}-^wV/$ and the cross-spliced stimuli $/S^{w*}-V/$ and $/S^*-^wV/$ were identified as "labialized". The results showed that the round qualities of /w/ are perceived at significantly different temporal points, with variability across speakers and contexts. We therefore conclude that /w/ in the $/S^*wV/$ context is an independent segment, not a part of the preceding segment. A full-scale production test should be performed in the future to verify the conclusion suggested in this paper.

English /s/ and Korean $/s^h/-/s^*/$ Contrast in Seoul and Busan Dialects: A Study of Category Solidity

  • Kang, Kyoung-Ho
    • Phonetics and Speech Sciences / v.4 no.3 / pp.3-12 / 2012
  • The primary goal of the current study was to examine the category solidity of Korean alveolar fricatives in the Busan and Seoul dialects of Korean. Considering the common belief in $/s^h/-/s^*/$ neutralization in Kyungsang speech, the plain $/s^h/$ and fortis $/s^*/$ fricatives of Busan speakers were examined against the same fricatives of Seoul speakers. The perceptual distance between Korean $/s^h/$ and $/s^*/$ on the one hand and English /s/ on the other was investigated using a cross-linguistic mapping method. Two experiments, a perceptual mapping task of English /s/ to Korean $/s^h/$ and $/s^*/$ and a $/s^*/$-production task, were conducted with speakers of the Busan and Seoul dialects of Korean. The results from the perception and production experiments suggested that, at a micro-level, younger Busan speakers have less solid category stability for Korean $/s^*/$ than Seoul speakers, although their productions of $/s^h/$ and $/s^*/$ were as highly distinct from each other as those of Seoul speakers.

The Effect of Visual Cues in the Identification of the English Consonants /b/ and /v/ by Native Korean Speakers (한국어 화자의 영어 양순음 /b/와 순치음 /v/ 식별에서 시각 단서의 효과)

  • Kim, Yoon-Hyun;Koh, Sung-Ryong;Hazan, Valerie
    • Phonetics and Speech Sciences / v.4 no.3 / pp.25-30 / 2012
  • This study investigated whether native Korean listeners could use visual cues for the identification of the English consonants /b/ and /v/. Both auditory and audiovisual tokens of minimal word pairs, in which the target phonemes were located in word-initial or word-medial position, were used. Participants were instructed to decide which consonant they heard in $2 \times 2$ conditions: cue (audio-only, audiovisual) and location (word-initial, word-medial). Mean identification scores were significantly higher for the audiovisual than for the audio-only condition and for the word-initial than for the word-medial condition. In addition, following signal detection theory, sensitivity (d') and response bias (c) were calculated from the hit rates and false alarm rates. These measures showed that the higher identification rate in the audiovisual condition was associated with an increase in sensitivity. There were no significant differences in response bias across conditions. This result suggests that native Korean speakers can use visual cues when identifying confusable non-native phonemic contrasts. Visual cues can enhance non-native speech perception.
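The two signal detection measures mentioned in this abstract have standard closed forms under the equal-variance Gaussian model: d' = z(H) - z(F) and c = -(z(H) + z(F)) / 2, where H is the hit rate, F is the false alarm rate, and z is the inverse of the standard normal cumulative distribution. The sketch below uses those textbook formulas; the function name and the sample rates are hypothetical, and whether the study applied any correction for rates of exactly 0 or 1 is not stated.

```python
from statistics import NormalDist


def dprime_and_c(hit_rate: float, fa_rate: float) -> tuple[float, float]:
    """Sensitivity d' and response bias c from hit and false alarm rates."""
    for rate in (hit_rate, fa_rate):
        if not 0.0 < rate < 1.0:
            # z is undefined at 0 and 1; a correction would be needed first.
            raise ValueError("rates must lie strictly between 0 and 1")
    z = NormalDist().inv_cdf  # inverse standard normal CDF
    z_hit, z_fa = z(hit_rate), z(fa_rate)
    d_prime = z_hit - z_fa
    c = -0.5 * (z_hit + z_fa)
    return d_prime, c


# Made-up rates: 80% hits, 20% false alarms.
d_prime, c = dprime_and_c(0.8, 0.2)
print(round(d_prime, 3), round(c, 3))
```

A higher d' with an unchanged c, as reported for the audiovisual condition, means listeners discriminated the two consonants better without shifting their preference toward either response.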

F0 Extrema Timing of HL and LH in North Kyungsang Korean: Evidence from a Mimicry Task

  • Kim, Jung-Sun
    • Phonetics and Speech Sciences / v.4 no.3 / pp.43-49 / 2012
  • This paper describes the categorical effects of pitch accent contrasts in a mimicry task. It focuses, specifically, on examining how fundamental frequency (f0) variation reflects phonological contrasts from speakers of two distinct varieties of Korean (i.e., North Kyungsang and South Cholla). The results showed that, in a mimicry task using synthetic speech continua, there was a categorical effect in f0 peak timing for North Kyungsang speakers, but the timing of f0 peaks and valleys in the responses of South Cholla speakers was more variable, presenting a gradient or non-categorical effect. Evidence of categorical effects was represented as the shift of f0 peak times along an acoustic continuum for North Kyungsang speakers. The range for the shift of f0 valley times was much narrower, compared to that of f0 peak times. The degree of a shift near the middle of the continuum showed variability across individual mimicry responses. However, the categorical structure in mimicry responses regarding the clustering of f0 peak points was more significant for North Kyungsang speakers than for South Cholla speakers. Additionally, the finding of the current study implies that the location of f0 peak times depends on individuals' imitative (or cognitive) abilities.

The Contribution of Prosody to the Foreign Accent of Chinese Talkers' English Speech

  • Liu, Xing;Lee, Joo-Kyeong
    • Phonetics and Speech Sciences / v.4 no.3 / pp.59-73 / 2012
  • This study investigates the contribution of prosody to the foreign accent in Chinese speakers' English production by examining synthesized speech in which the prosody and segments of native and non-native talkers are crossed. For the stimuli of the foreign accent ratings, we transplanted gender-matched native speakers' prosody onto non-native talkers' segments and vice versa, using the TD-PSOLA algorithm. Eight native English listeners judged the foreign accent and comprehensibility of the transplanted stimuli. Results showed that the synthesized stimuli were perceived as having a stronger foreign accent, regardless of speakers' proficiency, when English speakers' prosody was crossed with Chinese speakers' segments. This suggests that segments contribute more than prosody to native listeners' evaluation of foreign accent. When transplanted onto English speakers' segments, Chinese speakers' prosody showed a difference between high and low proficiency in duration rather than pitch, such that a stronger foreign accent was detected when low-proficiency Chinese speakers' duration was crossed with English speakers' segments. This indicates that prosody, more specifically duration, plays a role, although the role of prosody is not as significant overall as that of segments. According to the post hoc acoustic analysis, the temporal features that made the duration parameter more prominent than pitch were speaking rate, pause duration, and pause frequency. Finally, foreign accent and comprehensibility showed no significant correlation, such that native listeners had no difficulty listening to highly foreign-accented speech.