• Title/Summary/Keyword: Phonetics

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • Proceedings of the KSPS conference / 1996.10a / pp.399-403 / 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity, as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f θ s ʃ h/ are investigated. The results show that the perception of fricative segments may be explained in terms of a two-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.

Growth curve modeling of nucleus F0 on Korean accentual phrase

  • Yoon, Tae-Jin
    • Phonetics and Speech Sciences / v.9 no.3 / pp.17-23 / 2017
  • The present study investigates the effect of the Accentual Phrase on F0 using a subset of a large-scale corpus of Seoul Korean. Four-syllable words that were neither preceded nor followed by silent pauses were presumed to be canonical exemplars of Accentual Phrases in Korean. These four-syllable words were extracted from female speakers' speech samples. Growth curve analyses, a combination of regression and polynomial curve fitting, were applied to them. The words were divided into four groups according to the category of the initial segment: voiceless obstruents, voiced obstruents, sonorants, and vowels. The results indicate that the initial segment type affects F0 (in semitones) in the nucleus of the initial syllable, and the cubic polynomial term revealed that some of the medial low tones in the four-syllable words may be guided by the principle of contrast maximization, while others may be governed by the principle of ease of articulation.
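
The polynomial curve-fitting component of a growth curve analysis can be illustrated with a minimal sketch. It converts F0 from Hz to semitones and fits a cubic polynomial to one contour by ordinary least squares. The function names, the 100 Hz reference, and the F0 values are hypothetical; the abstract does not give the study's actual model, which may also include grouping factors and random effects that are omitted here.

```python
import math


def hz_to_semitones(f0_hz, ref_hz=100.0):
    """Convert F0 in Hz to semitones relative to ref_hz (the reference is an assumption)."""
    return 12.0 * math.log2(f0_hz / ref_hz)


def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_polynomial(times, values, degree=3):
    """Least-squares polynomial fit; returns coefficients [b0, b1, ..., b_degree]."""
    k = degree + 1
    # Normal equations (X^T X) beta = X^T y, where X[i][j] = times[i] ** j.
    xtx = [[sum(t ** (i + j) for t in times) for j in range(k)] for i in range(k)]
    xty = [sum((t ** i) * v for t, v in zip(times, values)) for i in range(k)]
    return solve_linear(xtx, xty)


# Hypothetical F0 samples (Hz) across one four-syllable word, on normalized time 0..1.
f0_hz = [210.0, 205.0, 198.0, 196.0, 201.0, 214.0, 226.0, 221.0]
times = [i / (len(f0_hz) - 1) for i in range(len(f0_hz))]
f0_st = [hz_to_semitones(f) for f in f0_hz]

intercept, linear, quadratic, cubic = fit_polynomial(times, f0_st, degree=3)
print("cubic term:", cubic)
```

In this kind of analysis the cubic coefficient summarizes how strongly the contour changes direction twice, which is why the abstract singles out the cubic term when discussing the medial low tones.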

Phonetics and Language as a formal System

  • Port, Robert F.;Leary, Adam P.
    • Lingua Humanitatis / v.5 / pp.221-264 / 2003
  • This paper takes issue with the idea of language as a 'serial-time structure' as opposed to the 'real-time event' of speech, an idea entrenched in the Chomskyan model of linguistic theory. The discussion centers on the leitmotif question: Is language constructed entirely from a finite set of a priori discrete symbol types, as the 'competence vs performance' dichotomy implies? A set of linguistic patterns examined in this study, largely with regard to phonological considerations, points to evidence to the contrary. That is, while the patterns may be said to be linguistically distinct, they are not discretely different, i.e. not different enough to be reliably differentiated. It is demonstrated that much current research in phonology, including the most recent Optimality Theory, is misdirected in that it falsely presupposes a discrete universal phonetic inventory. The main thrust of the present study is that there is no sharp boundary between 'competence', defined as the formal, symbolic, discrete-time domain of language and human cognition, on the one hand and 'performance', as the continuous, fuzzy, real-time domain of human physiology, on the other.

Analysis of the Timing of Spoken Korean Using a Classification and Regression Tree (CART) Model

  • Chung, Hyun-Song;Huckvale, Mark
    • Speech Sciences / v.8 no.1 / pp.77-91 / 2001
  • This paper investigates the timing of Korean spoken in a news-reading speech style in order to improve the naturalness of durations used in Korean speech synthesis. Each segment in a corpus of 671 read sentences was annotated with 69 segmental and prosodic features so that the measured duration could be correlated with the context in which it occurred. A CART model based on the features showed a correlation coefficient of 0.79 with an RMSE (root mean squared prediction error) of 23 ms between actual and predicted durations in reserved test data. These results are comparable with recently published results for Korean and similar to results found for other languages. An analysis of the classification tree shows that phrasal structure has the greatest effect on segment duration, followed by syllable structure and the manner features of surrounding segments. The place features of surrounding segments have only small effects. The model has applications in Korean speech synthesis systems.
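
The two evaluation figures quoted above, the correlation coefficient and the RMSE between actual and predicted durations, can be computed as in the following minimal sketch. The function names and the duration values are hypothetical and are not taken from the paper; the sketch covers only the evaluation step, not the CART model itself.

```python
import math


def pearson_r(actual, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


def rmse(actual, predicted):
    """Root mean squared prediction error, in the same unit as the inputs."""
    n = len(actual)
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n)


# Hypothetical segment durations in milliseconds from held-out test data.
actual_ms = [50.0, 80.0, 120.0, 65.0]
predicted_ms = [60.0, 70.0, 110.0, 75.0]

print("r =", pearson_r(actual_ms, predicted_ms))
print("RMSE (ms) =", rmse(actual_ms, predicted_ms))
```

Reporting both values is useful because the correlation shows how well the predictions track relative differences in duration, while the RMSE expresses the typical size of the error in milliseconds.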

Design and Construction of Korean-Spoken English Corpus (K-SEC) (한국인의 영어 음성코퍼스 설계 및 구축)

  • Rhee Seok-Chae;Lee Sook-Hyang;Kang Seok-keun;Lee Yong-Ju
    • Proceedings of the KSPS conference / 2003.05a / pp.12-20 / 2003
  • K-SEC (Korean-Spoken English Corpus) is a speech database currently under construction by the authors of this paper. This article discusses the need for K-SEC in various academic disciplines and industries, and introduces the characteristics of the K-SEC design and the catalogues and contents of the recorded database, with examples of what is being considered from the phonetics and phonology of both Korean and English. K-SEC can be regarded as the beginning of a parallel speech corpus, and it is suggested that similar corpora be expanded for the future advancement of experimental phonetics and speech information technology.

A preliminary study on laryngeal and supralaryngeal articulatory distinction of the three-way contrast of Korean velar stops

  • Jiyeon Song;Sahyang Kim;Taehong Cho
    • Phonetics and Speech Sciences / v.15 no.1 / pp.19-24 / 2023
  • This study investigated acoustic (VOT) and articulatory characteristics of Korean velar stops in monosyllabic CV structures to examine how the three-way distinction is realized in the laryngeal and supralaryngeal domains and how the distinction is manifested in male versus female speakers' speech production. EMA data were collected from 22 speakers. In line with previous studies, male speakers preserved the three-way differentiation of velar stops (/k*/ < /k/ < /kʰ/) in terms of VOT while female speakers showed only a two-way distinction (/k*/ < /k/ = /kʰ/). As for the kinematic characteristics, a clear three-way distinction was found only in male speakers' peak velocity measure in the C-to-V opening movement (/kʰ/ < /k/ < /k*/). For the other kinematic measures (i.e., articulatory closure duration, deceleration duration of the opening movement and the entire opening movement duration), male speakers showed only a two-way distinction between fortis and the other two stops. Female speakers did not show a three-way contrast in any kinematic measure. They showed a two-way distinction between lenis and the other two stops in C-to-V deceleration duration (/k*/ = /kʰ/ < /k/), and a two-way distinction between fortis and lenis stops in the opening movement duration. An overall comparison of VOT and articulatory analyses revealed that the lenis-aspirated kinematic distinction is diminishing, driven by female speakers, in line with the loss of the lenis-aspirated distinction in VOT that could influence supralaryngeal articulation.

Phonetic Evaluation in Speech Sciences and Issues in Phonetic Transcription (음성 평가의 다학문적 현황과 표기의 과제)

  • Kim, Jong-Mi
    • Speech Sciences / v.10 no.2 / pp.259-280 / 2003
  • The paper discusses how speech sounds are evaluated and transcribed in various fields of the speech sciences, and suggests ways to make transcription more accurate. The academic fields explored are phonetics, speech processing, speech pathology, and foreign language education. The discussion centers on the International Phonetic Alphabet (IPA), most commonly used in these fields, and other less widely accepted transcription conventions such as the TOnes and Break Indices (ToBI), the Speech Assessment Methods Phonetic Alphabet (SAMPA), an extension of the official Korean Romanization (KORBET), and the American-English transcription system in the TIMIT database (TIMITBET). These transcription conventions are examined for Korean, English, and Korean-accented English. The paper demonstrates that each transcription system can be recommended for a specific need in a particular academic field. Because it is widely known, the IPA is best suited for phonetic evaluation in the fields of phonetics, speech pathology, and foreign language education. The other transcription systems are useful for keyboard input of phonetically evaluated data from all these fields, as well as for sound transcription in speech engineering, because they use letter symbols that are convenient for typing, searching, and programming. Several practical suggestions are made for maintaining transcriptional efficiency and consistency in the face of intra- and inter-transcriber variability.

Effects of Prosodic Strengthening on the Production of English High Front Vowels /i, ɪ/ by Native vs. Non-Native Speakers (원어민과 비원어민의 영어 전설 고모음 /i, ɪ/ 발화에 나타나는 운율 강화 현상)

  • Kim, Sahyang;Hur, Yuna;Cho, Taehong
    • Phonetics and Speech Sciences / v.5 no.4 / pp.129-136 / 2013
  • This study investigated how acoustic characteristics (i.e., duration, F1, F2) of English high front vowels /i, ɪ/ are modulated by boundary- and prominence-induced strengthening in native vs. non-native (Korean) speech production. The study also examined how the durational difference in vowels due to the voicing of a following consonant (i.e., voiced vs. voiceless) is modified by prosodic strengthening in the two speaker groups (native vs. non-native). Five native speakers of Canadian English and eight Korean learners of English (intermediate-advanced level) produced eight minimal pairs with the CVC sequence (e.g., 'beat'-'bit') in varying prosodic contexts. Native speakers distinguished the two vowels in terms of duration, F1, and F2, whereas non-native speakers showed only durational differences. The two groups were similar in that they maximally distinguished the two vowels when the vowels were accented (F2, duration), while neither group showed boundary-induced strengthening in any of the three measurements. The durational differences due to the voicing of the following consonant were also maximized when accented. The results are discussed further in terms of the phonetics-prosody interface in L2 production.