• Title/Summary/Keyword: Acoustic cues

Search Result 68, Processing Time 0.024 seconds

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.399-403
    • /
    • 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f $\theta$ s $\int$ h / are investigated. The results show that the perception of fricative segments may be explained in terms of 2-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.

  • PDF

Acoustic Characteristics and Pitch Accent Realization in English Elliptical Sentences - VP-ellipsis, sluicing, gapping - (영어 생략구문의 음성적 특성과 피치악센트 실현 양상-동사구 생략, 슬루싱, 공소화를 중심으로-)

  • Kim, Hee-Sung
    • Speech Sciences
    • /
    • v.11 no.2
    • /
    • pp.119-136
    • /
    • 2004
  • Ellipsis is the figure of speech characterized by the deliberate omission of words that are obviously understood, but that must be supplied to make a construction grammatically or semantically complete. The purpose of this study is to examine how ellipsis affects its adjacent elements acoustically and phonologically in English VP-ellipsis, sluicing and gapping. In the experiment, the realizations by English native speakers are set as the criteria for the observing point and are compared to Korean speakers' realizations. For the results, while English native speakers utilized various acoustic information such as word duration and pitch range and phonological information such as pith accent realization in order to intend the cues for decoding the missing constituent, Korean English learners relied on only duration information and could not use various information effectively.

  • PDF

The Acoustic Realization of Phrasal Verb vs. Verb-preposition (구절 동사와 전치사 수반동사의 의미에 따른 음성적 실현)

  • Kim, Hee-Sung;Song, Ji-Yeon;Kim, Kee-Ho
    • MALSORI
    • /
    • no.63
    • /
    • pp.67-84
    • /
    • 2007
  • Verb phrase could have two different meanings according to which is followed after verb; adverb or preposition. The meaning of 'verb+adverb' is deduced from a figurative meaning which is idiomatic expression, and 'verb+preposition' is interpreted as the literal meaning. The purpose of this study is to observe how English native speakers and Korean leaners of English distinguish two sentences of the same word strings with acoustic cues like pause and duration. According to the result, as pause was used for meaning distinction, it was likely that the pause length preceding prepositions was longer than that of following adverbs. To distinguish two sentences of the same word strings, all participants seemed to use pause, verb lengthening and adverb/preposition lengthening. Among them, there is a hierarchical significance; in sequence, pause, verb lengthening, adverb/preposition lengthening.

  • PDF

The final stop consonant perception in typically developing children aged 4 to 6 years and adults (4-6세 정상발달아동 및 성인의 종성파열음 지각력 비교)

  • Byeon, Kyeongeun;Ha, Seunghee
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.57-65
    • /
    • 2015
  • This study aimed to identify the development pattern of final stop consonant perception using the gating task. Sixty-four subjects participated in the study: 16 children aged 4 years, 16 children aged 5 years, 17 children aged 6 years, and 15 adults. One-syllable words with consonant-vowel-consonant(CVC) structure, mokㄱ-motㄱ and papㄱ-patㄱ were used as stimuli in order to remove the redundancy of acoustic cues in stimulus words, 40ms-length (-40ms) and 60ms-length (-60ms) from the entire duration of the final consonant were deleted. Three conditions (the whole word segment, -40ms, -60ms) were used for this speech perception experiment. 48 tokens (4 stimuli ${\times}3$ conditions ${\times}4$ trials) in total were provided for participants. The results indicated that 5 and 6 year olds showed final consonant perception similar to adults in stimuli, papㄱ-patㄱ and only the 6-year-old children showed perception similar to adults in stimuli, 'mokㄱ-motㄱ. The results suggested that younger typically developing children require more acoustic information to accurately perceive final consonants than older children and adults. Final consonant perception ability may become adult-like around 6 years old. The study provides fundamental data on the development pattern of speech perception in normal developing children, which can be used to compare to those of children with communication disorders.

Closure Duration and Pitch as Phonetic Cues to Korean Stop Identity in AP-medial Position: Perception Test

  • Kang, Hyun-Sook;Dilley, Laura
    • Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.25-39
    • /
    • 2007
  • The present study investigated some perceptual phonetic attributes of two Korean stop types, aspirated and lax, in medial position of an accentual phrase. The intonational pattern across syllables (Jun, 1993) is argued to depend on the type of stop (aspirated vs. lax) only in the initial position of an accentual phrase. In Kang & Dilley (2007), we showed that significant differences between aspirated and lax stops in medial position of an accentual phrase exist in closure duration, voice-onset time, and fundamental frequency (F0) values for post-stop vowels. In the present perception experiment, we investigated whether these phonetic attributes contribute to the perception of these two types of stops: The closure durations and/or F0's of post-stop vowels on accentual-phrase medial words were altered and twenty native Korean speakers then judged these words as beginning with an aspirated or lax stop. Both closure duration and F0 significantly affected judgments of stop identity. These results indicate that a wider range of acoustic cues that distinguish aspirated and lax Korean stops in production also plays a role in perception. To account for these results we suggest some phonetic and phonological models of consonant-tone interactions for Korean.

  • PDF

A Study of the Prosodic Characteristics of Homographs with Context Cues by Subjects with Right and Left Hemisphere Damage (문맥 내에서 좌우반구 손상자의 동음어에 대한 운율 산출 비교)

  • Lee, Myoung-Soon
    • Phonetics and Speech Sciences
    • /
    • v.2 no.1
    • /
    • pp.13-21
    • /
    • 2010
  • The purpose of this study was to examine the prosody characteristics of sentence-level utterances which contain homographs with context cues in patients with neurogenic communication disorders. Homographs which may be affected by prosody, especially tonic length features, were used to investigate this matter. The characteristics of tone, duration, pitch, and pitch peak were analyzed to examine the characteristics of prosody in patients with lesions in the left or right hemisphere and normal controls. The whole process was recorded using Praat 4.3.14 and for statistical analyses, three-way ANOVA and multiple comparative analyses, Chi-Square tests, and a one-way ANOVA were carried out using SPSS 12.0 for Windows. The conclusions of this study are as follows. First, the length of syllables and vowels in homographs in Korean was different depending on the meaning and was not significant between groups. Second, it was found that patients with lesions in the right hemisphere had significant difference on pitch. Third, it was found that frequency of pitch peak and tone in 'short' tone syllables were different between groups. The conclusion of this study found that the prosody of homographs between groups absolutely was not differentiated. Accordingly, more detailed studies of acoustic parameters and other parameters which the prosody characteristic between groups could be found are needed in the future.

  • PDF

A Study of Korean Phonetic and Phonological Properties for Speech Recognition and Synthesis (음성 인식/합성을 위한 국어의 음성-음운론적 특성 연구)

  • Chung, Kook;Koo, Hee-San;Lee, Chan-Do;Kim, Jong-Mi;Han , Sun-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.6
    • /
    • pp.31-44
    • /
    • 1994
  • The paper introduces several studies of various aspects of Korean phonology and phonetics for speech recognition and synthesis. The phonological and phonetic studies presented in this paper are : i) For a study of segmental phonology, we made an annotated list of Korean allophones and their corresponding alphabetic symbols to type into computers. ii) For a study of segmental phonetics, we present some acoustic regulations in Korean consonants according to their phonological environment within a word. iii) For a study of prosodic phonology, we suggest the phonological functions of prosodic features and their acoustic cues. iv) For a study of prosodic phonetics, we present the characteristic patterns of accent and intonation in Korean. v) Finally, we suggest some ways of using this phonological and phonetic knowledge for possible improvement of speech recognition and synthesis.

  • PDF

A New Method for Segmenting Speech Signal by Frame Averaging Algorithm

  • Byambajav D.;Kang Chul-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.4E
    • /
    • pp.128-131
    • /
    • 2005
  • A new algorithm for speech signal segmentation is proposed. This algorithm is based on finding successive similar frames belonging to a segment and represents it by an average spectrum. The speech signal is a slowly time varying signal in the sense that, when examined over a sufficiently short period of time (between 10 and 100 ms), its characteristics are fairly stationary. Generally this approach is based on finding these fairly stationary periods. Advantages of the. algorithm are accurate border decision of segments and simple computation. The automatic segmentations using frame averaging show as much as $82.20\%$ coincided with manually verified segmentation of CMU ARCTIC corpus within time range 16 ms. More than $90\%$ segment boundaries are coincided within a range of 32 ms. Also it can be combined with many types of automatic segmentations (HMM based, acoustic cues or feature based etc.).

Acoustic Realization of Metrical Structure in Orally Produced Korean Modern Poetry (한국 현대시 운율의 음향 발현)

  • Kim, Hyun-Gi;Hong, Ki-Hwan;Kim, Sun-Sook
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.181-192
    • /
    • 2004
  • The metrical structures in orally produced the poetry were generally analyzed by accent, metre and syllable. The purpose of this study is to investigate of metrical structures of Korean modem poetry using computer implemented speech analysis system. Two famous poet's poems confidential talk, Miloe and 'A buddhist dance, Sungmu' were selected for prosodic analysis. The informant is 60 years old professor in major of Korean and French poetry. The syllable structures of poems were analyzed primarily by vowel timbers, which can classified compact and diffuse vowels according to the distance of F2-F1. The perception cues of consonants were analyzed by VOT and tensity features of articulation. Rhythm is classified by dactyl, anapest, trochee, spondee and iambic. As a result, syllable structures of Korean modem poetry were mainly CV and CVC and the reading times of each lines were 3-4sec for 12 and 15 syllables. Main metre of Korean modem poems constructed the Imbic and Anapest. The break of each lines were demarcated by grammatical structure or meaning rather than phonetic structures.

  • PDF

Effects of attention on the perception of L2 phonetic contrast

  • Lee, Hyunjung
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.47-52
    • /
    • 2014
  • This study investigated how the degree of attention modulates English learners' perception of Korean stop contrasts. The contributions of VOT and F0 in perceiving Korean stops were examined while availability of attentional resources was manipulated using a dual-task paradigm. Results demonstrated the attentional modulation in the use of VOT, but not in F0: under less attention, the contribution of VOT to the perception of aspirated stops decreased, whereas that of lenis stops increased, which suggests more native-like performance. This implies that the role of attention in perceiving non-native contrasts might differ depending on how equivalent the acoustic and perceptual cues are between L1 and target L2 contrasts.