• Title/Abstract/Keywords: speech cues

Search results: 117 items (processing time: 0.018 s)

발음평가용 멀티미디어 시스템 구현을 위한 구어 프랑스어의 음향학적 단서 (Acoustic Cues in Spoken French for the Pronunciation Assessment Multimedia System)

  • 이은영;송미영
    • 음성과학 (Speech Sciences) / Vol. 12, No. 3 / pp. 185-200 / 2005
  • The objective of this study is to examine acoustic cues in spoken French for pronunciation assessment, which is needed to implement a multimedia pronunciation-assessment system. The corpus is composed of simple expressions that cover the French phonological system and include all of its phonemes. The experiment was conducted with 4 male and female native French speakers and 20 Korean speakers, university students who had studied French for more than two years. We analyzed the recorded data with a spectrograph and measured the features to be compared as numerical values. We first computed the mean and the deviation for all phonemes, and then chose the features that showed a high error frequency and large differences between French and Korean pronunciations. The selected data were then simplified and compared with one another. After judging, from articulatory and auditory perspectives, whether each Korean speaker's pronunciation problems were utterance mistakes or interference from the mother tongue, we tried to find acoustic features that were as simple as possible. From this experiment, we extracted acoustic cues for building a French pronunciation training system.
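
The selection step described in the abstract above (compute a mean and deviation per feature, then keep the features where learners differ most from native speakers) might be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' procedure: the feature names, the sample values, and the ranking score (absolute mean difference divided by the native standard deviation) are all hypothetical, and the error-frequency criterion mentioned in the abstract is not modeled.

```python
from statistics import mean, stdev


def summarize(values):
    """Return (mean, sample standard deviation) of a list of measurements."""
    return mean(values), stdev(values)


def rank_by_group_difference(native, learner):
    """Rank features by how far the learner mean lies from the native mean.

    The score is |learner mean - native mean| / native standard deviation.
    This scoring rule is an assumption made for illustration only.
    Each feature needs at least two native values with non-zero spread.
    """
    scores = {}
    for feature in native:
        native_mean, native_sd = summarize(native[feature])
        learner_mean, _ = summarize(learner[feature])
        scores[feature] = abs(learner_mean - native_mean) / native_sd
    # Largest normalized difference first.
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical measurements (not data from the paper).
native = {"feature_a": [10, 12, 14], "feature_b": [100, 110, 120]}
learner = {"feature_a": [20, 22, 24], "feature_b": [101, 111, 121]}

ranking = rank_by_group_difference(native, learner)
print(ranking)
```

Normalizing by the native deviation keeps features measured in different units (for example milliseconds and hertz) comparable, which is why the sketch does not rank on the raw mean difference.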

Multiple Acoustic Cues for Stop Recognition

  • Yun, Weon-Hee
    • 대한음성학회:학술대회논문집 (Phonetic Society of Korea: Conference Proceedings) / Proceedings of the October 2003 Conference of the Phonetic Society of Korea / pp. 3-16 / 2003
  • Acoustic characteristics of stops in speech with contextual variability; possibility of stop recognition by a post-processing technique; further work: speech database, modification of the decoder, automatic segmentation of acoustic parameters.

폐쇄음 음향 단서의 다차원 표현과 상관관계 분석 (Multi-dimensional Representation and Correlation Analyses of Acoustic Cues for Stops)

  • 윤원희
    • 대한음성학회지:말소리 (Malsori: Journal of the Phonetic Society of Korea) / Vol. 55 / pp. 45-60 / 2005
  • The purpose of this paper is to represent the values of acoustic cues for Korean oral stops in a multi-dimensional space, and to look for possible relationships among the acoustic cues through correlation analyses. The acoustic cues used to differentiate the 3 types of Korean stops are closure duration, voice onset time, and the fundamental frequency of the vowel after a stop. The values of these cues are plotted in two- and three-dimensional space to see which cues are critical for separating the different types of stops. Correlation coefficient analyses show that a multivariate approach to the statistical analysis is legitimate, and that there are statistically significant relationships among the acoustic cues, but they are not strong enough to support the conjecture that there is a relationship among the articulatory or laryngeal mechanisms employed by the acoustic cues.
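
The correlation analysis described in the abstract above can be illustrated with a pairwise Pearson correlation over the three cues it names (closure duration, voice onset time, and the fundamental frequency of the following vowel). This is a minimal sketch: the numbers are hypothetical placeholders rather than measurements from the paper, and the abstract does not say which correlation coefficient or software the author used, so Pearson's r is an assumption.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_table(cues):
    """Return {(cue_a, cue_b): r} for every unordered pair of cues."""
    return {(a, b): pearson(cues[a], cues[b]) for a, b in combinations(cues, 2)}


# Hypothetical values, one entry per stop token (not data from the paper).
cues = {
    "closure_duration_ms": [60.0, 85.0, 140.0, 70.0, 150.0, 90.0],
    "vot_ms": [65.0, 95.0, 12.0, 55.0, 10.0, 100.0],
    "f0_hz": [110.0, 135.0, 130.0, 105.0, 138.0, 140.0],
}

for pair, r in correlation_table(cues).items():
    print(pair, round(r, 3))
```

A coefficient can be statistically significant and still small in magnitude, which is the distinction the abstract draws; a significance test would need the sample size in addition to r and is left out of this sketch.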

한국어 폐쇄음 음향단서의 다차원 표현 (Multi-dimensional Representation of Acoustic Cues for Korean Stops)

  • 윤원희
    • 대한음성학회:학술대회논문집 (Phonetic Society of Korea: Conference Proceedings) / Proceedings of the Spring 2005 Conference of the Phonetic Society of Korea / pp. 25-28 / 2005
  • The purpose of this paper is to represent the values of acoustic cues for Korean oral stops in a multi-dimensional space, and to look for possible relationships among the acoustic cues through correlation coefficient analyses. The acoustic cues used to differentiate the 3 types of Korean stops are closure duration, voice onset time, and the fundamental frequency of the vowel after a stop. The values of these cues are plotted in two- and three-dimensional space to see which cues are critical for complete separation of the different types of stops. Correlation coefficient analyses show that there are statistically significant relationships among the acoustic cues, but they are not strong enough to support the conjecture that there is an articulatory relationship among the mechanisms employed by the acoustic cues.

한국어 파열음의 음향적 특성과 지각 단서 (Acoustic characteristics and perceptual cues for Korean Stops)

  • 이경희;정명숙
    • 음성과학 (Speech Sciences) / Vol. 7, No. 2 / pp. 139-155 / 2000
  • The aim of this research is to investigate the acoustic characteristics of the three types of Korean stops (plain, tensed, and aspirated) and to use them as a basis for determining which can serve as perceptual cues. In this paper, we examine the acoustic characteristics of Korean stops, especially voice onset time (VOT), closure duration (CD), the pitch of the following vowel, and differences in the intensity build-up after the onset of voicing. These characteristics differ between word-initial and word-medial positions: in word-initial position the three Korean stops are distinguished by VOT and pitch, whereas in word-medial position they are distinguished by CD, VOT, and pitch. However, the acoustic characteristics do not carry equal weight as perceptual cues. In both word-initial and word-medial positions, the immediately following vowel plays the most important role in the perception of Korean stops. In word-medial position, CD and VOT also play important perceptual roles. To obtain a more fine-grained distinction among Korean stops, future research should investigate which factor(s) of the following vowel is/are the most decisive perceptual cue(s). Based on our investigation, however, we may conclude that pitch is highly likely to be one of the most important perceptual cues for distinguishing the three Korean stops.

Two-Microphone Binary Mask Speech Enhancement in Diffuse and Directional Noise Fields

  • Abdipour, Roohollah;Akbari, Ahmad;Rahmani, Mohsen
    • ETRI Journal / Vol. 36, No. 5 / pp. 772-782 / 2014
  • Two-microphone binary mask speech enhancement (2mBMSE) has been of particular interest in recent literature and has shown promising results. Current 2mBMSE systems rely on spatial cues of speech and noise sources. Although these cues are helpful for directional noise sources, they lose their efficiency in diffuse noise fields. We propose a new system that is effective in both directional and diffuse noise conditions. The system exploits two features. The first determines whether a given time-frequency (T-F) unit of the input spectrum is dominated by a diffuse or directional source. A diffuse signal is certainly a noise signal, but a directional signal could correspond to a noise or speech source. The second feature discriminates between T-F units dominated by speech or directional noise signals. Speech enhancement is performed using a binary mask, calculated based on the proposed features. In both directional and diffuse noise fields, the proposed system segregates speech T-F units with hit rates above 85%. It outperforms previous solutions in terms of signal-to-noise ratio and perceptual evaluation of speech quality improvement, especially in diffuse noise conditions.
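
The two-stage decision described in the abstract above (first separate diffuse from directional time-frequency units, then separate speech from directional noise, and keep only the units that pass both) might be sketched as follows. This is a sketch under stated assumptions, not the authors' system: the abstract does not name the two features, so `directionality` and `speech_score` are hypothetical per-unit scores in [0, 1], the thresholds are arbitrary, and the hit rate is taken to be the fraction of speech-dominated units that the mask retains.

```python
def binary_mask(directionality, speech_score, dir_threshold=0.5, speech_threshold=0.5):
    """Build a 0/1 mask over time-frequency units.

    A unit is kept (1) only if it looks directional AND looks like speech.
    Both feature grids and both thresholds are hypothetical placeholders.
    """
    mask = []
    for dir_row, speech_row in zip(directionality, speech_score):
        mask.append([
            1 if (d > dir_threshold and s > speech_threshold) else 0
            for d, s in zip(dir_row, speech_row)
        ])
    return mask


def apply_mask(spectrum, mask):
    """Zero out the time-frequency units that the mask rejects."""
    return [[x * m for x, m in zip(s_row, m_row)] for s_row, m_row in zip(spectrum, mask)]


def hit_rate(mask, ideal_mask):
    """Fraction of units marked 1 in the ideal mask that the estimated mask keeps."""
    hits = 0
    total = 0
    for m_row, i_row in zip(mask, ideal_mask):
        for m, i in zip(m_row, i_row):
            if i == 1:
                total += 1
                hits += m
    return hits / total if total else 0.0


# Toy 2x2 grid of time-frequency units (rows: frames, columns: frequency bins).
directionality = [[0.9, 0.2], [0.8, 0.7]]
speech_score = [[0.8, 0.9], [0.1, 0.6]]
spectrum = [[2.0, 3.0], [4.0, 5.0]]

mask = binary_mask(directionality, speech_score)
enhanced = apply_mask(spectrum, mask)
print(mask, enhanced)
```

In the toy grid, the unit with low directionality is rejected as diffuse noise regardless of its speech score, which mirrors the abstract's statement that a diffuse signal is treated as noise while a directional one still needs the second check.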

Effects of phonological and phonetic information of vowels on perception of prosodic prominence in English

  • Suyeon Im
    • 말소리와 음성과학 (Phonetics and Speech Sciences) / Vol. 15, No. 3 / pp. 1-7 / 2023
  • This study investigates how the phonological and phonetic information of vowels influences prosodic prominence among linguistically untrained listeners using public speech in American English. We first examined the speech material's phonetic realization of vowels (i.e., maximum F0, F0 range, phone rate [as a measure of duration considering the speech rate of the utterance], and mean intensity). Results showed that the high vowels /i/ and /u/ likely had the highest max F0, while the low vowels /æ/ and /ɑ/ tended to have the highest mean intensity. Both high and low vowels had similarly high phone rates. Next, we examined the effects of the vowels' phonological and phonetic information on listeners' perceptions of prosodic prominence. The results showed that vowels significantly affected the likelihood of perceived prominence independent of acoustic cues. The high and low vowels affected probability of perceived prominence less than the mid vowels /ɛ/ and /ʌ/, although the former two were more likely to be phonetically enhanced in the speech than the latter. Overall, these results suggest that perceptions of prosodic prominence in English are not directly influenced by signal-driven factors (i.e., vowels' acoustic information) but are mediated by expectation-driven factors (e.g., vowels' phonological information).

중의적 문장 인지에 있어서의 구경계의 영향 (The Influence of Phrasing on the Perception of Ambiguous Sentences)

  • 강선미;김기호;이주경
    • 음성과학 (Speech Sciences) / Vol. 14, No. 4 / pp. 65-80 / 2007
  • This experimental study investigates the acoustic cues that native English speakers produce to disambiguate ambiguous sentences. It also investigates whether Korean learners of English and native English speakers can perceive the intended meanings of sentences produced with those acoustic cues. In the perception test, native English speakers successfully identified the intended meaning by using the intonational cues, while Korean learners had difficulty distinguishing the differences in meaning. The break interval was manipulated to see whether pause duration facilitates or interferes with disambiguation. Although phrasing played an important role in disambiguation, the break interval itself had no influence on it. The result therefore suggests that the tonal realization of phrasal accents and boundary tones is more significant than the break interval in the perception of phrasing.

한국인 영어 학습자의 영어 단어 경계 인지 시 변이음 단서 사용 연구 (A Study of the use of allophonic cues in the perception of English word boundaries by Korean learners of English)

  • 장수영;박한상
    • 말소리와 음성과학 (Phonetics and Speech Sciences) / Vol. 3, No. 3 / pp. 63-68 / 2011
  • This study investigates how Korean students use acoustic-phonetic cues in perceiving word boundaries in near-homophonous English phrases. In the experiment, 60 Korean college students discriminated word boundaries in 42 pairs of stimuli containing the allophonic cues of aspiration and glottal stop. Results were analysed in terms of the correctness of responses and the correlation between correctness and confidence. Stimulus pairs with the glottal stop cue yielded higher correctness, while those with the aspiration cue yielded relatively lower correctness. Comparison with previous studies of English and Japanese speakers showed that Korean and Japanese speakers of English score substantially lower than native speakers of English, and that Korean learners of English as a foreign language score lower than Japanese speakers of English as a second language.

유창성 실어증과 비유창성 실어증 환자의 생성 이름대기 특성 연구 (A Comparison of Generative Naming Characteristics in Fluent and Non-fluent Aphasics)

  • 김애리;심현섭;김영태
    • 음성과학 (Speech Sciences) / Vol. 11, No. 4 / pp. 151-161 / 2004
  • The generative naming abilities of fluent and non-fluent aphasics were investigated in 10 fluent aphasics (6 Wernicke's and 4 conduction type) and 10 non-fluent aphasics (6 Broca's and 4 transcortical motor type). Subjects were given 2 types of generative naming task and asked to generate lists of words in response to categorical ('animal', 'things at a supermarket') and phonetic ('ㄱ', 'ㅇ', 'ㅅ') cues. The total numbers of correct and incorrect responses and the error type ratios were calculated. The results were as follows: (1) Fluent aphasics had higher generative naming scores than non-fluent aphasics. (2) A marked dissociation between performance on the categorical and phonetic cues was observed in both aphasic groups; both groups produced a large number of responses to the categorical cues. (3) There was no significant group difference in error type. (4) No correlation was found between generative naming and confrontation naming in the K-WAB.