• Title/Summary/Keyword: Phonetics

Search Result 948, Processing Time 0.019 seconds

How Korean Learner's English Proficiency Level Affects English Speech Production Variations

  • Hong, Hye-Jin;Kim, Sun-Hee;Chung, Min-Hwa
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.115-121
    • /
    • 2011
  • This paper examines how L2 speech production varies according to learner's L2 proficiency level. L2 speech production variations are analyzed by quantitative measures at word and phone levels using Korean learners' English corpus. Word-level variations are analyzed using correctness to explain how speech realizations are different from the canonical forms, while accuracy is used for analysis at phone level to reflect phone insertions and deletions together with substitutions. The results show that speech production of learners with different L2 proficiency levels are considerably different in terms of performance and individual realizations at word and phone levels. These results confirm that speech production of non-native speakers varies according to their L2 proficiency levels, even though they share the same L1 background. Furthermore, they will contribute to improve non-native speech recognition performance of ASR-based English language educational system for Korean learners of English.

  • PDF

Characteristics of speech intelligibility and speech acceptability connected with mouth opening condition (구강 개방 상태에 따른 말 명료도 및 말 용인도 특성)

  • Song, Yun-Kyung
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.141-148
    • /
    • 2011
  • There are many factors that affect speech intelligibility and speech acceptability. Structural anomalies and neuromotor pathologies are known for the reasons of abnormal speech sounds. And there are minor variations related to oral mechanism. Speaking with restricted mouth opening related to therapeutic procedure or habitual speech pattern might affect the quality of speech sounds. So this study compared speech intelligibility and speech acceptability of recorded 24 words in two conditions (restricted mouth opening condition and normal mouth opening condition) by 30 normal hearing adults. The results showed that speech intelligibility and speech acceptability were significantly lower in restricted mouth opening condition. And speech acceptability was significantly lower than speech intelligibility in restricted mouth opening condition. Speech acceptability in restricted mouth opening condition was significantly lower especially in open vowel. These findings indicated that the mouth opening condition could affect vowel shape and could be an adverse effect on speech intelligibility and speech acceptability.

  • PDF

Discourse Characteristics in Healthy Elderly: Effects of Aging, Gender and Educational Level (노년층의 담화 산출 특성: 노화, 성별, 교육정도에 따른 차이)

  • Choi, Hyun-Joo
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.135-143
    • /
    • 2012
  • Discourse is regarded as an important component of communication assessment, but studies about the discourse characteristics of the elderly are scant. The purpose of this study was to confirm the effects of aging, gender, and educational level on discourse in elderly people with normal cognitive function. Forty normal elderly and forty young people participated in this study. A picture description task (Boston Cookie-Theft picture) was used to examine discourse function. The description task was analyzed for both productivity (total number of sentences, total number of syllables, and syllables per sentence) and semantics (CIU ratio). The results were as follows: 1) Only CIU ratio differed significantly according to age. 2) In the total number of syllables and syllables per sentence, females demonstrate a higher number than males. 3) The CIU ratio differed significantly according to educational level. These results suggest that impairment of communicative function is an aspect of cognitive impairment that can be related to aging. Also, discourse performance in the elderly is associated with their gender and educational level.

The Effect of Word Frequency and Neighborhood Density on Spoken Word Segmentation in Korean (단어 빈도와 음절 이웃 크기가 한국어 명사의 음성 분절에 미치는 영향)

  • Song, Jin-Young;Nam, Ki-Chun;Koo, Min-Mo
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.3-20
    • /
    • 2012
  • The purpose of this study was to investigate whether a segmentation unit for a Korean noun is a 'syllable' and whether the process of segmenting spoken words occurs at the lexical level. A syllable monitoring task was administered which required participants to detect an auditorily presented target from visually presented words. In Experiment 1, syllable neighborhood density of high frequency words which can be segmented into both CV-CVC and CVC-VC were controlled. The syllable effect and the neighborhood density effect were significant, and the syllable effect emerged differently depending on the syllable neighborhood density. Similar results were obtained in Experiment 2 where low frequency words were used. The significance of word frequency effect on syllable effect was also examined. The results of Experiments 1 and 2 indicated that the segmentation unit for a Korean noun is indeed a 'syllable', and this process can occur at the lexical level.

The Prosodic Characteristics of Children with Cochlear Implants with Respect to Speech Rate and Intonation Slope (인공와우이식 아동의 운율 특성 - 발화속도와 억양기울기를 중심으로 -)

  • Oh, Soon-Young;Seong, Cheol-Jae;Choi, Eun-Ah
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.157-165
    • /
    • 2011
  • This study investigated speech rate and intonation slope (least square method; F0, quarter-tone) in normal and CI children's utterances. Each group consisted of 12 people and were divided into groups of children with CI operation (before 3;00), children with CI operation (after 3;00), and normal children. Materials are composed of four kinds of grammatical dialogue sentences which are lacking in respect. Given three groups as independent variables and both speech rate and intonation slope as dependent variables, a one-way ANOVA showed that normal children had faster speech rates and steeper intonation slopes than those of the CI group. More specifically, there was a statistically significant speech rate difference between normal and CI children in all of the sentential patterns but imperative form (p<.01). Additionally, F0 and qtone slope observed in sentential final word showed a significant statistical difference between normal and CI children in imperative form (f0: p<.01; q-tone: p<.05).

  • PDF

Histogram Equalization Using Background Speakers' Utterances for Speaker Identification (화자 식별에서의 배경화자데이터를 이용한 히스토그램 등화 기법)

  • Kim, Myung-Jae;Yang, Il-Ho;So, Byung-Min;Kim, Min-Seok;Yu, Ha-Jin
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.79-86
    • /
    • 2012
  • In this paper, we propose a novel approach to improve histogram equalization for speaker identification. Our method collects all speech features of UBM training data to make a reference distribution. The ranks of the feature vectors are calculated in the sorted list of the collection of the UBM training data and the test data. We use the ranks to perform order-based histogram equalization. The proposed method improves the accuracy of the speaker recognition system with short utterances. We use four kinds of speech databases to evaluate the proposed speaker recognition system and compare the system with cepstral mean normalization (CMN), mean and variance normalization (MVN), and histogram equalization (HEQ). Our system reduced the relative error rate by 33.3% from the baseline system.

A Study on the Voice Onset Time of English Voiceless Stops in the Buckeye Corpus (벅아이 코퍼스를 이용한 영어 무성파열음의 VOT 연구)

  • Yoon, Kyu-Chul
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.33-40
    • /
    • 2012
  • The purpose of this paper is to investigate the voice onset time (VOT) of the English voiceless stops [p, t, k] found in the Buckeye Corpus of Conversational Speech [1]. Three young female speakers were chosen for this study and their VOT values were semi-automatically extracted along with other factors. The factors used for the analysis were place of articulation, location in word, syllabic stress, content word or not, word frequency calculated from the corpus, and the speech rate expressed in syllables per second. Results showed that, for the three places of articulation of each speaker, all the factors had a statistically significant effect on the VOT values. This paper has significance in that the materials used for the analysis were from a corpus of spontaneous natural English speech.

Extraction of Speech Features for Emotion Recognition (감정 인식을 위한 음성 특징 도출)

  • Kwon, Chul-Hong;Song, Seung-Kyu;Kim, Jong-Yeol;Kim, Keun-Ho;Jang, Jun-Su
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.73-78
    • /
    • 2012
  • Emotion recognition is an important technology in the filed of human-machine interface. To apply speech technology to emotion recognition, this study aims to establish a relationship between emotional groups and their corresponding voice characteristics by investigating various speech features. The speech features related to speech source and vocal tract filter are included. Experimental results show that statistically significant speech parameters for classifying the emotional groups are mainly related to speech sources such as jitter, shimmer, F0 (F0_min, F0_max, F0_mean, F0_std), harmonic parameters (H1, H2, HNR05, HNR15, HNR25, HNR35), and SPI.

A Study on the Detection and the Correction of Prosodic Errors Produced by Chinese Korean-Learners (중국인 학습자들의 한국어 강세구 실현양상과 오류진단 및 교정방안 연구)

  • Yune, Young-Sook
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.51-59
    • /
    • 2012
  • The purpose of this study is to examine the pitch pattern of Korean accentual phrases produced by Chinese Korean-learners in the reading of a Korean text. Korean accentual phrase is determined by a specific F0 contour. And the pitch contour of APs differ depending on their length and the nature of initial segment. In order to examine if Chinese speakers are also aware such a phonetic properties, we have examined the AP pitch contours produced by 15 Chinese speakers differing in proficiency, and compared them to pitch contours produced by six Korean native speakers. The results show that Chinese speakers' pitch errors were observed in initial segment-tone interaction and in type of pitch patterns. However, even though Chines speakers produced the same type of pitch patterns, internal tonal modulation differs from native speakers. Finally, on the basis of theses results, we proposed a teaching method that visualizes the F0 contour.

Automated Speech Analysis Applied to Sasang Constitution Classification (음성을 이용한 사상체질 분류 알고리즘)

  • Kang, Jae-Hwan;Yoo, Jong-Hyang;Lee, Hae-Jung;Kim, Jong-Yeol
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.155-163
    • /
    • 2009
  • This paper introduces an automatic voice classification system for the diagnosis of individual constitution based on Sasang Constitutional Medicine (SCM) in Traditional Korean Medicine (TKM). For the developing of this algorithm, we used the voices of 473 speakers and extracted a total of 144 speech features from the speech data consisting of five sustained vowels and one sentence. The classification system, based on a rule-based algorithm that is derived from a non parametric statistical method, presents binary negative decisions. In conclusion, 55.7% of the speech data were diagnosed by this system, of which 72.8% were correct negative decisions.

  • PDF