• Title/Summary/Keyword: Phonetics

Search Result 948, Processing Time 0.017 seconds

The Acoustic Characteristics of Transgenders' Voice (성전환자 음성의 음향학적 특성)

  • Yoo, Jae-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.145-151
    • /
    • 2009
  • The purpose of this study was to investigate the acoustic characteristics of transgenders' voice. This study obtained acoustic measurements (F0, jitter, shimmer, and NHR) of 45 subjects (15 male adults, 15 female adults, and 15 transgender(male to female)) and compared acoustic measurements of the vowel /a/ produced by 3 groups. The MDVP was used to measure the acoustic parameters. A one-way ANOVA was used for statistical analysis. The results were as follow: Firstly, there was a significant difference among the 3 groups in F0. And F0 of transgenders was higher than that of male group and lower than that of female group. Secondly, there was a significant difference between male and transgender group in jitter and shimmer so that the transgender had a tendency to phonate roughly than male.

  • PDF

Nasalance and Intensity of Profound Hearing-Impaired Adults (심도 청각장애 성인의 비성도 및 강도)

  • Choi, Eun-Ah;Park, Han-Sang;Seong, Cheol-Jae
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.123-132
    • /
    • 2009
  • This study investigates the differences in nasalance across handicap, gender, and vowels and the correlation between nasal energy and oral energy both of which are used to compute nasalance. For this study, 20 hearing-impaired adults and 20 normal hearing adults as a control group were asked to read 7 Korean vowels (/$\alpha$, $\Lambda$, o, u, ɯ, i, $\varepsilon$/). Subjects' readings were recorded by NasalView and analyzed by Praat. Results showed that the hearing impaired group (HL) has a significantly higher nasalance than the normal hearing group(NH), and that there was a significant positive correlation between nasal energy and oral energy. A higher nasalance of the hearing impaired group seems to be due to an improper velopharyngeal control which is caused by lack of a proper auditory feedback.

  • PDF

Exclusion of Non-similar Candidates using Positional Accuracy based on Levenstein Distance from N-best Recognition Results of Isolated Word Recognition (레벤스타인 거리에 기초한 위치 정확도를 이용한 고립 단어 인식 결과의 비유사 후보 단어 제외)

  • Yun, Young-Sun;Kang, Jeom-Ja
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.109-115
    • /
    • 2009
  • Many isolated word recognition systems may generate non-similar words for recognition candidates because they use only acoustic information. In this paper, we investigate several techniques which can exclude non-similar words from N-best candidate words by applying Levenstein distance measure. At first, word distance method based on phone and syllable distances are considered. These methods use just Levenstein distance on phones or double Levenstein distance algorithm on syllables of candidates. Next, word similarity approaches are presented that they use characters' position information of word candidates. Each character's position is labeled to inserted, deleted, and correct position after alignment between source and target string. The word similarities are obtained from characters' positional probabilities which mean the frequency ratio of the same characters' observations on the position. From experimental results, we can find that the proposed methods are effective for removing non-similar words without loss of system performance from the N-best recognition candidates of the systems.

  • PDF

A Study of the English Pronunciation of Korean Exchange Students (교환학생프로그램 참가자들의 영어발음에 관한 연구)

  • Park, Hee-Suk
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.87-93
    • /
    • 2009
  • The purpose of this experimental study is to investigate and compare the vowel lengths of English diphthongs and low vowels among native-English-speaking Americans and Korean college exchange students. To do this eight words and sixteen sentences were uttered and recorded by nine subjects, five Korean subjects and four American subjects. Results showed that the vowel lengths of English low vowels between American subjects and Korean subjects were different, which may lead to foreign accent of Korean speakers. Comparing the average length of English low vowels of Korean subjects with those of American subjects, we can see that American subjects tend to pronounce the English low vowels longer than Korean subjects do. In the pronunciation of diphthongs /eI/ and /ou/, Korean subjects pronounced longer than American subjects did. However, in the pronunciation of diphthongs /au/, /aI/, and /ɔI/, American subjects pronounced longer than Korean subjects did.

  • PDF

Probabilistic Target Speech Detection and Its Application to Multi-Input-Based Speech Enhancement (확률적 목표 음성 검출을 통한 다채널 입력 기반 음성개선)

  • Lee, Young-Jae;Kim, Su-Hwan;Han, Seung-Ho;Han, Min-Soo;Kim, Young-Il;Jeong, Sang-Bae
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.95-102
    • /
    • 2009
  • In this paper, an efficient target speech detection algorithm is proposed for the performance improvement of multi-input speech enhancement. Using the normalized cross correlation value between two selected channels, the proposed algorithm estimates the probabilistic distribution function of the value from the pure noise interval. Then, log-likelihoods are calculated with the function and the normalized cross correlation value to detect the target speech interval precisely. The detection results are applied to the generalized sidelobe canceller-based algorithm. Experimental results show that the proposed algorithm significantly improves the speech recognition performance and the signal-to-noise ratios.

  • PDF

Synthesis and Evaluation of Prosodically Exaggerated Utterances

  • Yoon, Kyu-Chul
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.73-85
    • /
    • 2009
  • This paper introduces the technique of synthesizing and evaluating human utterances with exaggerated or atypical prosody. Prosody exaggeration can be implemented by manipulating either the fundamental frequency (F0) contour, the segmental durations, or the intensity contour of an utterance. Of these three prosodic elements, two or more can be exaggerated at the same time. The algorithms of synthesis and evaluation were suggested. Learner utterances exaggerated in each of the three prosodic features were evaluated with respect to their original native versions in terms of the differences in their F0 contours, the segmental durations, and the intensity contours. The measure of differences was the Euclidean distance metric between the matching points in their F0 and intensity contours. The measure was calculated after the exaggerated learner utterances were aligned by the segments and rendered identical to their native version in terms of their segmental durations. For the evaluation of the segmental durations, no prior modifications were made in durations and the same measure was used. The results from the pilot experiment suggest the viability of this measure in the evaluation of learner utterances with atypical prosody with respect to their native versions.

  • PDF

Rhythmic Differences between Spontaneous and Read Speech of English

  • Kim, Sul-Ki;Jang, Tae-Yeoub
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.49-55
    • /
    • 2009
  • This study investigates whether rhythm metrics can be used to capture the rhythmic differences between spontaneous and read English speech. Transcription of spontaneous speech tokens extracted from a corpus is read by three English native speakers to generate the corresponding read speech tokens. Two data sets are compared in terms of seven rhythm measures that are suggested by previous studies. Results show that there is a significant difference in the values of vowel-based metrics (VarcoV and nPVI-V) between spontaneous and read speech. This manifests a greater variability in vocalic intervals in spontaneous speech than in read speech. The current study is especially meaningful as it demonstrates a way in which speech styles can be differentiated and parameterized in numerical terms.

  • PDF

Korean Speakers' Perception of Hindi Stop Consonants (한국인의 힌디어 폐쇄음 인식)

  • Ahn, Hyun-Kee
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.57-63
    • /
    • 2009
  • The two specific research questions pursued in this paper are: (i) how Korean speakers perceive Hindi stops in terms of the three laryngeal categories of Korean stops; (ii) how well Korean speakers do with an ABX perception test that utilizes a total of 52 Hindi minimal pairs where all sounds are identical except for the laryngeal features of a stop in each word. A total of 45 university students participated in this experiment. The results showed that (i) Koreans tended to perceive Hindi voiceless unaspirated stops as Korean fortis ones, voiceless aspirated stops as aspirated ones, voiced stops as lenis ones, and breathy stops as aspirated ones, and (ii) Koreans had difficulty in distinguishing between voiceless aspirated and breathy stops in Hindi.

  • PDF

The Role of Contrast in Prosodically Induced Acoustic Variation

  • Choi, Han-Sook
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.29-37
    • /
    • 2009
  • This paper presents results from speech production experiments on English, Korean, and Hindi that compare variation in the acoustic expression of dissimilar phonological laryngeal contrast in stops conditioned by prosodic prominence. Target stops are analyzed from utterance-initial, -medial, and -final positions, with a variation in contrastive focal accent, from the speech data by six male American English speakers, five male Seoul Korean speakers, and five male Delhi Hindi speakers. The results show that prosodic prominence conditions enhanced distinctiveness between contrastive segments in the three languages. The manner in which prosodic prominence and prosodic phrase structure is marked at the level of segmental variation is, however, found to be language-specific to some extent. In addition, a correlation between the size of the phonological inventory and the corresponding acoustic variation was found but the linear correlation was not strongly supported with the findings in the present study.

  • PDF

An analysis of English pronunciation for high-level proficiency adult learners (발음 숙련도 상위 성인 학습자들의 영어 발음에 대한 분석)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.39-44
    • /
    • 2018
  • The purpose of this study is to investigate the English pronunciation for high-leveled adult Korean speakers based on pronunciation proficiency test. For this purpose, one native English speaker and eight Korean speakers' suprasegmental features such as sentence F0, standard deviation of vowels and stressed / unstressed vowels' F0, duration and intensity were measured and analyzed. The major results show that (1) high-leveled adult Korean speakers' sentence F0 was similar to that of native English speaker, (2) vowel durations, were less diverse than those of native English speakers, and (3) high-leveled adult Korean speakers utilize vowel duration more actively than F0 to indicate the stress assignment of vowels.