• 제목/요약/키워드: phonetic data

검색결과 200건 처리시간 0.021초

The acoustic realization of the Korean sibilant fricative contrast in Seoul and Daegu

  • Holliday, Jeffrey J.
    • 말소리와 음성과학
    • /
    • 제4권1호
    • /
    • pp.67-74
    • /
    • 2012
  • The neutralization of /$s^h$/ and /$s^*$/ in Gyeongsang dialects is a culturally salient stereotype that has received relatively little attention in the phonetic literature. The current study is a more extensive acoustic comparison of the sibilant fricative productions of Seoul and Gyeongsang dialect speakers. The data presented here suggest that, at least for young Seoul and Daegu speakers, there are few inter-dialectal differences in sibilant fricative production. These conclusions are supported by the output of mixed effects logistic regression models that used aspiration duration, spectral mean of the frication noise, and H1-H2 of the following vowel to predict fricative type in each dialect. The clearest dialect difference was that Daegu speakers' /$s^h$/ and /$s^*$/ productions had overall shorter aspiration durations than those of Seoul speakers, suggesting the opposite of the traditional "/$s^*$/ produced as [$s^h$]" stereotype of Gyeongsang dialects. Further work is needed to investigate whether /$s^h/-/s^*$/ neutralization in Daegu is perceptual rather than acoustic in nature.

우리말 어휘빈도 정보와 분절음 탈락의 관련성에 대한 연구 (Correlation between the frequency of word and the deletion of segment)

  • 차재은
    • 대한음성학회지:말소리
    • /
    • 제47호
    • /
    • pp.1-13
    • /
    • 2003
  • The purpose of this paper is to research the correlation between frequency and the deletion of /w, (equation omitted)/ in Korean. For this purpose, I select 11 words from the frequency data, then, analyze the speech of 20 speakers of standard Korean. As a result, I can find that there is correlation between the frequency and the deletion rate of segment. The rate of deletion is higher in high frequency words, while the rate of realization is higher in low frequency words. Although there is correlation between the frequency and the deletion rate of segment, the feature of segment, prosodic environments are more important in segment deletion.

  • PDF

Computerized Sound Dictionary of Korean and English

  • Kim, Jong-Mi
    • 음성과학
    • /
    • 제8권1호
    • /
    • pp.33-52
    • /
    • 2001
  • A bilingual sound dictionary in Korean and English has been created for a broad range of sound reference to cross-linguistic, dialectal, native language (L1)-transferred biological and allophonic variations. The paper demonstrates that the pronunciation dictionary of the lexicon is inadequate for sound reference due to the preponderance of unmarked sounds. The audio registry consists of the three-way comparison of 1) English speech from native English speakers, 2) Korean speech from Korean speakers, and 3) English speech from Korean speakers. Several sub-dictionaries have been created as the foundation research for independent development. They are 1) a pronunciation dictionary of the Korean lexicon in a keyboard-compatible phonetic transcription, 2) a sound dictionary of L1-interfered language, and 3) an audible dictionary of Korean sounds. The dictionary was designed to facilitate the exchange of the speech signal and its corresponding text data on various media particularly on CD-ROM. The methodology and findings of the construction are discussed.

  • PDF

장모음 인식장치 설계 제작 (Design and Manufacture of a Device for the Recognition of Long Vowels)

  • 구용회
    • 전자공학회논문지T
    • /
    • 제35T권3호
    • /
    • pp.9-14
    • /
    • 1998
  • 장모음 음성인식을 전자회로로 수행하였다. 레벨 압축은 음성 파형을 직렬 펄스로 변화시킬 수 있었다 이 펄스들로 모음을 구별하는 정보가 된다. 펄스의 샘풀링은 한 단위로 모음의 피치 직렬신호를 얻어지는 레지스터 에 의해서 이루어진다. 샘풀링 펄스에 의한 시간제어 펄스는 음성파형의 첩두치 펄스에 의해 발진되어 진다. 이 레지스터에 있는 병렬 데이터는 만약 OO이면 OO이다는 규칙으로 이루처지는 의지결정 회로의 뜻에 따라 음성 심볼이 인식되어진다.

  • PDF

Effects of Word Frequency on a Lenition Process: Evidence from Stop Voicing and /h/ Reduction in Korean

  • Choi, Tae-Hwan;Lim, Nam-Sil;Han, Jeong-Im
    • 음성과학
    • /
    • 제13권3호
    • /
    • pp.35-48
    • /
    • 2006
  • The present study examined whether words with higher frequency have more exposure to the lenition process such as intervocalic stop voicing or /h/ reduction in the production of the Korean speakers. Experiment 1 and Experiment 2 tested if word-internal intervocalic voicing and /h/ reduction occur more often in the words with higher frequency than less frequent words respectively. Results showed that the rate of voicing was not significantly different between the high frequency group and the low frequency group; rather both high and low frequency words were shown to be fully voiced in this prosodic position. However, intervocalic /h/s were deleted more in high frequency words than in low frequency words. Low frequency words showed that other phonetic variants such as [h] and [w] were found more often than in high frequency group. Thus the results of the present study are indefinitive as to the relationship between the word frequency and lenition with the data at hand.

  • PDF

후천적 신경손상으로 인한 마비성 구어장애자들의 마찰음 /s/에 대한 음향학적 연구 (An Acoustical Analysis of /s/ in the Speech of Dysarthric Speakers)

  • 박희정;신혜정;정옥란;석동일
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.101-104
    • /
    • 2003
  • The purpose of this study was to compare acoustic differences of fricative /s/ between the dysarthric subjects and normal subjects. In additional, the subjects' speeches were evaluated in terms of word intelligibility containing /s/ and perceptual severity. Acoustic parameters were duration, peak frequency and intensity of /s/. The results showed that the first peak frequency and intensity of /s/ were significantly different between dysarthric subjetcts and normal subjects. Second, peceptual parameters were significantly different between dysarthric subjetcts and normal subjects. The Pearson correlation coefficient was used to determine the relationship between the acoustical and perceptual data. The results showed that there was a strong correlation between perceptual parameters and peak frequency of /s/.

  • PDF

K-ToBI 기호에 준한 F0 곡선 생성 알고리듬 (A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system)

  • 이용주;이숙향;김종진;고현주;김영일;김상훈;이정철
    • 대한음성학회지:말소리
    • /
    • 제35_36호
    • /
    • pp.131-143
    • /
    • 1998
  • This study describes an algorithm for the F0 contour generation system for Korean sentences and its evaluation results. 400 K-ToBI labeled utterances were used which were read by one male and one female announcers. F0 contour generation system uses two classification trees for prediction of K-ToBI labels for input text and 11 regression trees for prediction of F0 values for the labels. Evaluation results of the system showed 77.2% prediction accuracy for prediction of IP boundaries and 72.0% prediction accuracy for AP boundaries. Information of voicing and duration of the segments was not changed for F0 contour generation and its evaluation. Evaluation results showed 23.5Hz RMS error and 0.55 correlation coefficient in F0 generation experiment using labelling information from the original speech data.

  • PDF

외래어 발음에서 나타난 영어와 한국어의 운율적 차이 (A Note on Prosodic Differences between Korean and English - in loan words from English -)

  • 김선미;문수미
    • 대한음성학회지:말소리
    • /
    • 제35_36호
    • /
    • pp.25-36
    • /
    • 1998
  • The prosodic properties of Korean and English stress were examined with focus on syllable duration and pitch by loan words. 14 loan words were selected by the criteria of the numbers of syllables and stress positions. 3 Korean males using Seoul dialect and 3 American males using general American English served as subjects. Each tokens were uttered 3 times and second one was chosen to be analysed by CSL. We measured the duration and F0 of each syllable. In English, duration is the most salient acoustic correlates of stress, and pitch is the second. In Korean, by contrast, it seems that neither duration nor pitch is the acoustic features of stress, from our data

  • PDF

한국어 원거리 음성의 지속시간 연구 (A Study on the Durational Characteristics of Korean Distant-Talking Speech)

  • 김선희
    • 대한음성학회지:말소리
    • /
    • 제54호
    • /
    • pp.1-14
    • /
    • 2005
  • This paper presents durational characteristics of Korean distant-talking speech using speech data, which consist of 500 distant-talking utterances and 500 normal utterances of 10 speakers (5 males and 5 females). Each file was segmented and labeled manually and the duration of each segment and each word was extracted. Using a statistical method, the durational change of distant-talking speech in comparison with normal speech was analyzed. The results show that the duration of words with distant-talking speech is increased in comparison with normal style, and that the average unvoiced consonantal duration is reduced while the average vocalic duration is increased. Female speakers show a stronger tendency towards lengthening the duration in distant-talking speech. Finally, this study also shows that the speakers of distant-talking speech could be classified according to their different duration rate.

  • PDF

화자적응과 군집화를 이용한 화자식별 시스템의 성능 및 속도 향상 (Adaptation and Clustering Method for Speaker Identification with Small Training Data)

  • 김세현;오영환
    • 대한음성학회지:말소리
    • /
    • 제58호
    • /
    • pp.83-99
    • /
    • 2006
  • One key factor that hinders the widespread deployment of speaker identification technologies is the requirement of long enrollment utterances to guarantee low error rate during identification. To gain user acceptance of speaker identification technologies, adaptation algorithms that can enroll speakers with short utterances are highly essential. To this end, this paper applies MLLR speaker adaptation for speaker enrollment and compares its performance against other speaker modeling techniques: GMMs and HMM. Also, to speed up the computational procedure of identification, we apply speaker clustering method which uses principal component analysis (PCA) and weighted Euclidean distance as distance measurement. Experimental results show that MLLR adapted modeling method is most effective for short enrollment utterances and that the GMMs performs better when long utterances are available.

  • PDF