• 제목/요약/키워드: phonetic data

검색결과 200건 처리시간 0.023초

Effects of age of L2 acquisition and L2 experience on the production of English vowels by Korean speakers

  • Eunhae Oh;Eunyoung Shin
    • 말소리와 음성과학
    • /
    • 제15권3호
    • /
    • pp.9-16
    • /
    • 2023
  • The current study investigated the influence of age of L2 acquisition (AOA) and length of residence (LOR) in the L2 setting country on the production of voicing-conditioned vowel duration and spectral qualities in English by Korean learners. The primary aim was to explore the ways in which the language-specific phonetic features are acquired by the age of onset and L2 experience. Analyses of the archived corpus data produced by 45 native speakers of Korean showed that, regardless of AOA or LOR, absolute vowel duration was used as a salient correlate of voicing contrast in English for Korean learners. The accuracy of relative vowel duration was influenced more by onset age than by L2 experience, suggesting that being exposed to English at an early age may benefit the acquisition of temporal dimension. On the other hand, the spectral characteristics of English vowels were more consistently influenced by L2 experience, indicating that immersive experience in the L2 speaking environment are likely to improve the accurate production of vowel quality. The distinct influence of the onset age and L2 experience on the specific phonetic cues in L2 vowel production provides insight into the intricate relationship between the two factors on the manifestation of L2 phonological knowledge.

Hidden Markov Network 음성인식 시스템의 성능평가에 관한 연구 (A Study on Performance Evaluation of Hidden Markov Network Speech Recognition System)

  • 오세진;김광동;노덕규;위석오;송민규;정현열
    • 융합신호처리학회논문지
    • /
    • 제4권4호
    • /
    • pp.30-39
    • /
    • 2003
  • 본 논문에서는 한국어 음성 데이터를 대상으로 HM-Net(Hidden Markov Network) 음성인식 시스템의 성능평가를 수행하였다. 음향모델 작성은 음성인식에서 널리 사용되고 있는 통계적인 모델링 방법인 HMM(Hidden Markov Model)을 개량한 HM-Net을 도입하였다. HM-Net은 기존의 SSS(Successive State Splitting) 알고리즘을 개량한 PDT(Phonetic Decision Tree)-SSS 알고리즘에 의해 문맥방향과 시간방향의 상태분할을 수행하여 생성되는데, 특히 문맥방향 상태분할의 경우 학습 음성데이터에 출현하지 않는 문맥정보를 효과적으로 표현하기 위해 음소결정트리를 채용하고 있으며, 시간방향 상태분할의 경우 학습 음성데이터에서 각 음소별 지속시간 정보를 효과적으로 표현하기 위한 상태분할을 수행하며, 마지막으로 파라미터의 공유를 통해 triphone 형태의 최적인 모델 네트워크를 작성하게 된다. 인식에 사용된 알고리즘은 음소 및 단어인식의 경우에는 One-Pass Viterbi 빔 탐색을 사용하며 트리 구조 형태의 사전과 phone/word-pair 문법을 채용하고 있다. 연속음성인식의 경우에는 단어 bigram과 단어 trigram 언어모델과 목구조 형태의 사전을 채용한 Multi-Pass 빔 탐색을 사용하고 있다. 전체적으로 본 논문에서는 다양한 조건에서 HM-Net 음성인식 시스템의 성능평가를 수행하였으며, 지금까지 소개된 음성인식 시스템과 비교하여 매우 우수한 인식성능을 보임을 실험을 통해 확인할 수 있었다.

  • PDF

한국인의 영어 발음에 영향을 미치는 개인적 특성 요인 (Personal Factors Affecting Korean Speakers' English Pronunciation)

  • 전은
    • 대한음성학회지:말소리
    • /
    • 제57호
    • /
    • pp.1-14
    • /
    • 2006
  • This study examines personal factors that affect Korean speakers' English pronunciation. Personal factors which are examined here are as follows: personality type, cognitive system, motivational orientation type, interest in English, how often they listen to tapes, and academic achievements. Data were collected through MBTI (Myers Briggs Type Indicator) Test, Group Embedded Figural Test, and a Questionnaire. The participants consisted of 65 college students. All the results were statistically analyzed: Korean students' personality type and cognitive system are not related with their pronunciation, but motivational orientation type, how often they listen to tapes, academic achievements, and interest in English study are correlated with their pronunciation.

  • PDF

발화 유형에 따른 습관적 음도의 차이 (Effect of Speech Tasks on Habitual Pitch)

  • 임혜진;한지연
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.55-58
    • /
    • 2007
  • This study was investigated the effect of speech tasks on habitual pitch. Seven male and female young adult speakers participated in this study. The experiment consisted of seven different speech tasks: counting, reading, sustained phonation /a/, prolonged /i:/, answering /ne/. Data was analyzed via Visi-pitch IV. The results showed that there was no significant F0 difference among speech tasks.

  • PDF

대본 내용에 의한 정서음성 수집과정의 정규화에 대하여 (Normalization in Collection Procedures of Emotional Speech by Scriptual Context)

  • 조철우
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 춘계 학술대회 발표논문집
    • /
    • pp.123-125
    • /
    • 2006
  • One of the biggest problems unsolved in emotional speech acquisition is how to make or find a situation which is close to natual or desired state from humans. We proposed a method to collect emotional speech data by scriptual context. Several contexts from the scripts of drama were chosen by the experts in the area. Context were divided into 6 classes according to the contents. Two actors, one male and one female, read the text after recognizing the emotional situations in the script.

  • PDF

The Place of Articulation of Korean Affricates Observed in LPC Spectra

  • Kim, Hyun-Soon
    • 음성과학
    • /
    • 제3권
    • /
    • pp.93-108
    • /
    • 1998
  • This paper attempts to acoustically examine the place of articulation of Korean affricates. In order to pursue an acoustic analysis of where Korean affricates are articulated, we resort to LPC spectra of the Korean plain affricate /c/ in intervocalic position, based on theoretical assumptions (e.g., Stevens 1993a), and compare the data to that of the Korean alveolar consonants /t, s/ in the same context. Our phonetic results show that in intervocalic position, the Korean plain affricate is alveolar just like the Korean alveolar consonants /t, s/, supporting the articulatory studies of $Skali{\check{c}}kov{\acute{a}}$ (1960) and Kim (1997).

  • PDF

F0 Peak Lagging and Relative Timing in English Intonation

  • Kim, Sung-A
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2000년도 7월 학술대회지
    • /
    • pp.211-219
    • /
    • 2000
  • In this paper, we examine fO peak lagging phenomenon in English. FO peak lagging refers to the fact that fO peak corresponding to an accent is realized beyond the domain of the host syllable. We present experimental data of fO peak lagging, which shows that fO peak is heavily delayed when the duration of the accented syllable is relatively short. In addition, we show that fO peak is also heavily delayed and realized in the following syllable in a focused word, even where the target vowel is not intrinsically short.

  • PDF

연결 숫자음 인식기 학습용 음성DB 녹음을 위한 최적의 대본 작성 (The Optimal and Complete Prompts Lists for Connected Spoken Digit Speech Corpus)

  • 유하진
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.131-134
    • /
    • 2003
  • This paper describes an efficient algorithm to generate compact and complete prompts lists for connected spoken digits database. In building a connected spoken digit recognizer, we have to acquire speech data in various contexts. However, in many speech databases the lists are made by using random generators. We provide an efficient algorithm that can generate compact and complete lists of digits in various contexts. This paper includes the proof of optimality and completeness of the algorithm.

  • PDF

정보검색 기법과 동적 보간 계수를 이용한 N-gram 언어모델의 적응 (N- gram Adaptation Using Information Retrieval and Dynamic Interpolation Coefficient)

  • 최준기;오영환
    • 대한음성학회지:말소리
    • /
    • 제56호
    • /
    • pp.207-223
    • /
    • 2005
  • The goal of language model adaptation is to improve the background language model with a relatively small adaptation corpus. This study presents a language model adaptation technique where additional text data for the adaptation do not exist. We propose the information retrieval (IR) technique with N-gram language modeling to collect the adaptation corpus from baseline text data. We also propose to use a dynamic language model interpolation coefficient to combine the background language model and the adapted language model. The interpolation coefficient is estimated from the word hypotheses obtained by segmenting the input speech data reserved for held-out validation data. This allows the final adapted model to improve the performance of the background model consistently The proposed approach reduces the word error rate by $13.6\%$ relative to baseline 4-gram for two-hour broadcast news speech recognition.

  • PDF

훈련데이터 기반의 temporal filter를 적용한 한국어 4연숫자 전화음성의 인식실험 (Recognition experiment of Korean connected digit telephone speech using the temporal filter based on training speech data)

  • 정성윤;김민성;손종목;배건성;강점자
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 10월 학술대회지
    • /
    • pp.149-152
    • /
    • 2003
  • In this paper, data-driven temporal filter methods[1] are investigated for robust feature extraction. A principal component analysis technique is applied to the time trajectories of feature sequences of training speech data to get appropriate temporal filters. We did recognition experiments on the Korean connected digit telephone speech database released by SITEC, with data-driven temporal filters. Experimental results are discussed with our findings.

  • PDF