• 제목/요약/키워드: a word boundary

검색결과 76건 처리시간 0.019초

운율경계에 위치한 어두 모음의 성문 특성: 음향적 상관성을 중심으로 (Glottal Characteristics of Word-initial Vowels in the Prosodic Boundary: Acoustic Correlates)

  • 손형숙
    • 말소리와 음성과학
    • /
    • 제2권3호
    • /
    • pp.47-63
    • /
    • 2010
  • This study provides a description of the glottal characteristics of the word-initial low vowels /a, $\ae$/ in terms of a set of acoustic parameters and discusses glottal configuration as their acoustic correlates. Furthermore, it examines the effect of prosodic boundary on the glottal properties of the vowels, seeking an account of the possible role of prosodic structure based on prosodic theory. Acoustic parameters reported to indicate glottal characteristics were obtained from the measurements made directly from the speech spectrum on recordings of Korean and English collected from 45 speakers. They consist of two separate groups of native Korean and native English speakers, each including both male and female speakers. Based on the three acoustic parameters of open quotient (OQ), first-formant bandwidth (B1), and spectral tilt (ST), comparisons were made between the speech of males and females, between the speech of native Korean and native English speakers, and between Korean and English produced by native Korean speakers. Acoustic analysis of the experimental data indicates that some or all glottal parameters play a crucial role in differentiating the speech groups, despite substantial interspeaker variations. Statistical analysis of the Korean data indicates prosodic strengthening with respect to the acoustic parameters B1 and OQ, suggesting acoustic enhancement in terms of the degree of glottal abduction and the glottal closure during a vibratory cycle.

  • PDF

Comparison of English and Korean speakers for the nasalization of English stops

  • Yun, Ilsung
    • 말소리와 음성과학
    • /
    • 제7권3호
    • /
    • pp.3-11
    • /
    • 2015
  • This study compared English and Korean speakers with regard to the nasalization of the English stops /b, d, g, p, t, k/before a nasal within and across a word boundary. Nine English and thirty Korean speakers participated in the experiment. We used 37 speech items with different grammatical structures. Overall the English informants rarely nasalized the stops while the Korean informants generally greatly nasalized them though widely varying from no nasalization to almost complete nasalization. In general, voiced stops were more likely to be nasalized than voiceless stops. Also, the alveolar stops /d, t/tended to be nasalized the most, the bilabial stops /b, p/ the second most, and the velar stops /g, k/ the least. Besides, the closer the grammatical relationship between neighboring words, the more likely the stop nasalization occurred. In contrast, the Korean syllabification - the addition of the vowel /i/ to the final stops - worked against the stop nasalization. On the other hand, different stress (accent) or rhythm effects of the two languages are assumed to contribute to the significantly different nasalization between English and Korean speakers. The spectrum of stop nasalization obtained from this study can be used as an index to measure how close a certain Korean speaker's stop nasalization is to English speakers'.

한글 문장의 자동 띄어쓰기를 위한 어절 블록 양방향 알고리즘 (Eojeol-Block Bidirectional Algorithm for Automatic Word Spacing of Hangul Sentences)

  • 강승식
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제27권4호
    • /
    • pp.441-447
    • /
    • 2000
  • 자동 띄어쓰기는 띄어쓰기가 무시된 한글 문서의 자동색인이나 문자인식 시스템에서 줄바꿈 문자에 대한 공백 삽입 문제 등을 해결하는데 필요하다. 이러한 문서에서 공백이 삽입될 위치를 자동으로 찾아주는 자동 띄어쓰기 알고리즘으로 문장 분할 기법과 양방향 최장일치법을 이용한 어절 인식 방법을 제안한다. 문장 분할은 한글의 음절 특성을 이용하여 어절 경계가 비교적 명확한 어절 블록을 추출하는 것이며, 형태소 분석기를 이용한 양방향 최장일치법에 의해 어절 블록에 나타난 각 어절들을 인식한다. 4,500여 어절로 구성된 두 가지 유형의 문장 집합에 대하여 제안한 방법의 띄어쓰기 정확도를 평가한 결과 '공백 재현율'이 97.3%, '어절 재현율'이 93.2%로 나타났다.

  • PDF

우리글 읽기에서 나타난 성인과 청소년의 고정시간 분포분석과 단일경계 확산모형 제안 (One Boundary Diffusion Model Analysis on Distributions of Eye Fixation Durations in Reading; Eye Movement Tracking Study)

  • 주혜리;고성룡
    • 인지과학
    • /
    • 제32권1호
    • /
    • pp.1-53
    • /
    • 2021
  • 이 연구의 목적은 성인(만20-28세)과 청소년(만13-14세)을 대상으로 글읽기 안구운동 추적 실험을 통해 분포분석하여 단어빈도 효과를 확인하고, 단일경계 확산모형(One-boundary Diffusion Model)의 정보표집율(drift rate) 파라미터가 두 집단의 글읽기 현상의 차이를 설명할 수 있고 단일경계 확산모형이 개인차를 설명하는 도구로써 적절한지 확인하고자 한다. 먼저 단어 빈도와 단어습득연령과 같은 단어 성질을 통제한 두 가지 글읽기 안구운동추적 실험을 하였고, 실험 1과 실험 2에서 중심와 정보처리가 단일경계 확산모형의 정보표집율 파라미터와 연결되는 것을 확인하였다. 실험 1에서는 성인 집단은 고빈도 조건과 저빈도 조건의 반응비율 고정 시간 평균 차이는 0.1분위수 보다 0.9 분위수에서 더 크게 나타났지만 청소년 집단은 고빈도 조건과 저빈도 조건의 반응 비율 고정시간 평균 차이는 0.1분위수과 0.9 분위수에서 차이가 크게 나타나지 않았다. 실험 2에서 두 집단의 초기습득연령 조건과 후기습득연령 조건의 반응 비율 고정시간 평균 차이는 0.1분위수 보다 0.9 분위수에서 더 크게 나타났다. Ratcliff(Ratcliff, & McKoon, 2008)의 반응시간 분포와 유사한 패턴으로 정적 편향 분포로 앞부분 보다는 꼬리 쪽에서 분산이 증가되는 경향이 확인하였으며 단어의 성질에 따른 두 조건의 차이는 분포의 첩점 크기 차이로 나타나는 것을 확인하였다. 본 연구는 안구운동실험 결과를 통해 글읽기에서 나타나는 단어 성질에 따른 효과를 확인하고 단일경계 확산모형의 정보표집율 파라미터가 글읽기에서 중심와 정보처리를 반영하는 것을 강조한다. 나아가 이 연구에서 제안하는 단일경계 확산모형이 글읽기에서 현상을 예측하고 개인차를 설명할 수 있는 도구로써 활용할 수 있는 가능성을 시사한다.

Prosodic Modifications of the Internal Phonetic Structure of Monosyllabic CVC Words in Conversational Speech

  • Mo, Yoonsook
    • 말소리와 음성과학
    • /
    • 제5권1호
    • /
    • pp.99-108
    • /
    • 2013
  • Previous laboratory studies have shown that prosodic structures are encoded in the modulations of phonetic patterns of speech including suprasegmental as well as segmental features. In particular, effects of prosodic context on duration and intensity of syllables and words have been widely reported. Drawing on prosodically annotated large-scale speech data from the Buckeye corpus of conversational speech of American English, the current study attempted to examine whether and how prosodic prominence and phrase boundary of everyday conversational speech, as determined by a large group of ordinary listeners, are related to the phonetic realization of duration and intensity. The results showed that the patterns of word durations and intensities are influenced by prosodic structure. Closer examinations revealed, however, that the effects of prosodic prominence are not the same as those of prosodic phrase boundary. With regard to intensity measures, the results revealed the systematic changes in the patterns of overall RMS intensity near prosodic phrase boundary but the prominence effects are restricted to the nucleus. In terms of duration measures, both prosodic prominence and phrase boundary are the most closely related to the lengthening of the nucleus. Yet, prosodic prominence is more closely related to the lengthening of the onset while phrase boundary lengthens the coda duration more. The findings from the current study suggest that the phonetic realizations of prosodic prominence are different from those of prosodic phrase boundary, and speakers signal different prosodic structures through deliberate modulations of the internal phonetic structure of words and listeners attend to such phonetic variations.

운율 층위에 따른 중국인학습자들의 한국어 유기음화 적용 양상 (Aspects of Chinese Korean learners' production of Korean aspiration at different prosodic boundaries)

  • 윤영숙
    • 말소리와 음성과학
    • /
    • 제9권4호
    • /
    • pp.9-17
    • /
    • 2017
  • The aim of this study is to examine whether Chinese Korean learners (CKL) can correctly produce the aspiration in 'a lenis obstruents /k/, /t/, /p/, /ʧ/+/h/ sound' sequence at the lexical and post-lexical level. For this purpose 4 Korean native speakers (KNS), 10 advanced and 10 intermediate CKL participated in a production test. The material analyzed consisted of 10 Korean sentences in which aspiration can be applied at different prosodic boundaries (syllable, word, accentual phrase). The results showed that for KNS and CKL, the rate of application of aspiration was different according to prosodic boundaries. Aspiration was more frequently applied at the lexical level than at the post-lexical level and it was more frequent at the word boundary than at the accentual phrase boundary. For CKL, pronunciation errors were either non-application of aspiration or coda obstruent omission. In the case of non-application of aspiration, CKL produced the target syllable as an underling form and they did not transform it as a surface form. In the case of coda obstruent ommision, most of the errors were caused by the inherent complexity of phonological process.

Question Similarity Measurement of Chinese Crop Diseases and Insect Pests Based on Mixed Information Extraction

  • Zhou, Han;Guo, Xuchao;Liu, Chengqi;Tang, Zhan;Lu, Shuhan;Li, Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권11호
    • /
    • pp.3991-4010
    • /
    • 2021
  • The Question Similarity Measurement of Chinese Crop Diseases and Insect Pests (QSM-CCD&IP) aims to judge the user's tendency to ask questions regarding input problems. The measurement is the basis of the Agricultural Knowledge Question and Answering (Q & A) system, information retrieval, and other tasks. However, the corpus and measurement methods available in this field have some deficiencies. In addition, error propagation may occur when the word boundary features and local context information are ignored when the general method embeds sentences. Hence, these factors make the task challenging. To solve the above problems and tackle the Question Similarity Measurement task in this work, a corpus on Chinese crop diseases and insect pests(CCDIP), which contains 13 categories, was established. Then, taking the CCDIP as the research object, this study proposes a Chinese agricultural text similarity matching model, namely, the AgrCQS. This model is based on mixed information extraction. Specifically, the hybrid embedding layer can enrich character information and improve the recognition ability of the model on the word boundary. The multi-scale local information can be extracted by multi-core convolutional neural network based on multi-weight (MM-CNN). The self-attention mechanism can enhance the fusion ability of the model on global information. In this research, the performance of the AgrCQS on the CCDIP is verified, and three benchmark datasets, namely, AFQMC, LCQMC, and BQ, are used. The accuracy rates are 93.92%, 74.42%, 86.35%, and 83.05%, respectively, which are higher than that of baseline systems without using any external knowledge. Additionally, the proposed method module can be extracted separately and applied to other models, thus providing reference for related research.

Creation of the Conversion Table from Hangeul to the Roman Alphabet

  • Kim, Kyoung-Jing;Rhee, Sang-Burm
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -1
    • /
    • pp.321-324
    • /
    • 2002
  • For a rule-based conversion of Hangout into the Roman alphabet rather than a word-for-word conversion, one must come up with a faultless model for the Korean standard pronunciation rules, which are the basis of the Romanization. It is on this foundation that the Korean-Roman alphabet conversion table can be created. For linguistic modeling using PetriNet, modeling boundary and notation of modeling can be defined. In order to describe PetriNet, which is a dynamic modeling tool, as a static one, one can model the standard Korean pronunciation rules and the Hangout-Roman alphabet notation by conversion into incident matrix Thus, this research attempts to develop a mathematical modeling tool for a natural language using PetriNet, and create a Korean-Roman alphabet conversion table.

  • PDF

A Prosodic Analysis on the Korean Subjective Particles -With Reference to the Establishment of Acoustic Features-

  • Seong, Cheol-Jae
    • The Journal of the Acoustical Society of Korea
    • /
    • 제20권3E호
    • /
    • pp.3-9
    • /
    • 2001
  • This study aims to describe a prosodic pattern on the Korean subjective particles with respect to their discourse function. 4 kinds of Korean subjective particles were mainly investigated with reference to sentential location, grammatical relations that precede or follow the word including subjective particles, and prosodic phrasing. F0 and energy were gradually diminished as the particles moved down to the sentential final position. 'Ga'particle, which has been potentially regarded as having a grammatical focusing function, looks like to show relatively higher F0 in sentential medial in discourse. At sentential medial position, when the words including 'ga, eun, and neun'particles were preceded by adverbials, the acoustic variables of particles tended to be diminished by some ratio in comparison with the mean value. The duration of particles might vary with respect to style variation and especially that it tended to diminish from 150 basic, 50 separate, and finally 50 discourse successively. And there's some specific phenomenon that prosodic phrasing itself was relatively easily taken place after 'eun' and 'neun' particles. Finally, I tried to catch the prosodic characteristics (which would be established as acoustic features) of inter-word position at which specific subjective particles were intervened. These acoustic features can be made up of the duration and F0 fluctuation activated in the successive 3 syllables in which word (or prosodic) boundary was located.

  • PDF

FSN 기반의 대어휘 연속음성인식 시스템 개발 (Development of FSN-based Large Vocabulary Continuous Speech Recognition System)

  • 박전규;이윤근
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.327-329
    • /
    • 2007
  • This paper presents a FSN-based LVCSR system and it's application to the speech TV program guide. Unlike the most popular statistical language model-based system, we used FSN grammar based on the graph theory-based FSN optimization algorithm and knowledge-based advanced word boundary modeling. For the memory and latency efficiency, we implemented the dynamic pruning scheduling based on the histogram of active words and their likelihood distribution. We achieved a 10.7% word accuracy improvement with 57.3% speedup.

  • PDF