• 제목/요약/키워드: word boundary

검색결과 88건 처리시간 0.029초

Articulatory modification of /m/ in the coda and the onset as a function of prosodic boundary strength and focus in Korean

  • 김사향;조태홍
    • 말소리와 음성과학
    • /
    • 제6권4호
    • /
    • pp.3-15
    • /
    • 2014
  • An articulatory study (using an Electromagnetic Articulography, EMA) was conducted to explore effects of prosodic boundary strength (Intonational Phrase/IP versus Word/Wd), and focus (Focused/accented, Neutral, Unfocused/unaccented) on the kinematic realization of /m/ in the coda (${\ldots}$am#i${\ldots}$) and the onset (${\ldots}$a#mi${\ldots}$) conditions in Korean. (Here # refers to a prosodic boundary such as an IP or a Wd boundary). Several important points have emerged. First, the boundary effect on /m/s was most robustly observed in the temporal dimension in both the coda (IP-final) and the onset (IP-initial) conditions, generally in line with cross-linguistically observable boundary-related lengthening patterns. Crucially, however, in contrast with boundary-related slowing-down effects that have been observed in English, both the IP-final and IP-initial temporal expansions of Korean /m/s were not accompanied by an articulatory slowing down. They were, if anything, associated with a faster movement in the lip opening (release) phase (into the vowel). This suggests that the mechanisms underlying boundary-related temporal expansions may differ between languages. Second, observed boundary-induced strengthening effects (both spatial and temporal expansions, especially on the IP-initial /m/s) were remarkably similar to prominence (focus)-induced strengthening effects, which is again counter to phrase-initial strengthening patterns observed in English in which boundary effects are dissociated from prominent effects. This suggests that initial syllables in Korean may be a common focus for both boundary and prominence marking. These results, taken together, imply that the boundary-induced strengthening in Korean is different in nature from that in English, each being modulated by the individual language's prosodic system. Third, the coda and the onset /m/s were found to be produced in a subtly but significantly different way even in a Wd boundary condition, a potentially neutralizing (resyllabification) context. This suggests that although the coda may be phonologically 'resyllabified' into the following syllable in a phrase-medial position, its underlying syllable affiliation is kinematically distinguished from the onset.

한글 문장의 자동 띄어쓰기를 위한 어절 블록 양방향 알고리즘 (Eojeol-Block Bidirectional Algorithm for Automatic Word Spacing of Hangul Sentences)

  • 강승식
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제27권4호
    • /
    • pp.441-447
    • /
    • 2000
  • 자동 띄어쓰기는 띄어쓰기가 무시된 한글 문서의 자동색인이나 문자인식 시스템에서 줄바꿈 문자에 대한 공백 삽입 문제 등을 해결하는데 필요하다. 이러한 문서에서 공백이 삽입될 위치를 자동으로 찾아주는 자동 띄어쓰기 알고리즘으로 문장 분할 기법과 양방향 최장일치법을 이용한 어절 인식 방법을 제안한다. 문장 분할은 한글의 음절 특성을 이용하여 어절 경계가 비교적 명확한 어절 블록을 추출하는 것이며, 형태소 분석기를 이용한 양방향 최장일치법에 의해 어절 블록에 나타난 각 어절들을 인식한다. 4,500여 어절로 구성된 두 가지 유형의 문장 집합에 대하여 제안한 방법의 띄어쓰기 정확도를 평가한 결과 '공백 재현율'이 97.3%, '어절 재현율'이 93.2%로 나타났다.

  • PDF

Extraction of ObjectProperty-UsageMethod Relation from Web Documents

  • Pechsiri, Chaveevan;Phainoun, Sumran;Piriyakul, Rapeepun
    • Journal of Information Processing Systems
    • /
    • 제13권5호
    • /
    • pp.1103-1125
    • /
    • 2017
  • This paper aims to extract an ObjectProperty-UsageMethod relation, in particular the HerbalMedicinalProperty-UsageMethod relation of the herb-plant object, as a semantic relation between two related sets, a herbal-medicinal-property concept set and a usage-method concept set from several web documents. This HerbalMedicinalProperty-UsageMethod relation benefits people by providing an alternative treatment/solution knowledge to health problems. The research includes three main problems: how to determine EDU (where EDU is an elementary discourse unit or a simple sentence/clause) with a medicinal-property/usage-method concept; how to determine the usage-method boundary; and how to determine the HerbalMedicinalProperty-UsageMethod relation between the two related sets. We propose using N-Word-Co on the verb phrase with the medicinal-property/usage-method concept to solve the first and second problems where the N-Word-Co size is determined by the learning of maximum entropy, support vector machine, and naïve Bayes. We also apply naïve Bayes to solve the third problem of determining the HerbalMedicinalProperty-UsageMethod relation with N-Word-Co elements as features. The research results can provide high precision in the HerbalMedicinalProperty-UsageMethod relation extraction.

운율경계에 위치한 어두 모음의 성문 특성: 음향적 상관성을 중심으로 (Glottal Characteristics of Word-initial Vowels in the Prosodic Boundary: Acoustic Correlates)

  • 손형숙
    • 말소리와 음성과학
    • /
    • 제2권3호
    • /
    • pp.47-63
    • /
    • 2010
  • This study provides a description of the glottal characteristics of the word-initial low vowels /a, $\ae$/ in terms of a set of acoustic parameters and discusses glottal configuration as their acoustic correlates. Furthermore, it examines the effect of prosodic boundary on the glottal properties of the vowels, seeking an account of the possible role of prosodic structure based on prosodic theory. Acoustic parameters reported to indicate glottal characteristics were obtained from the measurements made directly from the speech spectrum on recordings of Korean and English collected from 45 speakers. They consist of two separate groups of native Korean and native English speakers, each including both male and female speakers. Based on the three acoustic parameters of open quotient (OQ), first-formant bandwidth (B1), and spectral tilt (ST), comparisons were made between the speech of males and females, between the speech of native Korean and native English speakers, and between Korean and English produced by native Korean speakers. Acoustic analysis of the experimental data indicates that some or all glottal parameters play a crucial role in differentiating the speech groups, despite substantial interspeaker variations. Statistical analysis of the Korean data indicates prosodic strengthening with respect to the acoustic parameters B1 and OQ, suggesting acoustic enhancement in terms of the degree of glottal abduction and the glottal closure during a vibratory cycle.

  • PDF

운율 층위에 따른 중국인학습자들의 한국어 유기음화 적용 양상 (Aspects of Chinese Korean learners' production of Korean aspiration at different prosodic boundaries)

  • 윤영숙
    • 말소리와 음성과학
    • /
    • 제9권4호
    • /
    • pp.9-17
    • /
    • 2017
  • The aim of this study is to examine whether Chinese Korean learners (CKL) can correctly produce the aspiration in 'a lenis obstruents /k/, /t/, /p/, /ʧ/+/h/ sound' sequence at the lexical and post-lexical level. For this purpose 4 Korean native speakers (KNS), 10 advanced and 10 intermediate CKL participated in a production test. The material analyzed consisted of 10 Korean sentences in which aspiration can be applied at different prosodic boundaries (syllable, word, accentual phrase). The results showed that for KNS and CKL, the rate of application of aspiration was different according to prosodic boundaries. Aspiration was more frequently applied at the lexical level than at the post-lexical level and it was more frequent at the word boundary than at the accentual phrase boundary. For CKL, pronunciation errors were either non-application of aspiration or coda obstruent omission. In the case of non-application of aspiration, CKL produced the target syllable as an underling form and they did not transform it as a surface form. In the case of coda obstruent ommision, most of the errors were caused by the inherent complexity of phonological process.

Comparison of English and Korean speakers for the nasalization of English stops

  • Yun, Ilsung
    • 말소리와 음성과학
    • /
    • 제7권3호
    • /
    • pp.3-11
    • /
    • 2015
  • This study compared English and Korean speakers with regard to the nasalization of the English stops /b, d, g, p, t, k/before a nasal within and across a word boundary. Nine English and thirty Korean speakers participated in the experiment. We used 37 speech items with different grammatical structures. Overall the English informants rarely nasalized the stops while the Korean informants generally greatly nasalized them though widely varying from no nasalization to almost complete nasalization. In general, voiced stops were more likely to be nasalized than voiceless stops. Also, the alveolar stops /d, t/tended to be nasalized the most, the bilabial stops /b, p/ the second most, and the velar stops /g, k/ the least. Besides, the closer the grammatical relationship between neighboring words, the more likely the stop nasalization occurred. In contrast, the Korean syllabification - the addition of the vowel /i/ to the final stops - worked against the stop nasalization. On the other hand, different stress (accent) or rhythm effects of the two languages are assumed to contribute to the significantly different nasalization between English and Korean speakers. The spectrum of stop nasalization obtained from this study can be used as an index to measure how close a certain Korean speaker's stop nasalization is to English speakers'.

Question Similarity Measurement of Chinese Crop Diseases and Insect Pests Based on Mixed Information Extraction

  • Zhou, Han;Guo, Xuchao;Liu, Chengqi;Tang, Zhan;Lu, Shuhan;Li, Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권11호
    • /
    • pp.3991-4010
    • /
    • 2021
  • The Question Similarity Measurement of Chinese Crop Diseases and Insect Pests (QSM-CCD&IP) aims to judge the user's tendency to ask questions regarding input problems. The measurement is the basis of the Agricultural Knowledge Question and Answering (Q & A) system, information retrieval, and other tasks. However, the corpus and measurement methods available in this field have some deficiencies. In addition, error propagation may occur when the word boundary features and local context information are ignored when the general method embeds sentences. Hence, these factors make the task challenging. To solve the above problems and tackle the Question Similarity Measurement task in this work, a corpus on Chinese crop diseases and insect pests(CCDIP), which contains 13 categories, was established. Then, taking the CCDIP as the research object, this study proposes a Chinese agricultural text similarity matching model, namely, the AgrCQS. This model is based on mixed information extraction. Specifically, the hybrid embedding layer can enrich character information and improve the recognition ability of the model on the word boundary. The multi-scale local information can be extracted by multi-core convolutional neural network based on multi-weight (MM-CNN). The self-attention mechanism can enhance the fusion ability of the model on global information. In this research, the performance of the AgrCQS on the CCDIP is verified, and three benchmark datasets, namely, AFQMC, LCQMC, and BQ, are used. The accuracy rates are 93.92%, 74.42%, 86.35%, and 83.05%, respectively, which are higher than that of baseline systems without using any external knowledge. Additionally, the proposed method module can be extracted separately and applied to other models, thus providing reference for related research.

A Prosodic Analysis on the Korean Subjective Particles -With Reference to the Establishment of Acoustic Features-

  • Seong, Cheol-Jae
    • The Journal of the Acoustical Society of Korea
    • /
    • 제20권3E호
    • /
    • pp.3-9
    • /
    • 2001
  • This study aims to describe a prosodic pattern on the Korean subjective particles with respect to their discourse function. 4 kinds of Korean subjective particles were mainly investigated with reference to sentential location, grammatical relations that precede or follow the word including subjective particles, and prosodic phrasing. F0 and energy were gradually diminished as the particles moved down to the sentential final position. 'Ga'particle, which has been potentially regarded as having a grammatical focusing function, looks like to show relatively higher F0 in sentential medial in discourse. At sentential medial position, when the words including 'ga, eun, and neun'particles were preceded by adverbials, the acoustic variables of particles tended to be diminished by some ratio in comparison with the mean value. The duration of particles might vary with respect to style variation and especially that it tended to diminish from 150 basic, 50 separate, and finally 50 discourse successively. And there's some specific phenomenon that prosodic phrasing itself was relatively easily taken place after 'eun' and 'neun' particles. Finally, I tried to catch the prosodic characteristics (which would be established as acoustic features) of inter-word position at which specific subjective particles were intervened. These acoustic features can be made up of the duration and F0 fluctuation activated in the successive 3 syllables in which word (or prosodic) boundary was located.

  • PDF

Development of the Korean Handwriting Assessment for Children Using Digital Image Processing

  • Lee, Cho Hee;Kim, Eun Bin;Lee, Onseok;Kim, Eun Young
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권8호
    • /
    • pp.4241-4254
    • /
    • 2019
  • The efficiency and accuracy of handwriting measurement could be improved by adopting digital image processing. This study developed a computer-based Korean Handwriting Assessment tool. Second graders participated in this study by performing writing tasks of consonants, vowels, words, and sentences. We extracted boundary parameters for each letter using digital image processing and calculated the variables of size, size coefficient of variation (CV), misalignment, inter-letter space, inter-word space, and ratio of inter-letter space to inter-word space. Children were also administered traditional handwriting and visuomotor tests. Digital variables from image processing were correlated with these previous tests. Using these correlations, we established a three-point scoring system that computed test scores for each variable. We analyzed inter-rater reliability between the computer rater and human rater and test-retest reliability between the first and second performances. The validity was examined by analyzing the relationship between the Korean Handwriting Assessment and previous handwriting and visuomotor tests. We suggested the Korean Handwriting Assessment to measure size, size consistency, misalignment, inter-letter space, inter-word space, and space ratio using digital image processing. This Korean Handwriting Assessment tool proved to have reliability and validity. It is expected to be useful for assessing children's handwriting.

Creation of the Conversion Table from Hangeul to the Roman Alphabet

  • Kim, Kyoung-Jing;Rhee, Sang-Burm
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -1
    • /
    • pp.321-324
    • /
    • 2002
  • For a rule-based conversion of Hangout into the Roman alphabet rather than a word-for-word conversion, one must come up with a faultless model for the Korean standard pronunciation rules, which are the basis of the Romanization. It is on this foundation that the Korean-Roman alphabet conversion table can be created. For linguistic modeling using PetriNet, modeling boundary and notation of modeling can be defined. In order to describe PetriNet, which is a dynamic modeling tool, as a static one, one can model the standard Korean pronunciation rules and the Hangout-Roman alphabet notation by conversion into incident matrix Thus, this research attempts to develop a mathematical modeling tool for a natural language using PetriNet, and create a Korean-Roman alphabet conversion table.

  • PDF