• Title/Summary/Keyword: 읽기형태소 정보 추출

Search Result 2, Processing Time 0.014 seconds

Morphological Parafoveal Preview Benefit Effects in Reading Korean (우리글 읽기에서 형태소정보의 미리보기 효과)

  • Lee, Sangeun;Choo, Hyeree;Koh, Sungryong
    • Korean Journal of Cognitive Science
    • /
    • v.31 no.2
    • /
    • pp.25-54
    • /
    • 2020
  • While there is no evidence for parafoveal processing in alphabetic languages such as English and Finnish, there is some evidence that morphological information is processed in syllabic languages like Chinese. Korean writing system, Hangul, would be able to provide morphological preview benefit effects since it is an "alphabetic syllabary" which contains both alphabetic and syllabic features. This study explored morphological parafoveal preview benefit effects during reading Korean using irregular verbs, which have phonological and orthographical differences between fundamental and conjugated forms. In the Experiment, the target word was irregular conjugated form, and there were four preview conditions: identical (e.g. 구워), fundamental form (e.g. 굽다), orthographically related (e.g. 굼다), and unrelated control (e.g. 죨어). In the result of study, identical was shortest and morphological, orthographical, unrelated preview were followed. Moreover, measures of first-pass reading of morphological preview were significantly shorter than those of unrelated control preview. This results support the hypothesis of morphological preview benefit effects in Korean. The implications of the results are discussed.

The Unsupervised Learning-based Language Modeling of Word Comprehension in Korean

  • Kim, Euhee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.11
    • /
    • pp.41-49
    • /
    • 2019
  • We are to build an unsupervised machine learning-based language model which can estimate the amount of information that are in need to process words consisting of subword-level morphemes and syllables. We are then to investigate whether the reading times of words reflecting their morphemic and syllabic structures are predicted by an information-theoretic measure such as surprisal. Specifically, the proposed Morfessor-based unsupervised machine learning model is first to be trained on the large dataset of sentences on Sejong Corpus and is then to be applied to estimate the information-theoretic measure on each word in the test data of Korean words. The reading times of the words in the test data are to be recruited from Korean Lexicon Project (KLP) Database. A comparison between the information-theoretic measures of the words in point and the corresponding reading times by using a linear mixed effect model reveals a reliable correlation between surprisal and reading time. We conclude that surprisal is positively related to the processing effort (i.e. reading time), confirming the surprisal hypothesis.