• Title/Summary/Keyword: 어휘복잡성

Search Result 44, Processing Time 0.023 seconds

The Study of Convergence on Lexical Complexity, Syntax Complexity, and Correlation among Language Variables (한국어 학습자의 어휘복잡성, 구문복잡성 및 언어능력 변인들 간의 상관에 관한 융합 연구)

  • Kyung, Lee-MI;Noh, Byungho;Kang, Anyoung
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.4
    • /
    • pp.219-229
    • /
    • 2017
  • The study was conducted to find out lexical complexity and syntactic complexity for Korean learners by telling stories to see pictures. The results were as follows. First, there was no meaningful difference according to nationality. Second, we checked the differences on lexical complexity and syntactic complexity according to Korean studying period, only number of difference words showed meaningful difference among lexical complexity sub variables, but there was no difference among syntactic complexity sub variables. Third, we also checked correlation among staying period of Korea, Korean studying period, and other language related variables. It showed meaningful correlation staying period in Korea and other language related variable except Korean studying period and TTR. The directions for teaching Korean learners were suggested on the point of converge view according to results.

Foreign Language Education of Korean Peninsula: Insights from Nogeldae (『노걸대』 분석을 통해서 바라본 우리 반도의 외국어 교육)

  • Kim, Jeong-ryeol
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.6
    • /
    • pp.408-414
    • /
    • 2017
  • This paper aims to investigate the value and resilience of Nogeoldae which was written at the end of Koryo dynasty and has been used as the most important foreign language education materials throughout the 500 years of Chosun dynasty. To this end, 106 volumes of dialogues, 12 of meeting, 17 of lodging, 21 of Daedo bound, 34 of Daedo lives and 11 of return in Nogeoldae are analyzed by an average length of the sentences, an average length of words, type-token ratio, number of words before main verbs and number of words before nouns to identify the progressive degree of the complexity. The result of the analysis shows that Nogeoldae presents a desired progressive complexity found in modern foreign language textbooks.

Query Translation for Resolving the Difference between User Query Words and Ontology Resources (온톨로지 검색에 있어서 사용자 질의어와 온톨로지 리소스와의 상이성 해소를 위한 질의어 변환)

  • Kim, Tae-Wan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.35-44
    • /
    • 2011
  • Ontologies are playing an important role in semantic web which is emerging as a next stage of the web revolution because various kinds of metadata are described in ontologies. Correspondingly, many query languages like SPARQL, RDQL etc. have been proposed for querying these ontologies. But users have to know the structures and resource names of ontologies completely to get search results even if they have expertise on complex formal logic and syntax of the query languages. Especially, casual users do not know the resource names and may use different words from resource names when they write their query language. This vocabulary gap problem have to be solved to raise the success rate. In this paper, an approach for translating user's search words to corresponding resource names has been proposed. This approach uses semantic similarity between user created search words and ontology resource names.

An Analysis on Suitability of Words and Sentences in Mathematics Textbooks for Elementary First Grade (초등학교 1학년 수학 교과서의 어휘 및 문장 적합성 분석)

  • Chang, Hyewon;Lim, Miin
    • Journal of Educational Research in Mathematics
    • /
    • v.26 no.2
    • /
    • pp.247-267
    • /
    • 2016
  • It has been pointed out that the mathematics textbooks according to 2009 revised national curriculum cause difficulty not by mathematical knowledge but concomitantly by words and sentences for the first graders who just started learning Korean alphabets. This study focused on the suitability of words and sentences in mathematics textbooks for elementary first grade. We analyzed the degree of difficulty and familiarity in terms of words and the structure, length, and expression in terms of sentences. The results show some causes that lead the first graders to the difficulty. In more detail, we found 108 difficult words and 6 unfamiliar words for the first graders. And it is noticed that the textbooks contain 37 compound sentences, 727 complex sentences, and 38 compound-complex sentences. They also contain 237 long sentences that are composed of 9 words or more, 168 sentences that assign two activities or more, and 52 sentences that contain three nouns or adjectives or more successively. Based on these results and discussions, we suggested several implications for writing mathematics textbooks for the lower grades in elementary school.

A Composite Study on the Writing Characteristics of Korean Learners - Focused on Syntax Production, Syntax Complexity and Syntax Errors (한국어 학습자의 쓰기 특성에 관한 융복합적 연구 - 구문산출성, 구문복잡성 및 구문오류를 중심으로)

  • Lee, MI Kyung;Noh, Byungho
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.11
    • /
    • pp.315-324
    • /
    • 2018
  • For Korean learners, writing is a harder part than any other areas in Korean languages. But in the future, the ability to organize and write systematically is essential for future koran languages learners to take classes, do assignments and presentations at school, and then adapt to job situations. Therefore, there is a need to devise a direction for this. In general, writing characteristics are viewed in many ways, including writing productivity, writing complexity, and writing errors. Accordingly, the study provided drawings and A4 paper for Vietnamese Korean learners, Chinese Korean learners, and Korean university students, before writing freely. Based on the their writing results, we looked at syntax factors (total C-units, total number of words), syntax complexity (number of words per C-unit and clause density), and writing errors (postposition, spell errors, and connective suffix, space errors) According to the study, Vietnamese and Chinese Korean language learners showed significantly lower syntax productivity and complexity than Korean university students, and showed more writing errors than Korean students in postposition and clause density. Based on the results of the study, we discussed writing guidelines for Korean languages learners. However, this study did not validate the differences in writing characteristics according to the Korean language level and length of residences for the study subjects. Therefore, it is necessary to consider this in future research.

Korean Composed Noun Phrase Chunking Using CRF (CRF를 이용한 한국어 문장의 복합명사 상당어구 묶음)

  • Park, Byul;Seon, Choong-Nyoung;Seo, Jung-Yun
    • Annual Conference on Human and Language Technology
    • /
    • 2011.10a
    • /
    • pp.90-93
    • /
    • 2011
  • 구분분석은 문장을 분석하여 문장의 구문 구조를 밝히는 작업으로, 문장이 길어질수록 문장의 중의성이 높아져 구문분석 복잡도를 증사시키고 성능이 떨어진다. 구문분석의 복잡도를 감소시키기 위한 방법 중 하나로 구묶음을 하는데 본 논문에서는 하나의 명사처럼 쓰일 수 있는 둘 이상의 연속된 명사, 대명사, 수사, 숫자와 이를 수식하는 관형사, 접두사 및 접미사를 묶어서 복합명사 상당어구라고 정의하고 복합명사 상당어구 인식 시스템을 제안한다. 본 논문은 복합명사 상당어구 인식을 기계학습을 이용한 태그 부착 문제로 간주하였다. 문장 내 띄어쓰기, 어절의 어휘 정보, 어절 내 형태소들의 품사 정보와 품사-어휘 정보를 함께 자질로 사용하였다. 실험을 위하여 세종 구문분석 말뭉치 7만여 문장을 학습과 평가에 사용했으며, 실험결과는 95.97%의 정확률과 95.11%의 재현율, 95.54%의 $F_1$-평가치를 보였고, 구문분석의 전처리로써 사용하였을 때 구문분석의 성능과 속도가 향상됨을 보였다.

  • PDF

Prospective Changes of English Digital Textbook Based on the Universal Design for Learning (보편적 학습 설계에 근거한 영어과 디지털 교과서 개선 방안)

  • Kim, Jeong-ryeol
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.7
    • /
    • pp.674-683
    • /
    • 2015
  • One of the issues with the textbooks pertinent to the current study is whether or not the Universal Design for Learning (UDL) factors have been dealt to satisfy students with different aptitudes in learning the core objectives of the lessons. This study develops a modified version of the UDL analysis criteria from the cross curricular criteria to language teaching and learning and uses it to analyze the sequence of digital English textbooks to investigate the descriptive statistics of the UDL factors in the new textbooks. The result shows that the textbook is designed most favorably to the students with the talent of linguistic aptitude and less favorably to the students with other types of aptitudes. The sequence analysis shows that sentence/word length and appearance of new words are incrementally sequenced as students advance upper grades. However, the syntactic complexity of middle school curves up steeply which is different from the elementary school textbooks. The UDL analysis will provide learning factors to consider when designing digital English textbooks to cover different aptitudinal groups.

Developing RDF Meta data Graph for Transportation Open Data Platform (교통데이터 유통을 위한 RDF 메타 데이터 그래프 구축방안)

  • Park, Eun Mi;Kang, Jung Hyun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.6
    • /
    • pp.110-116
    • /
    • 2021
  • W3C enacted RDF(Resource Description Framework based DCAT meta data standard, which is world-widely accepted so far. To guarantee the inter-operability and integrity of data from various sources and even from various countries, it is considered that transportation meta data should also follow the DCAT standard. But still, to represent the transportation domain-specific features, it is necessary to define new properties and vocabularies in addition to the DCAT standard. This research identified the additional properties and vocabularies for transportation metadata, considering uniqueness of transportation data. The revised RDF schema and RDF graph proposed in this research should be able to lead the transportation open data platform revitalization.

A Model of English Part-Of-Speech Determination for English-Korean Machine Translation (영한 기계번역에서의 영어 품사결정 모델)

  • Kim, Sung-Dong;Park, Sung-Hoon
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.3
    • /
    • pp.53-65
    • /
    • 2009
  • The part-of-speech determination is necessary for resolving the part-of-speech ambiguity in English-Korean machine translation. The part-of-speech ambiguity causes high parsing complexity and makes the accurate translation difficult. In order to solve the problem, the resolution of the part-of-speech ambiguity must be performed after the lexical analysis and before the parsing. This paper proposes the CatAmRes model, which resolves the part-of-speech ambiguity, and compares the performance with that of other part-of-speech tagging methods. CatAmRes model determines the part-of-speech using the probability distribution from Bayesian network training and the statistical information, which are based on the Penn Treebank corpus. The proposed CatAmRes model consists of Calculator and POSDeterminer. Calculator calculates the degree of appropriateness of the partof-speech, and POSDeterminer determines the part-of-speech of the word based on the calculated values. In the experiment, we measure the performance using sentences from WSJ, Brown, IBM corpus.

  • PDF

The Effect of Spatial Attention in Hangul Word Recognition: Depending on Visual Factors (한글 단어 재인에서 시각적 요인에 따른 공간주의의 영향)

  • Ko Eun Lee;Hye-Won Lee
    • Korean Journal of Cognitive Science
    • /
    • v.34 no.1
    • /
    • pp.1-20
    • /
    • 2023
  • In this study, we examined the effects of spatial attention in Hangul word recognition depending on visual factors. The visual complexity of words (Experiment 1) and contrast (Experiment 2) were manipulated to examine whether the effect of spatial attention differs depending on visual quality. Participants responded to words with and without codas in experiment 1 and words in high-contrast and low-contrast conditions in experiment 2. The effects of spatial attention were investigated by calculating the difference in performance between the condition where spatial cues were given at the target location (valid trial) and the condition where the spatial cues were not given at the target location (invalid trial) as the cuing effects. As a result, the cuing effects were similar depending on the complexity of the words. It indicates that the effects of spatial attention were not different across the visual complexity conditions. The cuing effects were greater in the low-contrast condition than in the high-contrast condition. The greater effect of spatial attention when the contrast is low was explained as a mechanism of signal enhancement.