• Title/Summary/Keyword: Korean comparative sentence


Transformation-based Learning for Korean Comparative Sentence Classification (한국어 비교 문장 유형 분류를 위한 변환 기반 학습 기법)

  • Yang, Seon;Ko, Young-Joong
    • Journal of KIISE:Software and Applications / v.37 no.2 / pp.155-160 / 2010
  • This paper proposes a method for classifying Korean comparative sentences, which is one part of comparison mining. Comparison mining, an area of text mining, analyzes comparative relations in large collections of text documents. It requires a three-step process: 1) identifying comparative sentences in the documents, 2) classifying those sentences into several classes, and 3) analyzing the comparative relations for each class. This paper addresses the second task. We use transformation-based learning (TBL), a well-known learning method in natural language processing. In our experiment, we classify comparative sentences into seven classes using TBL and achieve an accuracy of 80.01%.
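
The general idea of transformation-based learning can be sketched in a few lines. The abstract does not give the paper's features, rule templates, or the names of its seven classes, so everything below is an assumption for illustration: sentences are reduced to sets of keyword features, the three class names are invented, and the only rule template is "if feature f is present and the current label is a, change it to b".

```python
from collections import Counter

def apply_rule(rule, feats, label):
    """Rule (feature, from_label, to_label): relabel when both conditions hold."""
    feature, from_label, to_label = rule
    return to_label if label == from_label and feature in feats else label

def learn_tbl(samples, max_rules=10):
    """Greedy TBL over samples given as (feature_set, gold_label) pairs."""
    # Initial annotation: give every sample the most frequent gold label.
    baseline = Counter(gold for _, gold in samples).most_common(1)[0][0]
    current = [baseline] * len(samples)
    rules = []
    for _ in range(max_rules):
        # Propose candidate rules only from currently misclassified samples.
        candidates = {
            (f, label, gold)
            for (feats, gold), label in zip(samples, current)
            if label != gold
            for f in feats
        }
        best_rule, best_gain = None, 0
        for rule in sorted(candidates):  # sorted for deterministic tie-breaking
            # Net gain = errors fixed minus correct labels broken.
            gain = sum(
                (apply_rule(rule, feats, label) == gold) - (label == gold)
                for (feats, gold), label in zip(samples, current)
            )
            if gain > best_gain:
                best_rule, best_gain = rule, gain
        if best_rule is None:  # no rule improves accuracy any further
            break
        rules.append(best_rule)
        current = [apply_rule(best_rule, feats, label)
                   for (feats, _), label in zip(samples, current)]
    return baseline, rules

def classify(feats, baseline, rules):
    """Start from the baseline label and apply the learned rules in order."""
    label = baseline
    for rule in rules:
        label = apply_rule(rule, feats, label)
    return label

# Toy data: keyword features and class names are hypothetical.
TOY = [
    ({"than", "more"}, "NON_EQUAL"),
    ({"than", "less"}, "NON_EQUAL"),
    ({"than", "far"}, "NON_EQUAL"),
    ({"same", "as"}, "EQUAL"),
    ({"as", "similar"}, "EQUAL"),
    ({"most"}, "SUPERLATIVE"),
]
BASELINE, RULES = learn_tbl(TOY)
```

On this toy set the learner starts from the majority label and then adds one rule per remaining error pattern; `classify({"as", "cheap"}, BASELINE, RULES)` follows the learned rules in order, which is the property that makes TBL output easy to inspect.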

The Study on the Influence that the Understanding Degree about the Sentence Stated Math. Problems Reach the Extension of the Problem Solving Capacity. - Focusing on the Unit of Equation and Inequality in Middle School - (문장제에 대한 이해정도가 문제해결력 신장에 미치는 영향에 대한 연구 -중학교 방정식과 부등식 단원을 중심으로-)

  • 지재근;오세열
    • Journal of the Korean School Mathematics Society / v.3 no.1 / pp.189-200 / 2000
  • The purpose of this thesis is to help students understand sentence-stated math problems that are closely related to real life and apply appropriate strategies to solve them. The following research questions were posed. 1. How do repeated thinking lessons develop the understanding of problems and influence the use of correct problem-solving strategies and the growth of problem-solving ability? 2. How much do the experimental and comparative classes at each level (upper, intermediate, and lower) differ in achievement for each type of sentence-stated problem? To conduct this research, the classes were divided into three levels: upper, intermediate, and lower. Each level included an experimental class and a comparative class. The two classes of the same level were tested on the basis of the class-division records; in the experimental class, repeated-learning papers were used for two weeks to guide a fixed thinking algorithm for each type of sentence-stated problem. Eight common problem types were chosen from a variety of textbooks: number calculation, velocity-distance-time, density of a mixture, benefit, distribution, work, ratio, and length of a figure. After the experiment, the differences in achievement between the experimental and comparative classes were compared and analyzed through an achievement test of seven problems worth a total of seventy points.
The conclusions of this thesis are as follows. First, learning activities using the repeated-learning papers produced an even improvement in achievement at each class level; the upper-level learners in particular showed a notable difference between the experimental and comparative classes compared with the intermediate and lower levels. Second, the analysis of achievement by problem type shows that learners readily accepted the strategy of setting up a formula for velocity-distance-time problems and took an interest in the picture-drawing strategy for density-of-mixture problems. However, each situation requires a variety of appropriate solution strategies. Teachers will need to employ other interesting solution strategies that relate to real life.

A Comparative Study of Spoken and Written Sentence Production in Adults with Fluent Aphasia (유창성 실어증 환자의 구어와 문어 문장산출 능력 비교)

  • Ha, Ji-Wan;Pyun, Sung-Bom;Hwang, Yu Mi;Yi, Hoyoung;Sim, Hyun Sub
    • Phonetics and Speech Sciences / v.5 no.3 / pp.103-111 / 2013
  • Traditionally, it has been assumed that written abilities are completely dependent on phonology. Spoken and written language skills in aphasic patients have therefore been thought to exhibit similar types of impairment. However, a number of recent studies have reported findings that support the orthographic autonomy hypothesis. The purpose of this study was to examine whether fluent aphasic patients show a discrepancy between speaking and writing skills, and thereby to determine whether the two skills are realized through independent processes. To this end, the study compared the performance of 30 aphasia patients on the K-FAST speaking and writing tasks. In addition, 16 aphasia patients who were capable of producing sentences in both speaking and writing were compared on their performance at each phase of the sentence production process. The subjects performed differently in speaking and writing, with statistically significant differences between the two language skills at the positional and phonological encoding phases of the sentence production process. The results therefore suggest that, from a certain phase of the sentence production process onward, written language is more likely to be produced via independent routes without mediation by the process of spoken language production.

A Comparative Study of English Vowel Lengths between Koreans and Americans (한국인과 미국인의 영어 모음길이 비교연구)

  • Park, Hee-Suk
    • Speech Sciences / v.2 / pp.135-147 / 1997
  • This thesis describes differences in vowel length between Koreans and Americans pronouncing English words and sentences. It also analyzes the reasons for these differences with the help of acoustic instruments. Sixteen sentences and eight words were selected as the experimental material. The informants were 9 males, 3 Americans and 6 Koreans, who were asked to pronounce the test words and sentences five times. Duration was measured by acoustic analysis using computer-based digital techniques. According to the results of the experiment, the durations of 8 English vowels pronounced by Koreans and by Americans differ markedly. When Koreans pronounce English vowels, the duration of the stressed vowel in sentence-final position is much shorter than in other positions, such as sentence-initial position and word position. In contrast, when Americans pronounce English vowels, the duration of the stressed vowel in sentence-final position is much longer than in other positions. A more detailed study of the correlation between length and stress would provide a foundation for studying the relation between stress and length.

Comparative Study of Tokenizer Based on Learning for Sentiment Analysis (고객 감성 분석을 위한 학습 기반 토크나이저 비교 연구)

  • Kim, Wonjoon
    • Journal of Korean Society for Quality Management / v.48 no.3 / pp.421-431 / 2020
  • Purpose: The purpose of this study is to compare and analyze tokenizers used in natural language processing for sentiment analysis of customer satisfaction. Methods: A supervised learning-based tokenizer, Mecab-Ko, and an unsupervised learning-based tokenizer, SentencePiece, were compared. Three algorithms (Naïve Bayes, k-Nearest Neighbor, and Decision Tree) were selected to compare the performance of each tokenizer, and three metrics (accuracy, precision, and recall) were used for the comparison. Results: Performance evaluation and verification confirmed that SentencePiece shows better classification performance than Mecab-Ko. To confirm the robustness of the derived results, independent t-tests were conducted on the evaluation results for the two tokenizers. The study confirmed that the classification performance of the SentencePiece tokenizer was high with the k-Nearest Neighbor and Decision Tree algorithms. In addition, the Decision Tree showed slightly higher accuracy than the other two classification algorithms. Conclusion: The SentencePiece tokenizer can be used to classify and interpret customer sentiment in Korean online reviews more accurately. It also appears possible to assign a specific meaning to short words or jargon that users often employ when evaluating products but that are not defined in advance.
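
The three metrics named in the abstract can be computed as follows. This is a minimal sketch for a binary sentiment task; the label names and the toy predictions are assumptions, not data from the study.

```python
def evaluate(gold, pred, positive="pos"):
    """Return (accuracy, precision, recall) for one positive class."""
    pairs = list(zip(gold, pred))
    tp = sum(g == positive and p == positive for g, p in pairs)  # true positives
    fp = sum(g != positive and p == positive for g, p in pairs)  # false positives
    fn = sum(g == positive and p != positive for g, p in pairs)  # false negatives
    accuracy = sum(g == p for g, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall

# Hypothetical gold labels and classifier output for five reviews.
GOLD = ["pos", "pos", "pos", "neg", "neg"]
PRED = ["pos", "pos", "neg", "pos", "neg"]
ACCURACY, PRECISION, RECALL = evaluate(GOLD, PRED)
```

Running the same function on the predictions obtained with each tokenizer gives directly comparable numbers, which is the kind of side-by-side comparison the abstract describes.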

The Effects of Shape of Advance Organizers on Scientific Concept Attainment of Elementary School Students (선행조직자의 유형이 초등학생들의 과학개념 획득에 미치는 영향)

  • Kim, Soon-Shik;Lee, Yong-Seob
    • Journal of the Korean Society of Earth Science Education / v.5 no.2 / pp.197-203 / 2012
  • The main purpose of this research is to examine the effect of different types of advance organizers in elementary science classes on students' acquisition and retention of scientific concepts. For the research, we selected 93 fifth-grade students in three classes at M elementary school in U metropolitan city. We randomly assigned two classes as experimental groups and one class as a comparative group. In the first experimental group (Group1), the advance organizer was presented as a sentence, and in the second experimental group (Group2), it was presented as a picture. The comparative group received no advance organizer, and its classes proceeded as before. The results of this research are as follows. First, a one-way analysis of variance on the scientific concept scores of the three groups right after the experiment showed a significant difference, indicating that the type of advance organizer affects elementary school students' acquisition of scientific concepts. Second, although Group1, which was given a sentence as an advance organizer, scored higher on scientific concepts after the experiment than the comparative group, the difference was not significant. Third, Group2, which was given a picture as an advance organizer, differed significantly from the comparative group. Judging from these results, using a picture as an advance organizer in elementary science classes will be very effective for students' acquisition of scientific concepts.

A Comparative Study on Teaching Chinese and Korean Topic Sentences (주제문을 통한 한국학생의 중국어 학습지도 연구 - 중·한 주제문의 비교를 중심으로)

  • Choo, Chui-Lan
    • Cross-Cultural Studies / v.19 / pp.389-409 / 2010
  • Chinese is a topic-prominent language, so learners of Chinese should understand its discourse functions. Most Korean students think that Chinese sentences must appear in S-V-O order, and they consistently make mistakes when they use Chinese. I think Korean is very similar to Chinese in its discourse functions. In this paper, I therefore try to find a method of teaching Chinese topic sentences by comparing Chinese with Korean in the light of discourse function. I think that when Korean students know how to use Korean topic sentences to explain the discourse functions of Chinese, they will not make such mistakes. With this in mind, chapter 2 presents various topic sentences to show that the 'topic' is very important in Chinese sentences, which is why Chinese is called a topic-prominent language. In chapter 3, I analyze the sentences that students produced and highlight the reasons for their mistakes. The cause is that they always think Chinese must appear in S-V-O order; they do not understand why some sentences appear in O-(S)V or S-O-V order. This shows that they do not know what a topic sentence is or how to make one. Sometimes I have them translate the sentences into Korean, but they also produce Korean sentences that follow the Chinese S-V-O order. I therefore think it is critical in their training to have them translate into and speak Korean topic sentences, get a feel for Chinese topic sentences, and then produce Chinese topic sentences themselves.

Korean Sentence Boundary Detection Using Memory-based Machine Learning (메모리 기반의 기계 학습을 이용한 한국어 문장 경계 인식)

  • Han Kun-Heui;Lim Heui-Seok
    • The Journal of the Korea Contents Association / v.4 no.4 / pp.133-139 / 2004
  • This paper proposes a Korean sentence boundary detection system that employs the k-nearest neighbor algorithm. We propose three scoring functions for classifying sentence boundaries and perform a comparative analysis. We use domain-independent linguistic features to make the system general and robust. The proposed system was trained and evaluated on two corpora: the ETRI corpus and the KAIST corpus. In the experiments, the proposed system shows about 98.82% precision and 99.09% recall even though it was trained on a relatively small corpus.
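
A k-nearest-neighbor boundary classifier of this general kind might look like the sketch below. The abstract does not describe the paper's features or its three scoring functions, so the three context features, the Hamming distance, and the plain majority vote are all stand-in assumptions, and the training examples are invented.

```python
from collections import Counter

def features(prev_ch, next_ch):
    """Hypothetical context features for a candidate period."""
    return (
        prev_ch.isdigit(),                   # digit before the period
        next_ch.isdigit(),                   # digit after the period
        next_ch == "" or next_ch.isspace(),  # end of text or whitespace follows
    )

def hamming(a, b):
    """Number of positions where two feature tuples differ."""
    return sum(x != y for x, y in zip(a, b))

def knn_is_boundary(prev_ch, next_ch, train, k=3):
    """Majority vote among the k training examples closest to the query."""
    query = features(prev_ch, next_ch)
    ranked = sorted(train, key=lambda ex: hamming(query, features(ex[0], ex[1])))
    votes = Counter(is_boundary for _, _, is_boundary in ranked[:k])
    return votes.most_common(1)[0][0]

def split_sentences(text, train, k=3):
    """Split text at every period that the classifier accepts as a boundary."""
    sentences, start = [], 0
    for i, ch in enumerate(text):
        if ch != ".":
            continue
        prev_ch = text[i - 1] if i > 0 else ""
        next_ch = text[i + 1] if i + 1 < len(text) else ""
        if knn_is_boundary(prev_ch, next_ch, train, k):
            sentences.append(text[start:i + 1].strip())
            start = i + 1
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences

# Invented training examples: (char before ".", char after ".", is_boundary).
TRAIN = [
    ("다", " ", True), ("다", "", True), ("요", " ", True),
    ("3", "5", False), ("2", "0", False), ("1", "4", False),
]
SENTENCES = split_sentences("가격은 3.5달러다. 좋다.", TRAIN)
```

Because the classifier only memorizes labeled examples and compares contexts at query time, it can work with a small training set, which matches the abstract's remark about training on a relatively small corpus.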

A Comparative Study on Korean Connective Morpheme '-myenseo' to the Chinese expression - based on Korean-Chinese parallel corpus (한국어 연결어미 '-면서'와 중국어 대응표현의 대조연구 -한·중 병렬 말뭉치를 기반으로)

  • YI, CHAO
    • Cross-Cultural Studies / v.37 / pp.309-334 / 2014
  • This study uses a Korean-Chinese parallel corpus to contrast the Korean connective morpheme '-myenseo' with its corresponding Chinese expressions. Learners of Korean often struggle with Korean connective morphemes, especially when there is a lexical gap with their mother tongue. '-myenseo' is one of the most frequently used Korean connective morphemes, and it is usually contrasted with Chinese coordinating conjunctions. According to the corpus, however, the Chinese expressions corresponding to '-myenseo' go beyond coordinating conjunctions. Because the parallel corpus reveals a variety of Chinese expressions related to '-myenseo', this study can help Chinese learners of Korean learn '-myenseo' more easily. The study first discusses the semantic features and syntactic characteristics of '-myenseo'. Its significant semantic features are 'simultaneous' and 'conflict', and the study uses usage examples to analyze its specific uses. It then analyzes the syntactic characteristics of '-myenseo' in terms of subject, predicate, temporal, mood, and negation constraints, and summarizes them in a table. The most important part of this study is Chapter 4, which contrasts '-myenseo' with Chinese expressions by analyzing the Korean-Chinese parallel corpus. The frequencies of the Chinese expressions corresponding to '-myenseo' are summarized in a table. The table shows that the most common Chinese counterpart of '-myenseo' is the non-marker pattern. That is, Korean can connect sentences with a connective morpheme, which is an explicit linguistic marker, whereas Chinese often connects them through their intrinsic logical relationships. The chapter concludes that '-myenseo' can correspond to Chinese conjunctions, expressions, non-marker patterns, and liberal translation patterns, a wider range than the Chinese conjunctions identified previously. The last chapter concludes the study, summarizing it and suggesting its limitations and directions for future research.

A Comparative study on the Effectiveness of Segmentation Strategies for Korean Word and Sentence Classification tasks (한국어 단어 및 문장 분류 태스크를 위한 분절 전략의 효과성 연구)

  • Kim, Jin-Sung;Kim, Gyeong-min;Son, Jun-young;Park, Jeongbae;Lim, Heui-seok
    • Journal of the Korea Convergence Society / v.12 no.12 / pp.39-47 / 2021
  • The construction of high-quality input features through effective segmentation is essential for increasing the sentence comprehension of a language model. Improving their quality directly affects the performance of the downstream task. This paper comparatively studies segmentation strategies that effectively reflect the linguistic characteristics of Korean for word and sentence classification. The segmentation types are defined in four categories (eojeol, morpheme, syllable, and subchar), and pre-training is carried out using the RoBERTa model structure. By dividing the tasks into a sentence group and a word group, we analyze the tendency within each group and the difference between the groups. In sentence classification, the model with subchar-level segmentation outperforms the other strategies by up to +0.62% on NSMC, +2.38% on KorNLI, and +2.41% on KorSTS; in word classification, the model with syllable-level segmentation outperforms them by up to +0.7% on NER and +0.61% on SRL. These experimental results confirm the effectiveness of those schemes.
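
Three of the four segmentation levels can be illustrated with the Python standard library alone. This is a sketch under assumptions: "subchar" is taken here to mean the conjoining jamo produced by Unicode NFD decomposition, which may differ from the paper's exact scheme, and the morpheme level is omitted because it requires an external morphological analyzer.

```python
import unicodedata

def segment(text, level):
    """Split Korean text at the eojeol, syllable, or subchar (jamo) level."""
    # Normalize to NFC first so each Hangul syllable is a single code point.
    text = unicodedata.normalize("NFC", text)
    if level == "eojeol":
        # An eojeol is a whitespace-delimited unit.
        return text.split()
    if level == "syllable":
        return [ch for ch in text if not ch.isspace()]
    if level == "subchar":
        # NFD decomposes each precomposed syllable into its conjoining jamo.
        decomposed = unicodedata.normalize("NFD", text)
        return [ch for ch in decomposed if not ch.isspace()]
    raise ValueError(f"unknown level: {level}")

SAMPLE = "한국어 문장"
EOJEOLS = segment(SAMPLE, "eojeol")
SYLLABLES = segment(SAMPLE, "syllable")
SUBCHARS = segment(SAMPLE, "subchar")
```

For the sample phrase, the sequence length grows as the unit shrinks (2 eojeols, 5 syllables, 14 jamo), which shows the trade-off between vocabulary size and sequence length that any comparison of segmentation strategies has to weigh.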

