• Title/Summary/Keyword: 유사문장 비교

Search Result 109, Processing Time 0.024 seconds

An Use of the Patterns for an Efficient Example-Based Machine Translation (효율적인 예제 기반 기계번역을 위한 패턴의 사용)

  • Lee, Gi-Yeong;Kim, Han-U
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.37 no.3
    • /
    • pp.1-11
    • /
    • 2000
  • An example-based machine translation approach is a new paradigm for resolving various problems caused by the rules of conventional rule-based machine translation. But, in pure example-based machine translation, it is very hard to find similar examples matched with input sentences by using reasonable parallel corpus. This problem causes large overheads in the process of sentence generation. This paper proposes new method of English-Korean transfer using both patterns and examples. The patterns are composed of sentence patterns and phrase patterns. Meta parts of the patterns make the example-based machine translation more practical by raising the probability to find similar examples. The use of patterns and examples can reduce the ambiguities in source language analysis and give us a high quality of MT. And experimental results with a test corpus are discussed.

  • PDF

Teacher's Perception of Activity Materials in Housing Area of Middle School Technology & Home Economics Textbook (중학교 기술.가정 주생활영역 활동자료에 대한 교사의 인식)

  • Lee, Young-Doo;Cho, Jea-Soon
    • Journal of Korean Home Economics Education Association
    • /
    • v.20 no.3
    • /
    • pp.215-230
    • /
    • 2008
  • Activity materials in textbook could facilitate students' oriented self-help learning. The purpose of this paper is to find out characteristics of activity materials in the housing area of middle school Technology and Home Economics and teacher's perception of them. The data were collected from 253 middle school teachers who had ever taught the housing unit in any of 6 textbooks. The results showed that the number of activity materials were differed by the characteristics of the materials such as type of materials, feature of non sentence materials, and type of activity, depend on authors as well as textbooks. In general, teachers interests in the materials were higher than those of students even the trends of the interests were the same. Adequacy of activity contents and related knowledge of teachers were higher than adequacy of level. Teachers thought time and extra search beyond class were barrier to full the interests of students. Further research is suggested to find out whether higher interests in the materials are related to the higher activating rate of them.

  • PDF

System Implement to Identify Copyright Infringement Based on the Text Reference Point (텍스트 기준점 기반의 저작권 침해 판단 시스템 구현)

  • Choi, Kyung-Ung;Park, Soon-Cheol;Yang, Seung-Won
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.15 no.1
    • /
    • pp.77-84
    • /
    • 2015
  • Most of the existing methods make the index key with every 6 words in every sentence in a document in order to identify copyright infringement between two documents. However, these methods has the disadvantage to take a long time to inspect the copyright infringement because of the long indexing time for the large-scale document. In this paper, we propose a method to select the longest word (called a feature bock) as an index key in the predetermined-sized window which scans a document character by character. This method can be characterized by removing duplicate blocks in the process of scanning a document, dramatically reducing the number of the index keys. The system with this method can find the copyright infringement positions of two documents very accurately and quickly since relatively small number of blocks are compared.

An English Essay Scoring System Based on Grammaticality and Lexical Cohesion (문법성과 어휘 응집성 기반의 영어 작문 평가 시스템)

  • Kim, Dong-Sung;Kim, Sang-Chul;Chae, Hee-Rahk
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.3
    • /
    • pp.223-255
    • /
    • 2008
  • In this paper, we introduce an automatic system of scoring English essays. The system is comprised of three main components: a spelling checker, a grammar checker and a lexical cohesion checker. We have used such resources as WordNet, Link Grammar/parser and Roget's thesaurus for these components. The usefulness of an automatic scoring system depends on its reliability. To measure reliability, we compared the results of automatic scoring with those of manual scoring, on the basis of the Kappa statistics and the Multi-facet Rasch Model. The statistical data obtained from the comparison showed that the scoring system is as reliable as professional human graders. This system deals with textual units rather than sentential units and checks not only formal properties of a text but also its contents.

  • PDF

Neural Network Model for Named Entitiy Linking using Wikipedia Link Data (위키피디아 링크 데이터를 이용한 Neural Network Model 기반 한국어 개체명 연결)

  • Lee, Young-Hoon;Na, Seung-Hoon
    • Annual Conference on Human and Language Technology
    • /
    • 2018.10a
    • /
    • pp.163-166
    • /
    • 2018
  • 개체명 연결이란 주어진 문장에 출현한 단어를 위키피디아와 같은 지식 기반 상의 하나의 개체와 연결하여 특정 개체가 무엇인지 식별하여 모호성을 해결하는 작업이다. 본 연구에서는 위키피디아의 링크를 이용하여 개체 표현(Entity mention)과 학습 데이터, 지식 기반을 구축한다. 또한, Mention/Context 쌍의 표현과 Entity 표현의 코사인 유사도를 이용하여 Score를 구하고, 이를 통해 개체명 연결 문제를 랭킹 문제로 변환한다. 개체의 이름과 분류뿐만 아니라 개체의 설명, 개체 임베딩 등의 자질을 이용하여 모델을 확장하고 결과를 비교한다. 확장된 모델의 개체 링킹 성능은 89.63%의 정확도를 보였다.

  • PDF

An Application for Sharing Travel Activities Information by Using Deep Learning Models (딥러닝 모델을 활용한 관광지 활동 정보 공유 애플리케이션 )

  • Jiho Shin;Eunhye Gwon;Byungook Ryu;Byungjeong Lee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.11a
    • /
    • pp.319-320
    • /
    • 2023
  • 일반적인 여행 커뮤니티는 사진과 텍스트 기반의 사용자 리뷰를 바탕으로 정보 공유를 한다. 본 연구에서는 관광지에서 수행한 활동을 한 문장의 형태로 공유하는 애플리케이션을 제안한다. ChatGPT를 활용하여 활동을 산책, 사진, 음식 등 9가지 태그로 분류하여 관광지가 가지는 특징을 용이하게 파악한다. 또한, 사용자가 작성한 활동을 임베딩하고 관광지 소개 글 벡터와 유사도를 비교하여 관광지를 추천한다. 본 애플리케이션을 통해 사용자가 긴 설명이나 사진 없이 관광지가 가지는 정보를 쉽게 공유하고 관광지 추천을 하는 새로운 여행 커뮤니티를 제공할 수 있을 것으로 기대한다.

A Sentiment Analysis of Internet Movie Reviews Using String Kernels (문자열 커널을 이용한 인터넷 영화평의 감정 분석)

  • Kim, Sang-Do;Yoon, Hee-Geun;Park, Seong-Bae;Park, Se-Young;Lee, Sang-Jo
    • Annual Conference on Human and Language Technology
    • /
    • 2009.10a
    • /
    • pp.56-60
    • /
    • 2009
  • 오늘날 인터넷은 개인의 감정, 의견을 서로 공유할 수 있는 공간이 되고 있다. 하지만 인터넷에는 너무나 방대한 문서가 존재하기 때문에 다른 사용자들의 감정, 의견 정보를 개인의 의사 결정에 활용하기가 쉽지 않다. 최근 들어 감정이나 의견을 자동으로 추출하기 위한 연구가 활발하게 진행되고 있으며, 감정 분석에 관한 기존 연구들은 대부분 어구의 극성(polarity) 정보가 있는 감정 사전을 사용하고 있다. 하지만 인터넷에는 나날이 신조어가 새로 생기고 언어 파괴 현상이 자주 일어나기 때문에 사전에 기반한 방법은 한계가 있다. 본 논문은 감정 분석 문제를 긍정과 부정으로 구분하는 이진 분류 문제로 본다. 이진 분류 문제에서 탁월한 성능을 보이는 Support Vector Machines(SVM)을 사용하며, 문서들 간의 유사도 계산을 위해 문장의 부분 문자열을 비교하는 문자열 커널을 사용한다. 실험 결과, 실제 영화평에서 제안된 모델이 비교 대상으로 삼은 Bag of Words(BOW) 모델보다 안정적인 성능을 보였다.

  • PDF

A Study on the Correlation between Sound Spectrogram and Sasang Constitution (성문(聲紋)과 사상체질(四象體質)과의 상관성(相關性)에 관(關)한 연구(硏究))

  • Yang, Seung-hyun;Kim, Dal Lae
    • Journal of Sasang Constitutional Medicine
    • /
    • v.8 no.2
    • /
    • pp.191-202
    • /
    • 1996
  • Sasang constitution classification is very important subject, so many medical men studied the Sasang constitution classification but there is no certain method to classify objectively. And the purpose of this study is to help classifying Sasang constitution through correlation with sound spectrogram. This study was done it under the suppose that Sasang costitution hag correlation with sound spectrogram. The following results were obtained about correlation between sound spectrogram and Sasang constitution by comparison and analysis the pitch and reading speed of Sasang constitutions; 1. There was a similar tendency in the composition reading speed between taeeumin, soeumin and soyangin. 2. Taeeumin's center was lower measured more than soeumin's and soyangin's in the pitch graph and graph by normal curve fit and there was a similar tendency between soeumin and soyangin. 3. There was a similar tendency in the pitch graph's width between all constitutions. 4. There was a significant difference between taeeumin and soeum in the mean of three constitution's pitch, this means that taeeumin uses lower voice more than soeumin. According to the results, it is considered that there is a correlation between pitch of sound spectrogram and Sasang constitution. And method of Sasang constitution classification through sound spectrogram analysis can be one method as assistant for the objectification of Sasang constitution classification.

  • PDF

Voice Personality Transformation Using an Optimum Classification and Transformation (최적 분류 변환을 이용한 음성 개성 변환)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.5
    • /
    • pp.400-409
    • /
    • 2004
  • In this paper. a voice personality transformation method is proposed. which makes one person's voice sound like another person's voice. To transform the voice personality. vocal tract transfer function is used as a transformation parameter. Comparing with previous methods. the proposed method makes transformed speech closer to target speaker's voice in both subjective and objective points of view. Conversion between vocal tract transfer functions is implemented by classification of entire vector space followed by linear transformation for each cluster. LPC cepstrum is used as a feature parameter. A joint classification and transformation method is proposed, where optimum clusters and transformation matrices are simultaneously estimated in the sense of a minimum mean square error criterion. To evaluate the performance of the proposed method. transformation rules are generated from 150 sentences uttered by three male and on female speakers. These rules are then applied to another 150 sentences uttered by the same speakers. and objective evaluation and subjective listening tests are performed.

Bukpo's History and Transition of the Hemp Fabric Production Technique (북포(北布)의 내력과 제섬(製纖) 기술의 변천)

  • Kong, Sang-Hui
    • Korean Journal of Heritage: History & Science
    • /
    • v.50 no.3
    • /
    • pp.44-63
    • /
    • 2017
  • 'Bukpo' is called 'Tongpo' or 'Balnaepo,' which respectively mean hemp fabric that goes into a small bamboo tube and women's table utensil 'bari' in Chosen. It is fine hemp fabric produced in Yukjin, Hamgyeong province. Korea has been divided into North and South since the Korean War in 1950. As it is hard to get information about Northern life style or their traditional technology, their hemp fabric production is also left unknown. This study demonstrates characteristics of the production of 'Bukpo' through "Ojuyeonmunjangjeonsango", the only document that marked about 'Bukpo' making process of the late Chosen dynasty. It aims to analyze the transition of the technique and the meaning by comparing the characteristics of the production of 'Bukpo' with the modern era's documents. In this process, I discovered that the hemp fabric production technique at 19th century shares some sort of similarities with that of Europe or Chinese Miao(hmong). But the hemp fabric production technique changed before the 20th century. The evolution of Northern hemp fabric production technique can be a good example to examine the context of the traditional craft technique.