• Title/Summary/Keyword: 문맥

Search Result 1,005, Processing Time 0.03 seconds

A Method for Detection and Correction of Pseudo-Semantic Errors Due to Typographical Errors (철자오류에 기인한 가의미 오류의 검출 및 교정 방법)

  • Kim, Dong-Joo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.10
    • /
    • pp.173-182
    • /
    • 2013
  • Typographical mistakes made in the writing process of drafts of electronic documents are more common than any other type of errors. The majority of these errors caused by mistyping are regarded as consequently still typo-errors, but a considerable number of them are developed into the grammatical errors and the semantic errors. Pseudo semantic errors among these errors due to typographical errors have more noticeable peculiarities than pure semantic errors between senses of surrounding context words within a sentence. These semantic errors can be detected and corrected by simple algorithm based on the co-occurrence frequency because of their prominent contextual discrepancy. I propose a method for detection and correction based on the co-occurrence frequency in order to detect semantic errors due to typo-errors. The co-occurrence frequency in proposed method is counted for only words with immediate dependency relation, and the cosine similarity measure is used in order to detect pseudo semantic errors. From the presented experimental results, the proposed method is expected to help improve the detecting rate of overall proofreading system by about 2~3%.

Word Sense Disambiguation Based on Local Syntactic Relations and Sense Co-occurrence Information (국소 구문 관계 및 의미 공기 정보에 기반한 명사 의미 모호성 해소)

  • Kim, Young-Kil;Hong, Mun-Pyo;Kim, Chang-Hyun;Seo, Young-Ae;Yang, Seong-Il;Ryu, Chul;Huang, Yin-Xia;Choi, Sung-Kwon;Park, Sang-Kyu
    • Annual Conference on Human and Language Technology
    • /
    • 2002.10e
    • /
    • pp.184-188
    • /
    • 2002
  • 본 논문에서는 단순히 주변에 위치하는 어휘들간의 문맥 공기 정보를 이용하는 방식과는 달리 국소 구문 관계 및 의미 공기 정보에 기반한 명사 의미 모호성 해소 방안을 제안한다. 기존의 WSD 방법은 구조 분석의 어려움으로 인하여 문장의 구문 관계를 충분히 고려하지 못하고 주변 어휘들과의 공기 관계로 그 의미를 파악하려 했다. 그러나 본 논문에서는 동사구의 논항 의미 관계뿐만 아니라 명사구내에서의 의미 관계도 고려한 국소 구문관계를 고려한 명사 의미 모호성 해소 방법을 제안한다. 이 때, 명사들의 의미는 자동번역 시스템의 목적에 맞게 공기(co-occurrence)하는 동사들에 따라 분류하였다. 그리고 한중 자동 번역 지식으로 사용되는 명사 의미 코드가 부착된 74,880 의미 격틀의 의미 공기정보를 이용하였으며 형태소 태깅된 말뭉치로부터 의미모호성이 발생하지 않게 의미 공기정보 및 명사구 의미 공기 정보를 자동으로 추출하였다. 실험 결과, 의미 모호성이 발생하는 명사들에 대해서 83.9%의 의미 모호성 해소 정확률을 보였다.

  • PDF

Part-Of-Speech Tagging using multiple sources of statistical data (이종의 통계정보를 이용한 품사 부착 기법)

  • Cho, Seh-Yeong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.4
    • /
    • pp.501-506
    • /
    • 2008
  • Statistical POS tagging is prone to error, because of the inherent limitations of statistical data, especially single source of data. Therefore it is widely agreed that the possibility of further enhancement lies in exploiting various knowledge sources. However these data sources are bound to be inconsistent to each other. This paper shows the possibility of using maximum entropy model to Korean language POS tagging. We use as the knowledge sources n-gram data and trigger pair data. We show how perplexity measure varies when two knowledge sources are combined using maximum entropy method. The experiment used a trigram model which produced 94.9% accuracy using Hidden Markov Model, and showed increase to 95.6% when combined with trigger pair data using Maximum Entropy method. This clearly shows possibility of further enhancement when various knowledge sources are developed and combined using ME method.

Requirements Redundancy and Inconsistency Analysis for Use Case Modeling (유스케이스 모델링을 위한 요구사항 중복 및 불일치 분석)

  • 최진재;황선영
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.7
    • /
    • pp.869-882
    • /
    • 2004
  • This paper proposes an effective method to create logically consistent and structured requirement model by applying consistency control approach of the formal method to the use-case modeling. This method integrates the multi-perspective scattered requirement segments that may overlap and conflict each other into a structured requirement model. The model structure can be analyzed based on context goal and concerned area overlap analysis. The model consistency can be achieved by using specification overlap-based consistency checking method as an integration vehicle. An experimental application to case study shows that the Proposed method can successfully identify requirement overlaps and inconsistency. It can also transfer multi-viewpoint requirement segments into a consistently integrated use-case model to clarify software behaviors and functionality This method helps users to enhance capability to identify specification inconsistency in the use-case modeling at the early stage of software engineering development. The proposed approach can also facilitate communication between users and developers to ensure customer satisfaction.

An XML Tag Indexing Method Using on Lexical Similarity (XML 태그를 분류에 따른 가중치 결정)

  • Jeong, Hye-Jin;Kim, Yong-Sung
    • The KIPS Transactions:PartB
    • /
    • v.16B no.1
    • /
    • pp.71-78
    • /
    • 2009
  • For more effective index extraction and index weight determination, studies of extracting indices are carried out by using document content as well as structure. However, most of studies are concentrating in calculating the importance of context rather than that of XML tag. These conventional studies determine its importance from the aspect of common sense rather than verifying that through an objective experiment. This paper, for the automatic indexing by using the tag information of XML document that has taken its place as the standard for web document management, classifies major tags of constructing a paper according to its importance and calculates the term weight extracted from the tag of low weight. By using the weight obtained, this paper proposes a method of calculating the final weight while updating the term weight extracted from the tag of high weight. In order to determine more objective weight, this paper tests the tag that user considers as important and reflects it in calculating the weight by classifying its importance according to the result. Then by comparing with the search performance while using the index weight calculated by applying a method of determining existing tag importance, it verifies effectiveness of the index weight calculated by applying the method proposed in this paper.

A Reranking Model for Korean Morphological Analysis Based on Sequence-to-Sequence Model (Sequence-to-Sequence 모델 기반으로 한 한국어 형태소 분석의 재순위화 모델)

  • Choi, Yong-Seok;Lee, Kong Joo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.4
    • /
    • pp.121-128
    • /
    • 2018
  • A Korean morphological analyzer adopts sequence-to-sequence (seq2seq) model, which can generate an output sequence of different length from an input. In general, a seq2seq based Korean morphological analyzer takes a syllable-unit based sequence as an input, and output a syllable-unit based sequence. Syllable-based morphological analysis has the advantage that unknown words can be easily handled, but has the disadvantages that morpheme-based information is ignored. In this paper, we propose a reranking model as a post-processor of seq2seq model that can improve the accuracy of morphological analysis. The seq2seq based morphological analyzer can generate K results by using a beam-search method. The reranking model exploits morpheme-unit embedding information as well as n-gram of morphemes in order to reorder K results. The experimental results show that the reranking model can improve 1.17% F1 score comparing with the original seq2seq model.

Improvement of korean Braille-Code System for Automatic Reverse Braille Translation (자동 역점역을 가능하게 하는 한글점자 부호체계의 개선)

  • Kihi, Tae-Yeong;Kim, Suk-Il;Kim, Hong-Gi
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.3
    • /
    • pp.703-714
    • /
    • 1998
  • Translation of a Korean text into a braille text causes no problem under the existing Korean braille-code System that maintains I: I correspondence between Korean characters and braille codes. However, reverse translation of a braille text into a Korean text would cause unavoidable mis-translation due to I : N correspondences between braille codes and Korean characters. The analysis shows that in Korean braille-code System, the major reasons for mis-translation are the conflicts between Korean Chosung (initial consonant) letters and numbers, between Korean Jongsung (final consonant) letters and punctuations, between Korean Jongsung (final consonant) letters and English mode delimiter codes, and overlapping use of the same braille code such as opening and closing parentheses. In this paper, we firstly established a reverse translation rule, that is to use an ordinary braille code unless the braille code causes any conflict in braille code sequences. If any ordinary braille code may cause some conflict during the reverse translation, an extended braille code that we propose in this paper must be used. In the experimentation, we cannot get any translation errors with the newly designed braille-code System compared with 25% of errors with the existing braille-code System.

  • PDF

Analysis of Effect of Learning to Solve Word Problems through a Structure-Representation Instruction. (문장제 해결에서 구조-표현을 강조한 학습의 교수학적 효과 분석)

  • 이종희;김부미
    • School Mathematics
    • /
    • v.5 no.3
    • /
    • pp.361-384
    • /
    • 2003
  • The purpose of this study was to investigate students' problem solving process based on the model of IDEAL if they learn to solve word problems of simultaneous linear equations through structure-representation instruction. The problem solving model of IDEAL is followed by stages; identifying problems(I), defining problems(D), exploring alternative approaches(E), acting on a plan(A). 160 second-grade students of middle schools participated in a study was classified into those of (a) a control group receiving no explicit instruction of structure-representation in word problem solving, and (b) a group receiving structure-representation instruction followed by IDEAL. As a result of this study, a structure-representation instruction improved word-problem solving performance and the students taught by the structure-representation approach discriminate more sharply equivalent problem, isomorphic problem and similar problem than the students of a control group. Also, students of the group instructed by structure-representation approach have less errors in understanding contexts and using data, in transferring mathematical symbol from internal learning relation of word problem and in setting up an equation than the students of a control group. Especially, this study shows that the model of direct transformation and the model of structure-schema in students' problem solving process of I and D stages.

  • PDF

The Development and the Application of Product Design Database for Product Opportunity Analysis (제품기회탐색을 위한 제품디자인 데이터베이스 구축과 이의 활용)

  • 박정순;이건표
    • Archives of design research
    • /
    • v.12 no.4
    • /
    • pp.119-128
    • /
    • 1999
  • Product opportunity analysis in product planning is to analyze the feasibility for success over the detail product concept, and to grasp the new possible market based on exploring the trends of market and product itself. Therefore, the correct analysis and insight with various data of product and market is needed for product opportunity analysis. As product environment changes rapidly, it is especially important to collect more plentiful informations, and to put these information to practical use pertinently. It is consequently indispensible to clarify the types of information to be needed and to construct product database. However, there has no meaning to gather simple information which is lying here and there. Product database has to be systematically organized and each product information is to be transformed into contextual one. This study clarifies a conceptual framework of product design database based on product attributes and develops prototype of product database for product planning. Case study of camera is exemplified for analyzing the product trends and exploring product opportunity with developed product database.

  • PDF

A Entropy Coding Method using Temporal and Spatial Correlation on HEVC (HEVC에서 시공간적 상관관계를 이용한 엔트로피 부호화 방법)

  • Kim, Tae-Ryong;Kim, Kyung-Yong;Lee, Han-Soo;Park, Gwang-Hoon
    • Journal of Broadcast Engineering
    • /
    • v.17 no.1
    • /
    • pp.191-194
    • /
    • 2012
  • The split flag and the skip flag in CU syntax have high correlation on spatial domain as well as temporal domain. This paper suggests a method for enhancing coding efficiency by using not only spatial correlation but also temporal correlation when coding CU information. In the CABAC case, temporal collocated CU information is used for selecting context model of the split flag and the skip flag. In the CAVLC case, current CU information is estimated from temporal collocated CU information then encoded. As a result, a coding efficiency was increased by 0.1%~0.6% in CABAC, 0.1%~0.4% in CAVLC compared with HM 3.0. This method shows better performance on lowdelay condition which uses reference frame close to current frame.