• Title/Summary/Keyword: Annotation ambiguity

Design of an Interface for Explicit Free-form Annotation Creation (명확한 free-form annotation 생성을 위한 인터페이스 설계)

  • 손원성;김재경;최윤철;임순범
    • Proceedings of the Korean Information Science Society Conference / 2002.10d / pp.139-141 / 2002
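  • To create accurate annotation information in a free-form annotation environment, the system must be able to recognize and resolve the ambiguity that arises when analyzing the relationship between the geometric information of a free-form marking and the annotated part. This paper first analyzes the ambiguities that can occur between free-form markings and various contexts in an XML-based annotation environment, and then proposes an annotation correction technique to resolve them. The proposed technique is based on contexts that include various textual and document-structure information between the free-form marking and the annotated part, and its results are output and exchanged through the annotation system implemented in this study. As a result, free-form marking information created with the proposed technique covers the annotated part intended by the user better than existing techniques, and can therefore guarantee unambiguous exchange results across multiple users and different document environments.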

A Method of Context based Free-form Annotation in XML Documents (XML문서 환경에서의 내용기반 자유형 Annotation 생성 기법)

  • 손원성;김재경;임순범;최윤철
    • Journal of KIISE:Software and Applications / v.30 no.9 / pp.850-861 / 2003
  • When creating annotation information in a free-form environment, ambiguity arises during the analysis stage between geometric information and the annotations. This ambiguity must be resolved before annotation information can be created accurately in a free-form annotation environment. This paper identifies and analyzes the ambiguities, specifying methods tailored to each of the various contexts that can conflict with free-form marking in an XML-based annotation environment. The proposed general method is based on context, which includes various textual and structural information between the free-form marking and the annotations themselves. The context information used is expressed in an XML-based DTD. The results are printed and shared through a system implemented specifically for this study. The results from the implementation of the proposed method show that the annotated areas included in the free-form marking information are more accurate, achieving more accurate exchange results among multiple users in a heterogeneous document environment.

Annotation Modeling and System Implementation for Hand-held Environment (휴대용 단말기 환경을 위한 Annotation 모델링 및 시스템 구현)

  • Sohn, Won-Sung
    • Journal of The Korean Association of Information Education / v.10 no.2 / pp.219-226 / 2006
  • For the accurate creation of annotation information in a free-form annotation environment, the ambiguity that arises in the analysis stage between the geometric information and annotations needs to be resolved. This paper identifies and analyzes the ambiguity that can occur between free-form marking and various contexts in an XML-based annotation environment, and presents methods for resolving it. The proposed method is based on context, which includes various textual and structural information between the free-form marking and the annotated part. Results with the proposed method show that the annotated areas included in the free-form marking information are more accurate, achieving more accurate exchange results among multiple users in a heterogeneous document environment. This study can be effectively applied to eLearning, Cyber-Class, and IETM.

Modeling and Implementation of Context based Annotation for XML Documents

  • Sohn, Won-Sung;Ko, Myeong-Cheol;Kim, Jae-Kyung;Lim, Soon-Bum;Choy, Yoon-Chul
    • Journal of Korea Multimedia Society / v.6 no.4 / pp.565-575 / 2003
  • This paper proposes a context-based annotation model and annotation ambiguity correction methods. The proposed model provides various annotation types, semantic models, and a pen-based free-drawing interface. The annotation correction method is based on the context, which includes various textual and structural information between the free-form marking and the annotation. An interface for XML environments that uses the proposed model and correction methods is also proposed, and its possible applications are examined. The results from the implementation of the proposed method show that the annotated areas included in the free-form marking information are more accurate, achieving more accurate exchange results among multiple users in a heterogeneous document environment.

The Semantics of Semantic Annotation

  • Bunt, Harry
    • Proceedings of the Korean Society for Language and Information Conference / 2007.11a / pp.13-28 / 2007
  • This is a speculative paper, describing a recently started effort to give a formal semantics to semantic annotation schemes. Semantic annotations are intended to capture certain semantic information in a text, which means that it only makes sense to use semantic annotations if these have a well-defined semantics. In practice, however, semantic annotation schemes are used that lack any formal semantics. In this paper we outline how existing approaches to the annotation of temporal information, semantic roles, and reference relations can be integrated in a single XML-based format and can be given a formal semantics by translating them into second-order logic. This is argued to offer an incremental approach to the incorporation of semantic information in natural language processing that does not suffer from the problems of ambiguity and lack of robustness that are common to traditional approaches to computational semantics.

A Study on Ambiguity Resolving for Pen-based Proofreading of Web Documents (펜 기반 웹 문서 교정을 위한 모호성 문제 해결에 관한 연구)

  • Sohn, Won-Sung
    • Journal of The Korean Association of Information Education / v.11 no.1 / pp.107-116 / 2007
  • To produce accurate editing results, the ambiguity of the editing scopes associated with marked correction signs must be resolved. Proofreading a web document modifies its structure, and the modified structure must remain valid against the defined DTD. This paper presents a pen-based proofreading interface for XML documents. In the proposed interface, correction signs are drawn freehand, and the editing scopes are recognized and revised based on the contexts of the document to minimize their ambiguity. The proposed interface provides both implicit and explicit modification methods for document structures. As a result, the editing scopes processed in the proposed interface are more accurate, and the document structures remain valid against the DTD after editing.

A Pen-based Proofreading Interface in XML Documents (XML 문서에서의 펜 기반 교정 인터페이스)

  • Sohn Won-Sung;Kim Jae-Kyung;Choy Yoon-Chul;Lim Soon-Bum;Kim Woo-Sung
    • Journal of KIISE:Software and Applications / v.33 no.2 / pp.231-242 / 2006
  • An accurate proofreading interface requires resolving the ambiguity that occurs when the system determines the relations between the free-form marks drawn by the user and the editing scopes of the document. Proofreading structured documents such as XML/XHTML involves modifying document structures, and the modified document must still follow the pre-defined DTD. This paper presents a CPI (Context-based Proofreading Interface) based on XML. The CPI supports free-form drawing of correction marks and provides context-based scope recognition and revision methods. The CPI provides both implicit and explicit modification methods for document structures. As a result, the correction mark information produced in this paper includes more accurate scope information than that in other systems.

Automated Classification of PubMed Texts for Disambiguated Annotation Using Text and Data Mining

  • Choi, Yun-Jeong;Park, Seung-Soo
    • Proceedings of the Korean Society for Bioinformatics Conference / 2005.09a / pp.101-106 / 2005
  • As the body of genetic knowledge grows ever faster, automated analysis and systemization into high-throughput databases has become a hot issue. One essential task is to recognize and identify genomic entities and discover their relations. However, the ambiguity of named entities is a serious problem because of their multiplicity of meanings and types. Many effective techniques have been proposed to analyze documents, yet their accuracy is high only when the data fits the model well. The purpose of this paper is to design and implement a document classification system for identifying entity problems using a combination of text and data mining, supplemented by rich data mining algorithms to enhance its performance. We propose RTP ost system, which differs in style from traditional methods and takes a fault-tolerant system approach and a data mining strategy. This feedback cycle can enhance the accuracy of the text mining. To verify feasibility, we tested our system by classifying RB-related documents in PubMed abstracts.

Ranking Tag Pairs for Music Recommendation Using Acoustic Similarity

  • Lee, Jaesung;Kim, Dae-Won
    • International Journal of Fuzzy Logic and Intelligent Systems / v.15 no.3 / pp.159-165 / 2015
  • The need for the recognition of music emotion has become apparent in many music information retrieval applications. In addition to the large pool of techniques that have already been developed in machine learning and data mining, various emerging applications have led to a wealth of newly proposed techniques. In the music information retrieval community, many studies and applications have concentrated on tag-based music recommendation. The limitation of music emotion tags is the ambiguity caused by a single music tag covering too many subcategories. To overcome this, multiple tags can be used simultaneously to specify music clips more precisely. In this paper, we propose a novel technique to rank the proper tag combinations based on the acoustic similarity of music clips.

Building an Annotated English-Vietnamese Parallel Corpus for Training Vietnamese-related NLPs

  • Dien Dinh;Kiem Hoang
    • Proceedings of the IEEK Conference / summer / pp.103-109 / 2004
  • In NLP (Natural Language Processing) tasks, the greatest difficulty computers face is the built-in ambiguity of natural languages. Formerly, disambiguation relied on human-devised rules. Building a complete rule set is a time-consuming and labor-intensive task, and it still does not cover all cases. Moreover, as the scale of the system increases, the rule set becomes very difficult to control. For these reasons, many NLP tasks have recently shifted from rule-based approaches to corpus-based approaches with large annotated corpora. Corpus-based NLP tasks for popular languages such as English and French have been well studied, with satisfactory achievements. In contrast, corpus-based NLP tasks for Vietnamese are at a deadlock due to the absence of annotated training data. Furthermore, hand-annotation of even reasonably well-determined features such as part-of-speech (POS) tags has proved to be labor-intensive and costly. In this paper, we present the construction of an annotated English-Vietnamese parallel aligned corpus, named EVC, for training Vietnamese-related NLP tasks such as word segmentation, POS tagging, word order transfer, word sense disambiguation, and English-to-Vietnamese machine translation.
