• Title/Summary/Keyword: Annotation Markup Language

Search Result 11, Processing Time 0.027 seconds

From Tombstones to Corpora: TSML for Research on Language, Culture, Identity and Gender Differences

  • Streiter, Oliver;Voltmer, Leonhard;Goudin, Yoann
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2007.11a
    • /
    • pp.450-458
    • /
    • 2007
  • Tombstone inscriptions represent a linguistic genre which yields insights in culture and language. Creating corpora from tombstones is thus a complementary approach for the study of languages and cultures. For the annotation of tombstone corpora, we propose TSML, the Tombstone-Markup-Language, developed during the massive annotation of Taiwanese tombstones and a number of tombstones from China, Indonesia and Europe. We discuss our conceptual framework in the annotation of tombstones and derive successively and present preliminary research data to show how the usefulness of the annotations. Finally, we will encourage researchers to participate in the specification of TSML to obtain soon an annotation language for annotations across cultures and languages.

  • PDF

Design of GeoPhoto Contents Markup Language for u-GIS Contents (u-GIS 콘텐츠를 위한 GeoPhoto 콘텐츠 언어의 설계)

  • Park, Jang-Yoo;Nam, Kwang-Woo;Jin, Heui-Chae
    • Journal of Korea Spatial Information System Society
    • /
    • v.11 no.1
    • /
    • pp.35-42
    • /
    • 2009
  • This paper proposes a new GeoPhoto contents markup language that can create u-GIS contents by using the spatial photos. GeoPhoto contents markup language has designed the GeoPhoto contents model and markup language for contents that can be used the spatial photos information. GeoPhoto con tents markup language is represented by the convergence of GIS information, location information, photos information respectively. GeoPhoto contents markup language to provide a variety of pictures related to the content model consists of GeoPhoto contents model and operations between the GeoPhoto contents. GeoPh oto contents model supports GeoPhoto model, CubicPhoto model, Photo model and SequenceGeoPhoto mod el. In addition, this paper propose the Annotation operation, Enlargement operation and Overlay operation for represent the GeoPhoto contents. GeoPhoto Contents Markup Language has the advantage of supportin g user custom contents model of u-GIS.

  • PDF

Annotation Modeling and System Implementation for Hand-held Environment (휴대용 단말기 환경을 위한 Annotation 모델링 및 시스템 구현)

  • Sohn, Won-Sung
    • Journal of The Korean Association of Information Education
    • /
    • v.10 no.2
    • /
    • pp.219-226
    • /
    • 2006
  • For the accurate creation of annotation information in a free-form annotation environment, the ambiguity that arises in the analysis stage between the geometric information and annotations needs to be resolved. Therefore, this This paper identifies, analyzes, and proposes presents solutions methods for the ambiguity that can occur between free-form marking and various contexts in XML-based annotation environment. The proposed method is based on context which includes various textual and structure information between free-form marking and annotated part. The proposed method show that the annotated portions areas included in the free-form marking information are more accurate, achieving more accurate exchange results amongst multiple users in a heterogeneous document environment. This study can be effectively applied to eLearning, Cyber-Class, and IETM

  • PDF

Design & Implementation of a Motion Capture Database Based on Motion Ontologies (온톨로지 기반의 모션 캡처 데이터베이스 설계 및 구현)

  • Chung Hyun-Sook
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.5
    • /
    • pp.618-632
    • /
    • 2005
  • A framework for semantic annotation oi human motion sequences is proposed in this paper. Motion capture technology is widely used for manuiacturing animation since it produces high qualify character motion similar to the actual motion of the human body. However, motion capture has a significant weakness due to the lack of an industry wide standard for archiving and retrieving motion capture data. It is difficult for animators to retrieve the desired motion sequences from motion capture files as there is no semantic annotation on already captured motion data. Our goal is to improve the reusability of motion capture data. To archive our goal first, we propose a standard format for integrating different motion capture file formals. Our standard format is called MCML (Motion Capture Markup Language). It is a markup language based on XML (extensible Markup Language). The purpose of MCML is not only to facilitate the conversion or integration of different formats, but also to allow for greater reusability of motion capture data, through the construction of a motion database storing the MCML documents Second, we define motion ontologies that are used to annotate and semantically organize human motion sequences. This ontology-based approach provides the means for discovering and exploiting the information and knowledge surrounding motion capture data.

  • PDF

Construction of Dialogue Corpus and Structured Documentation of Annotation Information (대화 코퍼스의 구축 및 주석 정보의 구조적 문서화)

  • 강창규;김영일;김봉완;이용주
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2003.11a
    • /
    • pp.269-272
    • /
    • 2003
  • 음성인식의 연구 대상은 낭독음성에서 대화음성으로 발전해가고 있다. 이를 위해서는 대량의 대화코퍼스가 필요하다. 그러나 아직 충분한 양의 대화코퍼스가 구축되어 있지 못하며 코퍼스의 주석 정보 또한 복잡하고 다양하게 표현하고 있어 효율적인 활용이 어렵다. 따라서 본 논문에서는 대화 영역으로 텔래뱅킹 영역을 설정하고 대화코퍼스를 구축하여 구축된 대화코퍼스의 주석 정보를 XML(Extensible Markup Language)로 표준화할 수 있도록 DTD(Document Type Definition)를 정의하여 문서 구조화하였다.

  • PDF

The Korean TimeML: A Study of Event and Temporal Information in Korean Text (한국어 TimeML-텍스트의 사건 및 시간 정보 연구)

  • You, Hyun-Jo;Jang, Ha-Yeon;Jo, Yu-Mi;Kim, Yoon-Shin;Nam, Seung-Ho;Shin, Hyo-Pil
    • Language and Information
    • /
    • v.15 no.1
    • /
    • pp.31-62
    • /
    • 2011
  • TimeML is a markup language for events and temporal expressions in natural language, proposed in Pustejovsky et al. (2003) and latter standardized as ISO-TimeML (ISO 24617-1:2009). In this paper, we propose the further specification of ISO-TimeML for the Korean language with the concrete and thorough examination of real world texts. Since Korean differs significantly from English, which is the first and almost only extensively tested language with TimeML, one continuously run into theoretical and practical difficulties in the application of TimeML to Korean. We focus on the discussion for the consistent and efficient application of TimeML: how to consistently apply TimeML in accordance with Korean specificity and what to be annotated and what not to be, i.e. which information is meaningful in the temporal interpretation of Korean text, for efficient application of TimeML.

  • PDF

Semantic Types and Representation of Korean Set Time Expressions (한국어 집합 시간 표현의 의미 유형과 표상)

  • Kim, Mun-Hyong;Jo, Yu-Mi;You, Hyun-Jo;Jang, Ha-Yeon;Kim, Yoon-Shin;Nam, Seung-Ho;Shin, Hyo-Pil
    • Language and Information
    • /
    • v.16 no.1
    • /
    • pp.25-43
    • /
    • 2012
  • This study introduces set-denoting time expressions in Korean, which can be divided into simple and complex types. It was found that while the simple type expressions are easily represented within ISO-TimeML, a time-expression markup language, some complex type set-denoting expressions are not. Therefore, this study analyzes the reason for these difficulties in representing complex type expressions, as well as suggests the introduction of @measure and @interpretation attributes in the TIMEX3 tag. The @measure attribute represents the time interval, and the @interpretation attribute is used to distinguish distributive readings from cumulative readings. Additionally this paper suggests that a mapping between these and other attributes are required in TLINK.

  • PDF

Recent Development in Text-based Medical Image Retrieval (텍스트 기반 의료영상 검색의 최근 발전)

  • Hwang, Kyung Hoon;Lee, Haejun;Koh, Geon;Kim, Seog Gyun;Sun, Yong Han;Choi, Duckjoo
    • Journal of Biomedical Engineering Research
    • /
    • v.36 no.3
    • /
    • pp.55-60
    • /
    • 2015
  • An effective image retrieval system is required as the amount of medical imaging data is increasing recently. Authors reviewed the recent development of text-based medical image retrieval including the use of controlled vocabularies - RadLex (Radiology Lexicon), FMA (Foundational Model of Anatomy), etc - natural language processing, semantic ontology, and image annotation and markup.

Development of Bioinformatic Database and Converting Tools based on BSML (BSML 기반의 유전자 데이터베이스와 변환기의 구축)

  • 윤애란;이수정;이희전;용환승
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.04a
    • /
    • pp.638-640
    • /
    • 2003
  • 최근 바이오인포매틱스 분야의 발전에 따라 방대한 양의 유전체 데이터에 대한 연구가 진행되고 있으며, 이러한 데이터를 효율적으로 다루기 위해 다양한 형태의 파일과 데이터베이스들이 사용되고 있다. 하지만 표준화의 미비로 인하여 데이터의 관리와 변환에 어려움이 많다. 본 논문에서는 이러한 문제점을 해결하기 위하여 바이오인포매틱스 데이터를 다루기 위한 표준으로 다양한 XML 포맷들 중에서 BSML(Bioinformatic Sequence Markup Language)을 채택하고, Genbank 파일을 변환하여 관계형 데이터베이스에 저장하는 모듈을 개발한다. 또한 관계형 데이터베이스 형태의 유전체 데이터를 BSML 형태로, Genbank 파일 형태를 BSML 형태로 그리고 AGAVE(Architecture for Genomic Annotation)파일 형태를 BSML 형태로 변환하는 변환기롤 개발하고자 한다.

  • PDF

W3C based Interoperable Multimodal Communicator (W3C 기반 상호연동 가능한 멀티모달 커뮤니케이터)

  • Park, Daemin;Gwon, Daehyeok;Choi, Jinhuyck;Lee, Injae;Choi, Haechul
    • Journal of Broadcast Engineering
    • /
    • v.20 no.1
    • /
    • pp.140-152
    • /
    • 2015
  • HCI(Human Computer Interaction) enables the interaction between people and computers by using a human-familiar interface called as Modality. Recently, to provide an optimal interface according to various devices and service environment, an advanced HCI method using multiple modalities is intensively studied. However, the multimodal interface has difficulties that modalities have different data formats and are hard to be cooperated efficiently. To solve this problem, a multimodal communicator is introduced, which is based on EMMA(Extensible Multimodal Annotation Markup language) and MMI(Multimodal Interaction Framework) of W3C(World Wide Web Consortium) standards. This standard based framework consisting of modality component, interaction manager, and presentation component makes multiple modalities interoperable and provides a wide expansion capability for other modalities. Experimental results show that the multimodal communicator is facilitated by using multiple modalities of eye tracking and gesture recognition for a map browsing scenario.