• Title/Summary/Keyword: HTML 문서

Search Result 396, Processing Time 0.043 seconds

An Efficient Method for Logical Structure Analysis of HTML Tables (HTML 테이블의 논리적 구조분석을 위한 효율적인 방법)

  • Kim Yeon-Seok;Lee Kyong-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.9
    • /
    • pp.1231-1246
    • /
    • 2006
  • HTML is a format for rendering Web documents visually and uses tables to present a relational information. Since HTML has limits in terms of information processing and management by a computer, it is important to transform HTML tables into XML documents, which is able to represent logical structure information. As a prerequisite for extracting information from the Web, this paper presents an efficient method for extracting logical structures from HTML tables and transforming them into XML documents. The proposed method consists of two phases: Area segmentation and structure analysis. The area segmentation step removes noisy areas and extracts attribute and value areas through visual and semantic coherency checkup. The hierarchical structure between attribute and value areas are analyzed and transformed into XML representations using a proposed table model. Experimental results with 1,180 HTML tables show that the proposed method performs better than the conventional method, resulting in an average precision of 86.7%.

  • PDF

Design of Document-HTML Generation Technique for Authorized Electronic Document Communication (공인전자문서 소통을 위한 Document-HTML 문서 생성 기법의 설계)

  • Hwang, Hyun-Cheon;Kim, Woo-Je
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.44 no.1
    • /
    • pp.51-59
    • /
    • 2021
  • Electronic document communication based on a digital channel is becoming increasingly important with the advent of the paperless age. The electronic document based on PDF format does not provide a powerful customer experience for a mobile device user despite replacing a paper document by providing the content integrity and the independence of various devices and software. On the other hand, the electronic document based on HTML5 format has weakness in the content integrity as there is no HTML5 specification for the content integrity despite its enhanced customer experience such as a responsive web technology for a mobile device user. In this paper, we design the Document-HTML, which provides the content integrity and the powerful customer experience by declaring the HTML5 constraint rules and the extended tags to contain the digital signature based on PKI. We analyze the existing electronic document that has been used in the major financial enterprise to develop a sample. We also verify the Document-HTML by experimenting with the sample of HTML electronic communication documents and analyze the PKI equation. The Document-HTML document can be used as an authorized electronic document communication and provide a powerful customer experience in the mobile environment between an enterprise and a user in the future.

XML Representation of a Sheet Music for Chorus (합창곡 악보의 XML 표현)

  • 김정훈;김선호;채진석
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10a
    • /
    • pp.72-74
    • /
    • 1999
  • XML은 HTML의 단순성과 SGML의 복잡성을 동시에 극복하기 위한 노력으로 시작되어 HTML이나 SGML과는 다른 새로운 세계를 만들어 내고 있으며, 인터넷 문서 표현과 관련된 여러 분야에서 활발하게 연구되고 있다. 이 논문에서는 차세대 인터넷 문서 표현 언어로 주목받고 있는 XML을 이용하여 합창곡의 악보를 표현하는 기법을 제시한다. 이 논문에서는 합창곡 악보를 표현하기 위해 정의된 새로운 마크업 언어인 ScoreML(Score Markup Language)을 소개하고, ScoreML로 작성된 XML 문서를 웹에서 볼 수 있도록 ScoreML 브라우저의 설계 및 구현에 대해 기술한다. ScoreML을 사용하여 작성된 XML 문서는 악보 표현뿐만 아니라 음악 데이터의 저장과 검색에도 사용될 수 있다.

  • PDF

Analysis and Implementation of a Web Document Converter for Wireless Internet Use XHTML On Mobile Communication Environment (이동통신환경에서 XHTML을 이용한 무선인터넷 문서변환기 분석 및 구현)

  • 백진영;이종옥;조성언;조경룡
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2001.10a
    • /
    • pp.105-108
    • /
    • 2001
  • This paper is purposed in design and implement of a device which can convert XHTML documents in web-Server into WML documents when users access the web by using portable devices. Users access XHTML(so-called HTML) web page and ask for informations, this document convertor recognizes of XHTML documents structures, reconstructs into simple WML documents by using