• Title/Summary/Keyword: Document-HTML

Design of Document-HTML Generation Technique for Authorized Electronic Document Communication (공인전자문서 소통을 위한 Document-HTML 문서 생성 기법의 설계)

  • Hwang, Hyun-Cheon;Kim, Woo-Je
    • Journal of Korean Society of Industrial and Systems Engineering / v.44 no.1 / pp.51-59 / 2021
  • Electronic document communication over digital channels is becoming increasingly important with the advent of the paperless age. A PDF-based electronic document can replace a paper document because it provides content integrity and independence from particular devices and software, but it does not offer a good customer experience for mobile device users. An HTML5-based electronic document, on the other hand, offers an enhanced customer experience for mobile users through features such as responsive web technology, but it is weak in content integrity because no HTML5 specification addresses it. In this paper, we design Document-HTML, which provides both content integrity and a strong customer experience by declaring HTML5 constraint rules and extended tags that contain a PKI-based digital signature. We analyze existing electronic documents used in a major financial enterprise to develop a sample. We also verify Document-HTML by experimenting with sample HTML electronic communication documents and analyzing the PKI equation. In the future, a Document-HTML document can serve as an authorized electronic document for communication between an enterprise and a user and provide a strong customer experience in the mobile environment.

Design and Implementation of the Document HTML System for Preserving Content Integrity

  • Hyun Cheon Hwang;Ji Su Park;Jin Gon Shon
    • Journal of Information Processing Systems / v.19 no.3 / pp.334-346 / 2023
  • PDF-based electronic documents have been widely used in communication between an enterprise and its customers to deliver personalized content. However, PDF documents laid out like paper are not suitable for mobile environments because of their low readability and lack of interactivity. Even though HTML is an essential language in a mobile environment, PDF-based electronic documents are still used because they support content integrity verification with a digital signature. This means that users sacrifice the mobile user experience for content integrity and keep using paper-layout electronic documents. In this research, we design the Document HTML specification by defining Document HTML conformance, adding extended meta tags, and signing the message digest with a digital signature based on public key infrastructure (PKI). Furthermore, we implement the Document HTML system, which provides REST API services to generate and verify Document HTML, and verify the theory experimentally. As a result, we confirm that Document HTML offers both content integrity and a good user experience on mobile. Document HTML is therefore expected to be an alternative to paper-layout electronic documents such as PDF for delivering personalized content from an enterprise to a customer in a mobile environment.
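
The digest-in-meta-tag idea described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' specification: the meta tag name `x-content-digest`, the choice of SHA-256, and the decision to hash only the text between `<body>` and `</body>` are all assumptions made for illustration, and the PKI signing step is left out because it would need a third-party cryptography library. The sketch only shows how an embedded digest lets a verifier detect a change to the content.

```python
import hashlib
import re

# Hypothetical meta tag name; the real Document HTML tag names are not given in the abstract.
META_NAME = "x-content-digest"


def body_digest(html):
    """Return the SHA-256 hex digest of everything between <body> and </body>."""
    match = re.search(r"<body[^>]*>(.*)</body>", html, re.DOTALL | re.IGNORECASE)
    if match is None:
        raise ValueError("no <body> element found")
    return hashlib.sha256(match.group(1).encode("utf-8")).hexdigest()


def embed_digest(html):
    """Insert a meta tag carrying the body digest just before </head>."""
    meta = f'<meta name="{META_NAME}" content="{body_digest(html)}">'
    return html.replace("</head>", meta + "</head>", 1)


def verify_digest(html):
    """Recompute the body digest and compare it with the embedded one."""
    pattern = rf'<meta name="{re.escape(META_NAME)}" content="([0-9a-f]+)">'
    match = re.search(pattern, html)
    return match is not None and match.group(1) == body_digest(html)


doc = "<html><head><title>t</title></head><body><p>Balance: 100</p></body></html>"
signed = embed_digest(doc)
tampered = signed.replace("Balance: 100", "Balance: 900")
print(verify_digest(signed), verify_digest(tampered))
```

In a scheme like the one the abstract describes, the digest itself would additionally be signed with the issuer's private key, so that an attacker could not simply recompute and replace it.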

Design and Implementation of the HTML-WML Converter (무선 인터넷을 위한 HTML-WML 변환기 설계 및 구현)

  • 민영수;강형일;유재수
    • Journal of Internet Computing and Services / v.2 no.2 / pp.37-50 / 2001
  • To access the large and varied body of HTML documents on the web from wireless Internet devices, a WML document equivalent to each HTML document must be written. When the web documents written in HTML are numerous, constructing a WML site with the same information costs a great deal of space and time. This paper designs and implements an HTML-WML converter that alleviates this problem. The HTML-WML converter translates a web document written in HTML into a WML document for portable wireless devices. Its advantages are that it reconstructs the WML document dynamically according to the portable wireless device and that it processes various image formats such as GIF, JPG, and BMP. The HTML-WML converter can be used not only as a utility of a WML editor but also as a real-time converter on the wireless Internet.

HTML Tag Depth Embedding: An Input Embedding Method of the BERT Model for Improving Web Document Reading Comprehension Performance (HTML 태그 깊이 임베딩: 웹 문서 기계 독해 성능 개선을 위한 BERT 모델의 입력 임베딩 기법)

  • Mok, Jin-Wang;Jang, Hyun Jae;Lee, Hyun-Seob
    • Journal of Internet of Things and Convergence / v.8 no.5 / pp.17-25 / 2022
  • Recently, a massive amount of data has been generated as the number of edge devices increases. In particular, the number of raw, unstructured HTML documents has increased. Therefore, MRC (Machine Reading Comprehension), in which a natural language processing model finds the important information within an HTML document, is becoming more important. In this paper, we propose HTDE (HTML Tag Depth Embedding), a method that allows BERT to learn the depth of the HTML document structure. HTDE builds a tag stack from the HTML document for each BERT input token and then extracts the depth information. We then add an HTML embedding layer, which takes the depth of each token as input, to BERT's input embedding step. Since tokenization using HTDE identifies the HTML document structure through the relationships among surrounding tokens, HTDE improves the accuracy of BERT on HTML documents. Finally, we demonstrate that the proposed method achieves higher accuracy than BERT's conventional embedding.
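
The tag-stack step described in this abstract can be sketched in Python with the standard-library `html.parser` module. This is only an illustration under stated assumptions, not the authors' implementation: whitespace splitting stands in for BERT's WordPiece tokenizer, the names `TagDepthParser` and `tag_depths` are hypothetical, and the embedding layer that would consume the depths is omitted.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagDepthParser(HTMLParser):
    """Records each text token together with the number of open tags around it."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.tokens = []  # (token, depth) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; ignore stray closing tags.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        depth = len(self.stack)
        # Assumption: whitespace tokens stand in for WordPiece tokens.
        for token in data.split():
            self.tokens.append((token, depth))


def tag_depths(html):
    parser = TagDepthParser()
    parser.feed(html)
    parser.close()
    return parser.tokens


sample = "<html><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
print(tag_depths(sample))
# [('Title', 3), ('Hello', 3), ('world', 4)]
```

In a model along the lines the abstract describes, each depth value would index a learned embedding table whose output is added to the token's other input embeddings.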

The Design and Implementation of HTML Document Integrity Management System (HTML 문서의 무결성 유지 시스템의 설계 및 구현)

  • 조이기;이영운;황인문;양수영;김원중
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2002.05a / pp.380-383 / 2002
  • It is difficult to manage broken links, such as dangling references and inaccurate references, in a manually maintained site that consists of a large number of HTML documents, such as the KLDP (Korean Linux Documentation Project, http://kldp.org) web site. In this paper, we define the relationships and constraint conditions that exist between a web site's HTML documents. We then design and implement HIMS (HTML Document Integrity Management System), which notifies the user when an integrity violation occurs or launches a trigger operation to maintain integrity between HTML documents on insert, delete, and update.
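
The kind of dangling-reference check that such a system performs can be sketched in Python as follows. This is a simplified illustration, not the HIMS design: the site is modeled as an in-memory dictionary from document name to HTML text, only `<a href>` links are examined, and the names `LinkCollector` and `find_dangling_links` are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> element in a document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def find_dangling_links(site):
    """Return (source, target) pairs whose target document is missing from the site.

    `site` maps a document name to its HTML text (an assumption for this sketch).
    """
    dangling = []
    for name, html in site.items():
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            target = href.split("#", 1)[0]  # drop any fragment identifier
            if target == "" or "://" in target:
                continue  # same-page anchor or external link: out of scope here
            if target not in site:
                dangling.append((name, target))
    return dangling


site = {
    "index.html": '<a href="a.html">A</a> <a href="b.html#x">B</a>',
    "a.html": '<a href="index.html">home</a> <a href="http://kldp.org">KLDP</a>',
}
print(find_dangling_links(site))
# [('index.html', 'b.html')]
```

Running a check like this after every insert, delete, or update of a document corresponds to the notification case in the abstract; a trigger operation would go further and repair or remove the offending link.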

A Monitoring Agent System for the Comparative Information Search in Real Time Auction (실시간 경매정보 비교 검색을 위한 모니터링 에이전트 시스템)

  • 김철기;이상용
    • Journal of Internet Computing and Services / v.2 no.2 / pp.1-9 / 2001

Adaptive Conversion of Web Content for Mobile Terminals (이동단말을 위한 적응적 웹 문서 변환)

  • Kang, Sueng-Chun;Chung, Kwang-Sue
    • Journal of KIISE:Computing Practices and Letters / v.6 no.6 / pp.635-642 / 2000
  • In this paper, we propose an efficient document conversion mechanism that provides adaptive web documents to mobile terminals. We also propose RHTML (Reduced HTML) to achieve adaptive tag reduction. The markup error correction process in the proposed adaptive document conversion mechanism converts an HTML (HyperText Markup Language) document into an XML (Extensible Markup Language) application document. This process makes the web document easy to handle with the DOM (Document Object Model) as a tree model and removes hardware overhead in mobile terminals. In addition, the tag reduction process provides the adaptive web document using the three DTDs (Document Type Definitions) in RHTML.

Automatically Converting HTML Documents with Similar Pattern into XML Documents (유사 패턴을 갖는 HTML 문서의 XML 자동 변환)

  • O, Geum-Yong;Hwang, In-Jun
    • The KIPS Transactions:PartD / v.9D no.3 / pp.355-364 / 2002
  • Recently, the WWW (World Wide Web) has become a source of a large amount of information and is now recognized not only as an information-sharing tool but also as an information repository. Currently, the majority of documents on the web are written in HTML (Hypertext Markup Language). Although HTML is simple and easy to learn, its inherent inability to describe document structure makes it difficult to retrieve information effectively. One possible solution is to convert such HTML documents into XML (Extensible Markup Language) documents. XML is a standard markup language for exchanging data on the web, and it can describe a document structure freely by defining its own DTD (Document Type Definition). This makes it possible to integrate, store, and retrieve data on the web efficiently. In this paper, we propose a converter that automatically converts HTML documents with similar patterns into XML documents by analyzing the document structure and recognizing its path information.

XSLT document editing for XML document conversion in WYSIWYG environment (WYSIWYG 환경에서 XML 문서 변환을 위한 XSLT 문서편집 시스템)

  • 차원준;박주상;이용준;정회경
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2003.10a / pp.500-503 / 2003
  • XML, which is used extensively as a standard for data exchange on the Internet, is regarded as a technology that can replace existing document markup languages such as HTML. The most notable characteristic of XML is that the logical information expressing the structural content of a document is separated from the physical information expressing its style. Accordingly, the W3C recommended XSL, which offers styling functions similar to those of HTML, for styling and transforming XML data. XSL's transformation function also converts an XML document to other data formats and can describe style information through conversion to various document formats. However, XML document conversion technology based on XSLT is not yet well established domestically, and the need for a solution that can edit XSLT documents efficiently is growing. This paper presents an XSLT document editing system that makes XML documents convertible and outputtable in various document formats and that can edit and create XSLT documents efficiently in a WYSIWYG environment.

Web Information Retrieval Exploiting Markup Pattern (마크업 패턴을 이용한 웹 검색)

  • Kim, Min-Soo;Kim, Min-Koo
    • Journal of KIISE:Computing Practices and Letters / v.13 no.6 / pp.407-411 / 2007
  • Over the years, great attention has been paid to the question of exploiting the inherent semantics of HTML in web document retrieval. Although HTML is mainly presentation oriented, HTML tags implicitly contain useful semantics that can capture the meaning of text. Focusing on this idea, in this paper we define the 'markup pattern' and try to improve the performance of web document retrieval using markup patterns. A markup pattern reflects the intent of the web document publisher and the internal semantics of the text in a web document. To discover the markup pattern and exploit it, we suggest a new scheme for extracting concepts and weighting documents. For evaluation, we select two domains, the BBC and CNN web sites, and use their search engines to gather domain documents. We re-weight and re-score the documents using the proposed scheme and show the performance improvement in the two domains.