• 제목/요약/키워드: Document

검색결과 4,925건 처리시간 0.031초

ODA에 근거한 문서 클래스 에디터 설계 및 구현 (Implementation and Design of Document Class Editor based on ODA)

  • 정회경;이수연
    • 한국통신학회논문지
    • /
    • 제17권12호
    • /
    • pp.1412-1422
    • /
    • 1992
  • 본 논문은 이 기종 문서처리 시스템간에 문서교환을 위해 국제 표준으로 재정된 ODA에 따른 문서 클래스(class) 에디터 설계 및 구현에 대하여 기술하였다. ODA에서처럼 문서구조를 공통 논리구조와 배치구조로 분리하여 처리하였으며, 문서 프로화일을 작성 할 수 있도록 설계하였다. 문서가 정확하게 작성되었는지를 객체(object) 단위로 확인할 수 있는 유틸리티(utility)를 구현하였다. 또한 그 문서의 ODIF 스트림(stream) 데이타가 정확한지를 확인하였다. 본 에디터는 국제 문서 응용 프로화일 (DAP : Document Application Profile)인 DAP 단계 2의 제안에 따라 설계하였으며, UNIX 운영체제의 SUN 워크스테이션상에서 이식성이 좋고 일관된 사용자 인터페이스(interface)를 제공하는 X 윈도우 및 Motif 환경하에서 구현하였다. 본 연구를 통하여 구현된 에디터는 특정 문서구조를 갖는 실제 ODA 문서를 작성시 이용될 수 있다.

  • PDF

Design and Implementation of the Document HTML System for Preserving Content Integrity

  • Hyun Cheon Hwang;Ji Su Park;Jin Gon Shon
    • Journal of Information Processing Systems
    • /
    • 제19권3호
    • /
    • pp.334-346
    • /
    • 2023
  • An electronic document based on PDF has been widely used in customer communication between an enterprise and a customer to deliver personalized content. However, electronic documents based on PDF in the form of paper layouts are not suitable for mobile environments because of low readability and lack of interactive interaction. Even though HTML is an essential language in a mobile environment, electronic document based on PDF is still used as it has a content integrity verification feature with a digital signature. It means that a user is sacrificing user experience in a mobile environment for content integrity and using paper-layout electronic documents. In this research, we design the Document HTML specification by setting the Document HTML conformance, adding the extended meta tags, and signing the message digest with a digital signature based on public key infrastructure (PKI). Furthermore, we implemented the Document HTML system, which has REST API services to generate and verify the Document HTML, and did experimental verification of the theory. As a result, we have confirmed that the Document HTML has both content integrity and user experience on mobile. Furthermore, the Document HTML is expected to be an alternative document format to deliver personalized content from an enterprise to a customer in a mobile environment instead of the paper layout electronic document such as PDF.

인터넷 원거리출판의 응용과 PDF의 인쇄활용에 관한 연구 (A Study on the online of PDF Electronic Documents System)

  • 유영수;강영립;김병현;이광수
    • 한국인쇄학회:학술대회논문집
    • /
    • 한국인쇄학회 2001년도 국제학술발표회
    • /
    • pp.63-77
    • /
    • 2001
  • PDF(Portable Document Format) is a file format that Adobe advances postscritp technique and use in managing document information or electric publishing(internet, CD-ROM, DVD). PDF is a devised document type for being able to read and print anywhere, independent of OS, printer type, resolution, and the kind of computer etc. Because this includes a compressing function, it transfers document through a small size of file in internet or intranet. In addition, that is a file format has various advantages-sharing of information and transfering documents in on line or off line environment. In this paper, we developed electronic document system using PDF format. Electronic document system consists of filter, automatic indexing, special searching system and web server. The information used in this paper is database made using Zwon\`s DocuCom. The filter recognizes various kinds of document structure. And according to property of document, it produces ASCII output. In addition to processing various formats of document, the filter can extract keywords in documents of MS WORD, Excel, Powerpoint, PDF, CAD etc. This filter uses the structure of window printer drive and can extract the information for text, page, font type and size from relevant document. The automatic indexing recognizes the formatted tag of document form ASCII text produced by filter and extracts adequate keyword to structure and property of document. PDF electronic document systems proposed in this paper can be used in Internet, PC communication. Users can choose and read electronic documents by two ways. First, users can choose and read relevant books using PDF electronic document homepage. Second, users can use PDF integrated-search system. User can search after inputing keyword and choose reference field and type of data. But, now, PDF products of Adobe can\`t support the Korean character. If this problem is resolved, we thick that PDF applications system looks active. Although there is limited function in case of using Zwon DocuCom used in this study, we think that there isn\`t a great deal of difficulty in electronic document and building digital database.

  • PDF

A Machine-Learning Based Approach for Extracting Logical Structure of a Styled Document

  • Kim, Tae-young;Kim, Suntae;Choi, Sangchul;Kim, Jeong-Ah;Choi, Jae-Young;Ko, Jong-Won;Lee, Jee-Huong;Cho, Youngwha
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권2호
    • /
    • pp.1043-1056
    • /
    • 2017
  • A styled document is a document that contains diverse decorating functions such as different font, colors, tables and images generally authored in a word processor (e.g., MS-WORD, Open Office). Compared to a plain-text document, a styled document enables a human to easily recognize a logical structure such as section, subsection and contents of a document. However, it is difficult for a computer to recognize the structure if a writer does not explicitly specify a type of an element by using the styling functions of a word processor. It is one of the obstacles to enhance document version management systems because they currently manage the document with a file as a unit, not the document elements as a management unit. This paper proposes a machine learning based approach to analyzing the logical structure of a styled document composing of sections, subsections and contents. We first suggest a feature vector for characterizing document elements from a styled document, composing of eight features such as font size, indentation and period, each of which is a frequently discovered item in a styled document. Then, we trained machine learning classifiers such as Random Forest and Support Vector Machine using the suggested feature vector. The trained classifiers are used to automatically identify logical structure of a styled document. Our experiment obtained 92.78% of precision and 94.02% of recall for analyzing the logical structure of 50 styled documents.

인자점수와 자기조직화지도를 이용한 희소한 문서데이터의 군집화 (Sparse Document Data Clustering Using Factor Score and Self Organizing Maps)

  • 전성해
    • 한국지능시스템학회논문지
    • /
    • 제22권2호
    • /
    • pp.205-211
    • /
    • 2012
  • 통계학과 기계학습의 다양한 기법을 이용하여 문서집합을 군집화하기 위해서는 우선 군집화분석에 적합한 데이터구조로 대상 문서집합을 변환해야 한다. 문서군집화를 위한 대표적인 구조가 문서-단어행렬이다. 각 문서에서 발생한 특정단어의 빈도값을 갖는 문서-단어행렬은 상당부분의 빈도값이 0인 희소성문제를 갖는다. 이 문제는 문서군집화의 성능에 직접적인 영향을 주어 군집화결과의 성능감소를 초래한다. 본 논문에서는 문서-단어행렬의 희소성문제를 해결하기 위하여 인자분석을 통한 인자점수를 이용하였다. 즉, 문서-단어행렬을 문서-인자점수행렬로 바꾸어 문서군집화의 입력데이터로 사용하였다. 대표적인 문서군집화 알고리즘인 자기조직화지도에 적용하여 문서-단어행렬과 문서-인자점수행렬에 대한 문서군집화의 결과들을 비교하였다.

탬플릿 기반 XML 문서 생성기의 설계 및 구현 (A Design and Implementation of XML Document Generator based on Template)

  • 염세훈;방혜자
    • 디지털산업정보학회논문지
    • /
    • 제8권4호
    • /
    • pp.73-81
    • /
    • 2012
  • Web development and Internet technology development bring many kinds of works to web. This is the main reason why XML, document standard is popular. XML in web can be used to express document template or standard. XML with java can be more powerful and general. For example, XML can be used to transmit data and to print data into the screen using Ajax in JSP(Java Server Page) and to make interfaces in android, which is useful to reduce development cycle. However, XML is not easy to learn for the novice. In this paper, we propose the easy and effective way to reduce the learning curve of XML and to make and use XML documents. For the purpose, we suggest template base XML document generation and we design and implement XML document generator based on Template. XML document generator of template-based provides user interface and layout of XML document. So, users can generate XML document easily and effectively.

정보보호 시스템의 CC기반 평가를 위한 문서 스키마 (Document Schema for the CC-based evaluation of information technology security system)

  • 김점구
    • 융합보안논문지
    • /
    • 제12권3호
    • /
    • pp.45-52
    • /
    • 2012
  • 정보보호시스템의 국제공통평가기준인 CC(또는 ISO/IEC 15408)에서는 평가용 문서(즉, 제출물)에 대한 세부지침을 포함하지 않고 있으므로, CC기반 평가체계를 구축하기 위해서는 문서 스키마(즉, 목차와 내용요구사항)를 개발해야 한다. 본 논문에서는 CC기반 평가체계에서 활용할 수 있는 문서 스키마를 개발하였다. CC내의 보증클래스로부터 Weakest precondition함수, 문서량 축소규칙, 문서 종속성 분석방법을 적용하여 문서스키마와 DTD를 개발하였다. 본 연구의 접근방법은 소프트웨어 품질의 평가체계에서 사용할 문서스키마 또는 DTD를 개발하는데 응용될 수 있다.

웹 기반의 전자상거래를 위한 도서검색 시스템 설계 (A Design of Book Retrieval System for Electronic Commerce in based Web)

  • 하추자;정종근;박종훈;김철원
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2005년도 춘계종합학술대회
    • /
    • pp.659-662
    • /
    • 2005
  • XML is standard of web document, and is used in language for document data exchange. XML document is used as example that change existing document to XML or makes new document by XML increases and XML search system to search XML document efficiently accordingly is requiring. This paper describes design and implementation of query processing system for translating XML elements and data between XML documents and relational database and consist of XML to DB processor, DB to XML processor and XML document management processor. Through this, described for design and embodiment of efficient XML document search system of JAVA base using XQL that is proposed in language of quality of XML document.

  • PDF

Web에서 데이터 흐름제어가 가능한 Mail Browser의 설계 및 구현 (Design and Implementation of a Mail Browser that can control Data-Flow on the Web)

  • 박규석;김성후
    • 한국정보처리학회논문지
    • /
    • 제6권10호
    • /
    • pp.2752-2763
    • /
    • 1999
  • On account of the text based mail system has it's limit to support multimedia applications, GUI based mail system platform was developed to control document flow and automatize information process. The existing mail systems's to transmit data must need additional functions to automate document flow control. The platform of document flow control is deeply related to EDMAS(Electronic document Management System), workflow, Electronic Banking, DMS(Document Management System) automation, so it needs an ability to control proper data and document correctly. To resolve this problems, we are need of browser and engine to design work flow and to control documents flow. In this paper, we develope a mail browser to design document flow by follow user's requirements. This system can generate executive script code for document flow, and we add the function of workflow and process management to automatize the document flow in this system, and then we implement this Data flow engine.

  • PDF

전자무역문서보관소(電子貿易文書保管所) 운영상(運營上)의 문제점(問題點)에 관한 연구(硏究) (A Study on the Operational Problems of e-Trade Document Repositary)

  • 안병수;임성철
    • 통상정보연구
    • /
    • 제8권1호
    • /
    • pp.125-141
    • /
    • 2006
  • It is no unnecessary to tell the importance of foreign trade in Korea economics. Nevertheless, government's direct support is impossible owing to WTO's regulation. Accordingly, government have brought focus into trade facilitation as paperless trade. e-Trade document repositary building by government's budget and private sector's cooperation is a part of e-Trade platform and necessary function in connection with relay and certification of e-Trade document. This study examined the estimated operational problems of e-Trade document repositary as compared Licensed Electronic Document Repositary. Firstly, the operator of e-Trade document repositary undertake multiple role and function as Licensed Certification Authorities(e-sign Act), Licensed Electronic Document Repositary(Framework Act on Electronic Transaction) etc. Secondly, sufficient levy that meet operating cost of the e-trade document is the key point of e-Trade document repositary's success, because additional budget invest in that operation is too hard to do. Thirdly, the operator of the e-Trade document repositary have to keep fairness, objectivity and transparency because the operational right is exclusive.

  • PDF