• Title/Summary/Keyword: Documents

Search Result 5,362, Processing Time 0.033 seconds

The Schema Extraction Method using the frequency of Label Path in XML documents (XML 문서에서의 레이블 경로 발생 빈도수에 따른 스키마 추출 방법)

  • 김성림;윤용익
    • Journal of Internet Computing and Services
    • /
    • v.2 no.4
    • /
    • pp.11-24
    • /
    • 2001
  • XML documents found over internet are generally fairly irregular and hove no fixed schema, The SQL and OQL are not suitable for query processing in XML documents, So, there are many researches about schema extraction and query language for XML documents, We propose a schema extraction method using the frequency of label path in XML documents, Our proposed method produces multi-level schemas and those are useful for query processing.

  • PDF

Syntax-Directed Document Editor based XML DTD (XML DTD 기반의 구문지향 문서 작성기)

  • Kim, Young-Chul;Kim, Sung-Keun;Choi, Jong-Myung
    • The Journal of Korean Association of Computer Education
    • /
    • v.7 no.4
    • /
    • pp.67-75
    • /
    • 2004
  • XML is being accepted as a standard for the next generation web documents, as it enables to extend the document structures. However, general users have difficulties in writing valid and well-formed XML documents, since the documents should satisfy the grammatical constraints of XML. In this paper, we present a syntax-directed XML document editor which will ease users in writing valid XML documents. The editor will help users, and increase productivity in writing XML documents.

  • PDF

Document Classification Model Using Web Documents for Balancing Training Corpus Size per Category

  • Park, So-Young;Chang, Juno;Kihl, Taesuk
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.4
    • /
    • pp.268-273
    • /
    • 2013
  • In this paper, we propose a document classification model using Web documents as a part of the training corpus in order to resolve the imbalance of the training corpus size per category. For the purpose of retrieving the Web documents closely related to each category, the proposed document classification model calculates the matching score between word features and each category, and generates a Web search query by combining the higher-ranked word features and the category title. Then, the proposed document classification model sends each combined query to the open application programming interface of the Web search engine, and receives the snippet results retrieved from the Web search engine. Finally, the proposed document classification model adds these snippet results as Web documents to the training corpus. Experimental results show that the method that considers the balance of the training corpus size per category exhibits better performance in some categories with small training sets.

A Review on Needs and Related Law of Authorized Electronic Data Depot for the Diffusion in the Use of Electronic Document (전자문서 이용확산을 위한 공인 전자문서보관소의 도입 필요성과 입법내용)

  • Kim, Sun-Kwang
    • International Commerce and Information Review
    • /
    • v.6 no.1
    • /
    • pp.191-210
    • /
    • 2004
  • There are increasing needs for an individual or enterprise to interchange documents electronically through communication network to enhance the efficiency of business, owing to rapid process of transactions. But e-commerce encounters the problems regarding the handling the electronic documents, that is to say, deposit and proof of the electronic documents. This paper deals with Authorized Electronic Data Depot as an integrated system for processing, relaying and proving documents that. Authorized Electronic Data Depot operates as e-enabler in exchanging documents in trust among administrative agencies and a comprehensive government directory, digital government seal certification system and DNS system. Authorized Electronic Data Depot leads public and private sectors to save the paper-related costs. But the regulations concerning an authorized electronic data depot is introduced in the course of revision of Electronic Transactions Act. The purpose of this paper is to suggest some guidelines in legalizing the authorized electronic data depot.

  • PDF

Design and Implementation of an XML Document Management System Based on $O_2$ ($O_2$기반의 XML 문서관리 시스템 설계 및 구현)

  • 유재수
    • The Journal of Information Technology and Database
    • /
    • v.7 no.1
    • /
    • pp.27-39
    • /
    • 2000
  • In this paper, we design and implement a XML management system based on OODBMS that supports structured information retrieval of XML documents. We also propose an object oriented modeling to store and fetch XML documents, to manage image data, and to support versioning for the XML document management system(XMS). The XMS consists of a repository manager that maintains the interfaces for external application programs, a XML instance storage manager that stores XML documents in the database, a XML instance manager that fetches XML documents stored in the database, a XML index manager that creates index for the structure information and the contents of documents, and a query processor that processes various queries.

  • PDF

A Schema Extraction Method using Elements Information in XML Documents (XML 문서에서의 엘리먼트 정보를 이용한 스키마 추출방법)

  • Kim, Seong-Rim;Yun, Yong-Ik
    • The KIPS Transactions:PartD
    • /
    • v.9D no.3
    • /
    • pp.381-388
    • /
    • 2002
  • XML documents, which are becoming new standard for expressing and exchanging data in the Internet, don't have defined schema. It is not adequate to directly apply XML documents to the existing SQL or OQL. Research on how to extract Schema for XML documents and query language is going on actively. For users' query, the results could be too tony or too less. It Is important to give the users adequate results. This paper suggests the way to extract many levelized schema according to the frequency of element occurrence in XML documents. The Schema can be reduced or extended to correspond to the users' query more flexibly.

Query Processing based Branch Node Stream for XML Message Broker

  • Ko, Hye-Kyeong
    • International journal of advanced smart convergence
    • /
    • v.10 no.2
    • /
    • pp.64-72
    • /
    • 2021
  • XML message brokers have a lot of importance because XML has become a practical standard for data exchange in many applications. Message brokers covered in this document store many users. This paper is a study of the processing of twig pattern queries in XML documents using branching node streams in XML message broker structures. This work is about query processing in XML documents, especially for query processing with XML twig patterns in the XML message broker structure and proposed a method to reduce query processing time when parsing documents with XML twig patterns by processing information. In this paper, the twig pattern query processing method of documents using the branching node stream removes the twigging value of the branch node that does not include the labeling value of the branch node stream when it receives a twig query from the client. In this paper, the leaf node discovery time can be reduced by reducing the navigation time of nodes in XML documents that are matched to leaf nodes in twig queries for client twig queries. Overall, the overall processing time to respond to queries is reduced, allowing for rapid question-answer processing.

An Efficient Search Method For XML document

  • Qian, Xie;Cho, Dong-Sub
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.1287-1290
    • /
    • 2011
  • Because of the rapid development of internet, there are more and more documents stored by the XML-based format. When there is a great deal of XML documents, how to get the valuable Information is an important subject. This paper proposes an effective XML document search method to search text contents and structures of XML documents. We build the keyword matrix of text contexts and structure matrixes of structures in XML documents to improve the efficiency of query time. When there is a great deal of XML documents, the search method we propose can improve much efficiency of query time.

XED: Model-based XML Editor Generator for Data-Centric XML Documents (XED: 데이타 중심 XML문서를 위한 모델 기반의 XML 편집기 생성 도구)

  • 최종명;유재우
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.10
    • /
    • pp.894-903
    • /
    • 2003
  • Though XML is widely used, it is still hard for end users to write XML documents. A lot of XML documents are data-centric documents which have formal data format. Even novices can easily write the data-centric XML documents if they use form-based GUIs. In this paper, we introduce a new method for generating form-based XML editor for data-centric XML documents automatically and an XML editor generator called XED. The DTD consists of sequence, choice, and repetition, and this structure can be represented with Document Decomposition Graph(DDG). XED allows users to generate an XML editor by applying the presentation rules to DDG. It also permits users to modify generated editor through changing editor`s GUI properties with direct manipulation.

Design and Implementation of an XML Editor based on WYSIWYG (WYSIWYG 기반의 XML 편집기 설계 및 구현)

  • Son Chung-Beom;Yoo Jae-Soo
    • Journal of Internet Computing and Services
    • /
    • v.4 no.2
    • /
    • pp.47-60
    • /
    • 2003
  • As XML has been used as a markup language for standard documents in various areas, it requires an XML editing tool which writes XML documents easily and presents the style of documents. Therefore, many companies have developed XML editing tools and have released the products. However, it is hard for beginners to write XML documents using such tools. In this paper, we design and implement an XML editor based on WYSIWYG(What You See Is What You Get) that beginners as well as exports can use easily. Our XML editor allows beginners to write easily XML documents and style documents. We also integrate our XML editor with the XML repository system developed in our laboratory.

  • PDF