• Title/Summary/Keyword: 문서지

Search Result 2,040, Processing Time 0.025 seconds

A Clustering Technique using Common Structures of XML Documents (XML 문서의 공통 구조를 이용한 클러스터링 기법)

  • Hwang, Jeong-Hee;Ryu, Keun-Ho
    • Journal of KIISE:Databases
    • /
    • v.32 no.6
    • /
    • pp.650-661
    • /
    • 2005
  • As the Internet is growing, the use of XML which is a standard of semi-structured document is increasing. Therefore, there are on going works about integration and retrieval of XML documents. However, the basis of efficient integration and retrieval of documents is to cluster XML documents with similar structure. The conventional XML clustering approaches use the hierarchical clustering algorithm that produces the demanded number of clusters through repeated merge, but it have some problems that it is difficult to compute the similarity between XML documents and it costs much time to compare similarity repeatedly. In order to address this problem, we use clustering algorithm for transactional data that is scale for large size of data. In this paper we use common structures from XML documents that don't have DTD or schema. In order to use common structures of XML document, we extract representative structures by decomposing the structure from a tree model expressing the XML document, and we perform clustering with the extracted structure. Besides, we show efficiency of proposed method by comparing and analyzing with the previous method.

Design and Implementation of an Access Control System for XML Documents on the Web (웹에서의 XML 문서 접근 제어 시스템의 설계 및 구현)

  • Lee, Yong-Kyu
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.11S
    • /
    • pp.3623-3632
    • /
    • 2000
  • Until now the XML document is allowed users to access the whole content of it However, for some applications such as those in the field of electronic commerce, there are cases that the whole content should not be delivered. Therefore, access authorization is required for XML documents in order to protect illegal accesses to some critical parts of them. In this paper. we design and implement a system which authorizes users to XML documents and controls access to them based on the access rights. We set the user group as a basic unit of the authorization subject and the element of an XML document as a basic unit of authorization object The owner of a document authorize; user groups to access the elements of it When an XML document is accessed, the access rights of the requester are checked using an access control list and only the authorized parts are delivered_ As the result, we can authorize XML documents, which has been previously impossible.

  • PDF

A Method of Selective Crawling for Web Document Using URL Pattern (URL 패턴을 이용한 웹문서의 선택적 자동수집 방안)

  • Jeong, Jun-Yeong;Jang, Mun-Su
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.41-44
    • /
    • 2007
  • 특정 분야별로 구축되는 온톨로지에 관하여 그 언스턴스를 쉽고 빠르게 구축하기 위해서는 구조화된 문서를 이용하는 것이 효율적이다. 그러나, 일반적인 웹 문서는 모든 분야에 대하여 다양한 형식으로 표현되어 존재하기 때문에, 대상이 되는 구조 문서를 자동으로 수집하기는 쉽지 않다. 본 논문에서는 웹사이트의 URL 패턴을 XML 기반의 스크립트로 정의하여, 필요한 웹 문서만을 지능적으로 수집하는 방안을 제안한다. 제안하는 수집 방안은 구조화된 형태로 정보를 제공하는 사이트에 대해서 매우 빠르고 효율적으로 적용될 수 있다. 본 논문에서는 제안하는 방법을 적용하여 5만개 이상의 웹 문서를 수집하였다.

  • PDF

Design and Implementation of XML Document Presentation System applying XSL-fo (XSL-fo를 적용한 XML 문서표현 시스템의 설계 및 구조)

  • Kim, Jin-Su;Gang, Chi-Won;Ryu, Geun-Ho;Jeong, Hoe-Gyeong
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.3
    • /
    • pp.229-239
    • /
    • 2001
  • 본 논문은 XML 문서의 내용 및 구조 정보를 XSL 스타일시트(stylesheet)의 포맷팅(formatting) 정보를 적용하여 표현하는 포매팅 시스템의 설계 및 구현에 관한 것이다. 본 시스템은 XML 문서를 XSLT(XSL Transformations) 및 Xpath(XML Path Language)를 이용하여 문서를 변환하고, XSL-fo(XSL Formatting Objects)를 적용하여 포맷팅을 지정하는 XML 문서 표현 시스템을 설계 및 구현하였다. 이 XML 문서 표현 시스템은 웹 표준화 기구인 W3C에서 제안하는 XSL 포매팅 처리에 대한 구성을 기반으로 구현함으로써 표준화에 입각한 처리시스템으로써 변화에 능동적으로 대처 가증하고 모듈화 되어 있어 부분적인 수정 및 대체가 가능하도록 설계하였다. 본 시스템은 IBM 호환 PC에서 동작하며, 운영체제는 Windows 2000 환경에서 Visual C++6.0을 사용하여 개발하였다.

  • PDF

Design and Implementation of XML Documents Storage System using UNISQL/X (UNISQL/X를 이용한XML 문서 저장 시스템 설계 및 구현)

  • 안병태;김현아
    • Journal of the Korea Society of Computer and Information
    • /
    • v.6 no.1
    • /
    • pp.38-44
    • /
    • 2001
  • Nowadays, the study on XML as a standard method of exchange of information is active as Internet technology develops. This thesis designs and implements XML documents storage system using uniSQL/X in object-relational database. The system takes advantage of the merits of object-oriented databases, so that the structural information of XML documents can be effectively represented. The system uses split-storage method to allow frequent modifications of XML documents and suggests DTD-independent model so that it can store XML documents without DTD. And retrieval speed is improved by solving the issue of data duplication.

  • PDF

A Method of XML Mapping Canonicalization for E-Business Integration (전자상거래 통합을 위한 XML 매핑 정형화 기법)

  • 안우영;홍창범
    • Journal of the Korea Society of Computer and Information
    • /
    • v.9 no.1
    • /
    • pp.1-8
    • /
    • 2004
  • XML is becoming the standard of the new document exchanging. Due to the ablility expressing various types of document structure through XML, RosettaNet and BizTalk are using XML as a core technology in the part of e-Business. Framework is running Business process each other different standard. Internal documents in each company should be transformed differently without any loss to work with other companies. In this paper, transforming Processor based on XML mapping information from XML document information.

  • PDF

Ranking Decision Method of Retrieved Documents Using User Profile from Searching Engine (검색 엔진에서 사용자 프로파일을 이용한 문서 순위결정 방법)

  • Kim Yong-Ho;Kim Hyeong-Gyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.9
    • /
    • pp.1590-1595
    • /
    • 2006
  • This paper proposes a technique of user oriented document ranking using user refile to provide more satisfied results which reflect preference of specific users. User profile is constructed to represent his or her preference. User pfofile consists of 'term array' and 'preference vector' according to the interest field of one. And the User profile for a particular person is updated by 'user access', 'latent relaeon', 'User Profile' proposed in this paper. The latent structures of documents in same domain are analysed by singular value decomposition(SVD). Then, the rank of documents is determined by comparison of user profile with analyzed document on the basis of relevance.

A Study on Dynamic Formatting Method (동적 포맷팅 방식에 관한 연구)

  • 임광택;이수연
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.5
    • /
    • pp.730-738
    • /
    • 1993
  • This paper proposes a dynamic formatting method for processing large amounts of document in a device independent manner. And it is very useful for cross-referencing among pages in a single document and for presenting multiple pages simultaneously. The method can be applied usefully to hypertext's application such as establishing a link and a cross-reference among pages in a multiple document. We implemented an electronic publishing system of WYSIWYG type using X window system and Motif graphical user interface.

  • PDF

(Design and Implementation of DTD Authoring Tools for XML Documents) (XML 문서를 위한 DTD 저작 도구의 설계 및 구현)

  • 김현주
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.8
    • /
    • pp.1093-1104
    • /
    • 2002
  • XML is a markup language which has been accepted in various fields such as digital libraries, electronic commerce, and web applications. Research for creation, storage, management, and retrieval of XML documents is essential to develope XML application systems. This paper presents design and implementation details of powerful and convenient DTD authoring tools for XML documents. The design principles are authoring convenience, semi-automatic creation of valid and reliable document DTD by systematic guidance to reduce the possibility of syntax errors, and visualization of document structures.

  • PDF

An Efficient Index Structure Supporting Structure Queries for Video Documents (비디오 문서의 구조 질의를 위한 효율적 인덱스 구조)

  • Lee, Yong-Kyu
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.5
    • /
    • pp.1109-1118
    • /
    • 1998
  • Recently, much attention has been focused on video databases. Video documents also have a hierarchical logical structure like text documents. By exploiting this structure using structure queries, users can obtain greater benefits than by using only content queries. In order to process structure queries efficiently, an index structure supporting fast video element access must be provided. However, there has been little attention to the index structure for video documents. In this paper, we present a tree-structured video document model and a new inverted index structure for video documents. We evaluate the storage requirement and the disk access time of the scheme and present the analytical results.

  • PDF