• Title/Summary/Keyword: XML Structure

Search Result 499, Processing Time 0.024 seconds

Design and Implementation of BADA-IV/XML Query Processor Supporting Efficient Structure Querying (효율적 구조 질의를 지원하는 바다-IV/XML 질의처리기의 설계 및 구현)

  • 이명철;김상균;손덕주;김명준;이규철
    • The Journal of Information Technology and Database
    • /
    • v.7 no.2
    • /
    • pp.17-32
    • /
    • 2000
  • As XML emerging as the Internet electronic document language standard of the next generation, the number of XML documents which contain vast amount of Information is increasing substantially through the transformation of existing documents to XML documents or the appearance of new XML documents. Consequently, XML document retrieval system becomes extremely essential for searching through a large quantity of XML documents that are storied in and managed by DBMS. In this paper we describe the design and implementation of BADA-IV/XML query processor that supports content-based, structure-based and attribute-based retrieval. We design XML query language based upon XQL (XML Query Language) of W3C and tightly-coupled with OQL (a query language for object-oriented database). XML document is stored and maintained in BADA-IV, which is an object-oriented database management system developed by ETRI (Electronics and Telecommunications Research Institute) The storage data model is based on DOM (Document Object Model), therefore the retrieval of XML documents is executed basically using DOM tree traversal. We improve the search performance using Node ID which represents node's hierarchy information in an XML document. Assuming that DOW tree is a complete k-ary tree, we show that Node ID technique is superior to DOM tree traversal from the viewpoint of node fetch counts.

  • PDF

An Index Mechanism and Structure Information for Efficient Retrieval of XML DTD (XML DTD의 효율적인 검색을 위한 구조 정보 및 인덱스 메카니즘)

  • 김영란
    • Journal of the Korea Society of Computer and Information
    • /
    • v.8 no.3
    • /
    • pp.80-86
    • /
    • 2003
  • XML is being watched with keen interest for the communication and saving of information. Information represented in XML provides more accuracy and a higher-speed of reference after the process of being implication. But, it is difficult that XML document is exchanged or shared in different area such as electronic commerce or digital library. Because, XML document is being different in syntax but similar in logic, with using structured difference analysis. In this thesis, we converted object-oriented class diagram to XML DTD and designed an index mechanism based on the structure information for the converted XML DTD. With our methods, we could effectively and lastly retrieve the specific element and respect to usefully access element by simple operations.

  • PDF

An Incremental Clustering Technique of XML Documents using Cluster Histograms (클러스터의 히스토그램을 이용한 XML 문서의 점진적 클러스터링 기법)

  • Hwang, Jeong-Hee
    • Journal of KIISE:Databases
    • /
    • v.34 no.3
    • /
    • pp.261-269
    • /
    • 2007
  • As a basic research to integrate and to retrieve XML documents efficiently, this paper proposes a clustering method by structures of XML documents. We apply an algorithm processing the many transaction data to the clustering of XML documents, which is a quite different method from the previous algorithms measuring structure similarity. Our method performs the clustering of XML documents not only using the cluster histograms that represent the distribution of items in clusters but also considering the global cluster cohesion. We compare the proposed method with the existing techniques by performing experiments. Experiments show that our method not only creates good quality clusters but also improves the processing time.

Design and Implementation of Concept Information Based Universal DTD Generator (개념정보를 포함한 포괄적 DTD 생성기의 설계 및 구현)

  • 최인석;공용해
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.7
    • /
    • pp.831-836
    • /
    • 2002
  • There are various information resources on the Internet and people are taking more interest in XML day by day. In XML, the structure of information can be freely defined so that the standardization of documents can be hardly made. If DTD, which is applied to an XML Document representing specific information, is including concept information, it can be freely applied to the structure of document and also contributes to the convenience in information retrieval. In this study, we developed universal DTD Generator in order to automatically generate DTD including concept information. For the generation of universal DTD, the conceptualization of information is required; to conceptualize information, the hierarchical structuring and propertizing are required. The hierarchical structuring represents the inclusive relation of routine concepts for representing information in hierarchical structure, and the propertizing represents the property and mutual relation that the each concept represented in hierarchical structure can have. The defined hierarchical structure and propertization come to generate the universal DTD Generator. The universal DTD generated by DTD Generator can be applied to all the XML Documents representing the same information in different structure. However, the most ideal way is that the information of universal DTD, which can be applied to various documents, is including all the cases. Therefore, the study for designing correct concept information is necessary.

  • PDF

Extracting Maximal Similar Paths between Two XML Documents using Sequential Pattern Mining (순차 패턴 마이닝을 사용한 두 XML 문서간 최대 유사 경로 추출)

  • 이정원;박승수
    • Journal of KIISE:Databases
    • /
    • v.31 no.5
    • /
    • pp.553-566
    • /
    • 2004
  • Some of the current main research areas involving techniques related to XML consist of storing XML documents, optimizing the query, and indexing. As such we may focus on the set of documents that are composed of various structures, but that are not shared with common structure such as the same DTD or XML Schema. In the case, it is essential to analyze structural similarities and differences among many documents. For example, when the documents from the Web or EDMS (Electronic Document Management System) are required to be merged or classified, it is very important to find the common structure for the process of handling documents. In this paper, we transformed sequential pattern mining algorithms(1) to extract maximal similar paths between two XML documents. Experiments with XML documents show that our transformed sequential pattern mining algorithms can exactly find common structures and maximal similar paths between them. For analyzing experimental results, similarity metrics based on maximal similar paths can exactly classify the types of XML documents.

Data Transformation through Mapping between XML and Relation Database (XML과 관계형 데이타베이스 매핑을 통한 자료의 변환)

  • Kim Gil-Choon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.9 no.4 s.32
    • /
    • pp.5-12
    • /
    • 2004
  • The data transformation between XML and Relation Database is made through the Principle of mapping bewtween them. There are two ways to access SQL Server, one is to assign SQL query to URL and the other is to use template file. MS-SQL server takes advantage of OpenXML function to transform the results of executing SQL query into XML documents. That is, OpenXML first makes node tree and then transforms row set data of XML documents into XML data of relation type. In order to insert XML data into database data. data is extracted from parsing XML documents using sp_xml_preparedocument procedure, and then the document structure is mapped into tree structure and stored in a table of database. Consequently, Data transformation between XML and Relation Database is made through mapping bewtween them. This article proposes the principle of mapping between XML and Relation Database and then shows the implementation of transformation between them so that it introduces the possibilty of bringing the extension and efficiency of data and various effects.

  • PDF

Design of Automatic Database Schema Generator Based on XML Schema (XML 스키마 기반의 데이터베이스 스키마 생성기 설계)

  • Lim, Jong-Seon;Kim, Kyung-Soo
    • Convergence Security Journal
    • /
    • v.7 no.3
    • /
    • pp.79-86
    • /
    • 2007
  • B2B e-business is an economic transaction formed between companies through various networks including internet. At present, e-business between companies partly applies information distribution between companies, but many enterprises expect that a corporate basic system will be gradually changed into XML basis if web service is earnestly materialized, so they are competing with each other in developing XML DBMS. Existing XML DBMS studies used XML DTD in order to represent the structure of XML document. Such XML DTD defines the expression of a simple type, so there are many difficulties in defining the structure of XML document. To cope with this, in this paper, the author will develop database schema generator utilizing relational database generally used in storing contents of data, on the basis of XML schema selected as a standard of W3C. Also, to store XML data, the author proposed the automatic conversion method of relational database schema that used XML schema.

  • PDF

XML Document Repository System for structured retrieval (구조 검색을 위한 XML 문서 저장 시스템)

  • 임산송;현득창;정회경
    • The Journal of Information Technology
    • /
    • v.4 no.4
    • /
    • pp.89-100
    • /
    • 2001
  • XML (extensible Markup Language) is selected and published as a representative standard of electronic documents by W3C (World Wide Web Consortium). The structured information can be created and also transferred in XML documents. By utilizing XML, you can express the meaningful information unit as a structure comparing existed file typed information. With structured information, you can also manage retrieve, and reposit documents. According to the above facts, in this paper, it is the purpose to design and implement XML documents repository system to reposit and retrieve using structured information of XML documents. As a model it was designed to be stored by element unit which is the basic unit of documents and was also designed to retrieve the stored XML information by structured unit. It was, especially, designed to manage and reposit the structure of various documents effectively through creating schema as to DTD(Document Type Definition) and instance.

  • PDF

A Transformation Military Databases based on the Relational Data model into XML Databases (관계형 데이터 모델 기반 군사용 데이터베이스의 XML 데이터베이스로의 변환)

  • Kim, Chang-Seok;Kim, Eong-Su
    • Journal of National Security and Military Science
    • /
    • s.1
    • /
    • pp.269-310
    • /
    • 2003
  • AS Extensible Markup Language(XML) is emerging as the data format of the Internet era, there are increasing needs to efficiently transform between database and XML documents. In this paper, we propose a schema transformation method from relational database to XML database. To transform the schema, we represent input schema as Entity-Relationship diagram. Entity-Relationship model translator scans the input Entity-Relationship diagram using BFS (breadth First Search) and translates the diagram into hierarchical structure model. The XML Schema generator produces XML Scema code using the transformed hierarchical structure model. The proposed method has a merit that having reusability facility of XML Schema property in comparison with existing researches.

  • PDF

Design and Implementation of a UDDI Service Middleware based on the ebXML Registry (ebXML레지스트리 기반의 UDDI 서비스 미들웨어 설계 및 구현)

  • 박재홍;김상균;이규철;김경일;김록원;송병열;조현규
    • Journal of KIISE:Databases
    • /
    • v.31 no.3
    • /
    • pp.307-319
    • /
    • 2004
  • Recently, ebXML and Web Services are emerging as the XML-based electronic business frameworks. To provide a set of registry services which users or applications can store and search the business information, ebXML and Web Services use ebXML registry and UDDI registry, respectively. They use the different registries each other, but there are something that similiar to the structure and functions of both of registries. In this paper, we analyze the similarity of ebXML Registry Information Model(RIM) and UDDI data structure, and construct the mapping information. So, we design and implement the UDDI Service Middleware which translates the request of UDDI registry service into the request of ebXML registry service using the algorithm. Consequently, we could use the ebXML registry like ebXML registry as well as UDDI registry without any changes in the ebXML registry itself so that we can not need to have both of registries for e-business.