• Title/Summary/Keyword: XML warehouse

Search Result 11, Processing Time 0.027 seconds

Supporting XML Materialized Views Using Materialized Views of RDBMS (관계 DBMS의 실체뷰 기능을 이용한 XML 실체뷰 지원)

  • Kim, Seung-Hun;Kang, Hyun-Chul
    • The Journal of Society for e-Business Studies
    • /
    • v.11 no.4
    • /
    • pp.33-48
    • /
    • 2006
  • Since the emergence of XML as the standard for data exchange on the Web, XML warehousing technology is required to efficiently support Web business applications such as e-Commerce. When the RDBMS is employed as the storage for XML warehouse, XML materialized views of the XML warehouse could be provided by leveraging the materialized views of the RDBMS Because XML documents are mapped into relational tuples, an XML query defining an XML materialized view needs to be transformed into SQL. If relational materialized views were defined with the transformed SQL statements, the XML materialized view could be obtained just by XML-tagging the tuples of the corresponding relational materialized views. The foremost advantage of such a scheme is that the RDBMS does take care of XML materialized view consistency except XML tagging whenever their source XML documents are updated. In this paper, we proposed such a scheme of providing XML materialized views, and implemented it using a commercial RDBMS equipped with materialized view facility in Java on Windows 2000 Professional environment. XML documents in TPC-W, Web e-Commerce Benchmark, were used in performance experiments. The experimental results showed that our proposed scheme for XML materialized views was very effective.

  • PDF

Web Information Extraction and Multidimensional Analysis Using XML (XML을 이용한 웹 정보 추출 및 다차원 분석)

  • Park, Byung-Kwon
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.5
    • /
    • pp.567-578
    • /
    • 2008
  • For analyzing a huge amount of web pages available in the Internet, we need to extract the encoded information in web pages. In this paper, we propose a method to extract and convert web information from web pages into XML documents for multidimensional analysis. For extracting information from web pages, we propose two languages: one for describing web information extraction rules based on the object-oriented model, and another for describing regular expressions of HTML tag patterns to search for target information. For multidimensional analysis on XML documents, we propose a method for constructing an XML warehouse and various XML cubes from it like the way we do for relational data. Finally, we show the validness of our method through the application to US patent web pages.

  • PDF

A Multidimensional Analysis Framework for XML Warehouses (XML 웨어하우스에 대한 다차원 분석 프레임워크)

  • Park, Byung-Kwon;Lee, Jong-Hak
    • Asia pacific journal of information systems
    • /
    • v.15 no.4
    • /
    • pp.153-164
    • /
    • 2005
  • Nowadays, large amounts of XML documents are available in the Internet. Thus, we need to analyze them multidimensionally in the same way as relational data. In this paper, we propose a new framework for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where all fact and dimension data are stored as XML documents. We build XML cubes from XML warehouses. We propose a new OLAP language for XML cubes, which we call XML-MDX. XML-MDX statements target XML cubes and use XQuery expressions to designate measure, axis and slicer. They incorporate text mining operations for aggregating text data. We apply XML-OLAP to the United States patent XML warehouse to demonstrate multidimensional analysis of XML documents.

Multidimensional Analysis of XML Documents using XML Cubes (XML 큐브를 이용한 다차원 XML 문서 분석)

  • Park, Byung-Kwon
    • Proceedings of the Korea Association of Information Systems Conference
    • /
    • 2005.05a
    • /
    • pp.65-78
    • /
    • 2005
  • Nowadays, large amounts of XML documents are available on the Internet. Thus, we need to analyze them multi-dimensionally in the same way as relational data. In this paper, we propose a new frame-work for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where every fact data as well as dimension data are stored as XML documents. We build XML cubes from XML warehouses. We propose a new multidimensional expression language for XML cubes, which we call XML-MDX. XML-MDX statements target XML cubes and use XQuery expressions to designate the measure data. They specify text mining operators for aggregating text constituting the measure data. We evaluate XML-OLAP by applying it to a U.S. patent XML warehouse. We use XML-MDX queries, which demonstrate that XML-OLAP is effective for multi-dimensionally analyzing the U.S. patents.

  • PDF

Design of Formalized message exchanging method using XMDR (XMDR을 이용한 정형화된 메시지 교환 기법 설계)

  • Hwang, Chi-Gon;Jung, Kye-Dong;Choi, Young-Keun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.6
    • /
    • pp.1087-1094
    • /
    • 2008
  • Recently, XML has been widely used as a standard for a data exchange, and there has emerged the tendency that the size of XML document becomes larger. The data transfer can cause problems due to the increase in traffic, especially when a massive data such as Data Warehouse is being collected and analyzed. Therefore, an XMDR wrapper can solve this problem since it analyzes the tree structures of XML Schema, regenerates XML Schema using the analyzed tree structures, and sends it to each station with an XMDR Query. XML documents which are returned as an outcome encode XML tags according to XML Schema, and send standardized messages. As the formalized XML documents decrease network traffic and comprise XML class information, they are efficient for extraction, conversion, and alignment of data. In addition, they are efficient for the conversion process through XSLT, too, as they have standardized forms. In this paper we profuse a method in which XML Schema and XMDR_Query sent to each station are generated through XMDR(extended Meta-Data Registry) and the generation of products and XML conversion occur in each station wrapper.

An Approach for Integrated Modeling of Protein Data using a Fact Constellation Schema and a Tree based XML Model (Fact constellation 스키마와 트리 기반 XML 모델을 적용한 실험실 레벨의 단백질 데이터 통합 기법)

  • Park, Sung-Hee;Li, Rong-Hua;Ryu, Keun-Ho
    • The KIPS Transactions:PartD
    • /
    • v.11D no.3
    • /
    • pp.519-532
    • /
    • 2004
  • With the explosion of bioinformatics data such proteins and genes, biologists need a integrated system to analyze and organize large datasets that interact with heterogeneous types of biological data. In this paper, we propose a integration system based on a mediated data warehouse architecture using a XML model in order to combine protein related data at biology laboratories. A fact constellation model in this system is used at a common model for integration and an integrated schema it translated to a XML schema. In addition, to track source changes and provenance of data in an integrated database employ incremental update and management of sequence version. This paper shows modeling of integration for protein structures, sequences and classification of structures using the proposed system.

UML based Design of OLAP Meta Data Diagram Model (UML 기반 OLAP 메타 데이터의 다이어그램 모델 설계)

  • Kim Kyung-ju;Lee Yun-bae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2004.11a
    • /
    • pp.133-136
    • /
    • 2004
  • 데이터 웨어하우스(Data Warehouse : DW)는 데이터베이스에 저장되어 있는 데이터를 신속한 의사 결정 지원을 위해 최종 사용자가 여러 곳의 기업 내에 흩어져 있는 방대한 데이터를 손쉽고 빠르게 접근할 수 있도록 활용되고 있다. 현재 데이터 웨어하우스의 중요성이 부각되고 있는 가운데 온라인 분석 처리(On Line Analytical Processing : OLAP) 시스템이 데이터 웨어하우스 안에서 활용되고 발전되고 있다. 기존 연구에서는 서로 다른 OLAP 제품에서 공통으로 사용할 수 있는 모델을 적용하여 OLAP 메타데이터 교환 시스템을 설계해왔다. 그러나 본 논문에서는 서로 다른 OLAP 제품을 공통으로 사용할 수 있는 질의 언어 시스템 설계 전 단계인 논리적 설계를 UML snowflake 다이어그램을 이용하여 설계 하였다. 실험결과, XML 문서의 변환된 OLAP 메타 데이터를 이용하여 UML snowflake 다이어그램 설계를 통해 통합된 OLAP 제품의 XML 문서 구조가 논리적으로 설계되어 메타 데이터가 통합됨을 알 수가 있다.

  • PDF

Building the Data Mart on Antibiotic Usage for Infection Control (감염관리를 위한 항생제 사용량 데이터마트의 구축)

  • Rheem, Insoo
    • Korean Journal of Clinical Laboratory Science
    • /
    • v.48 no.4
    • /
    • pp.348-354
    • /
    • 2016
  • Data stored in hospital information systems has a great potential to improve adequacy assessment and quality management. Moreover, an establishment of a data warehouse has been known to improve quality management and to offer help to clinicians. This study constructed a data mart that can be used to analyze antibiotic usage as a part of systematic and effective data analysis of infection control information. Metadata was designed by using the XML DTD method after selecting components and evaluation measures for infection control. OLAP-a multidimensional analysis tool-for antibiotic usage analysis was developed by building a data mart through modeling. Experimental data were obtained from data on antibiotic usage at a university hospital in Cheonan area for one month in July of 1997. The major components of infection control metadata were antibiotic resistance information, antibiotic usage information, infection information, laboratory test information, patient information, and infection related costs. Among them, a data mart was constructed by designing a database to apply antibiotic usage information to a star schema. In addition, OLAP was demonstrated by calculating the statistics of antibiotic usage for one month. This study reports the development of a data mart on antibiotic usage for infection control through the implementation of XML and OLAP techniques. Building a conceptual, structured data mart would allow for a rapid delivery and diverse analysis of infection control information.

A GML-based Schema for Data Cube Construction in a Spatial Data Warehouse (공간 데이터 웨어하우스에서 데이터큐브 구축을 위한 GML 기반의 스키마)

  • Kwak Dong-Uk;You Byeong-Seob;Lee Dong-Uk;Lee Jae-Dong;Bae Hae-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2006.05a
    • /
    • pp.93-96
    • /
    • 2006
  • 본 논문에서는 OGC 의 공간 정보 인코딩 표준 명세인 GML 을 기반으로 공간 데이터 웨어하우스를 구축하는 스키마를 제안한다. GML 기반의 스키마는 비공간 정보뿐만 아니라 공간 정보에 대한 정의가 가능하다. 그리고 XML 스키마를 이용하여 전체 큐브 스키마, 차원 스키마 및 사실테이블에 대한 스키마 정의의 예를 보인다. 따라서 제안 기법은 GML 을 이용하여 이질적인 시스템간의 데이터 통합이 용이하고, 비공간 정보뿐만 아니라 공간 정보의 활용이 가능하다. 그리고 공간 데이터 웨어하우스의 개념계층 관계에 대한 표현이 용이하고 구조에 대한 이해가 쉽다.

  • PDF

An Efficiency Analysis of Management System for Academy Affairs Process Based on MDA (MDA기반 학사관리 프로세스 유효성 분석)

  • Yoon, Jung-Mo;Kim, Chi-Ho
    • The KIPS Transactions:PartD
    • /
    • v.15D no.2
    • /
    • pp.187-196
    • /
    • 2008
  • The system development approach method of the new paradigm, as being standardizing MDA(Model Driven Architecture) in OMG(Object Management Group), the core technique of MDA definite technique structure for system materialization focusing on Model, is to build the design and the statement for system in PIM(Platform Independent Model), Materialization technique, and to build PSM(Platform Specific Model) adapt to materialization environment, and then to be the technique transforming into language Platform suitable to materialization environment. It includes MOF(Meta-Object Facility), UML, XMI(XML Metadata Interchange), CWM(Common Warehouse Metamodel), the core of MDA. Though these operations MDA support automatic tools product the practicable system after carrying out transform operation with the above information. In this thesis, it will be approached how to design and materialize the Bachelor management system based on MDA and the importance of modeling should be emphasized by applying to the approach method based on MDA. It should be suggested how to develop software efficiently, written it out in PIM, and suggested the process transforming PSM into EJB by J2EE Platform. The system is designed and implemented using automatic tool, edit machine(Together Architect 2006 for Eclipse, Edit plus2) supporting MDA and Database Modeling tool(ER/WIN 4.1). In conclusion, it should be suggested the efficiency analysis result for development process of traditional and based on MDA.