http://dx.doi.org/10.3745/KIPSTD.2002.9D.3.381

A Schema Extraction Method using Elements Information in XML Documents  

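Kim, Seong-Rim (Full-time Lecturer, Computer Science Major, Division of Information Science, Dongduk Women's University)
Yun, Yong-Ik (Department of Multimedia, Division of Information Science, Sookmyung Women's University)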
Abstract
XML documents, which are becoming a new standard for expressing and exchanging data on the Internet, do not have a defined schema. Existing query languages such as SQL or OQL therefore cannot be applied to XML documents directly. Research on schema extraction for XML documents and on XML query languages is actively under way. For a given user query, the results may be too many or too few, so it is important to return an adequate set of results. This paper proposes a method for extracting schemas at multiple levels according to the frequency of element occurrence in XML documents. The schema can then be reduced or extended to match the user's query more flexibly.
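The abstract does not spell out the extraction procedure, so the following is only a minimal sketch of the general idea of frequency-based, multi-level schema extraction, not the authors' algorithm. It assumes that "frequency" means the fraction of documents in which a root-to-element tag path occurs, and that each schema level corresponds to a support threshold. The function names `element_paths` and `leveled_schemas`, the thresholds, and the sample documents are all hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def element_paths(xml_text):
    """Return the set of distinct root-to-element tag paths in one document."""
    root = ET.fromstring(xml_text)
    paths = set()
    stack = [(root, root.tag)]
    while stack:
        node, path = stack.pop()
        paths.add(path)
        for child in node:
            stack.append((child, path + "/" + child.tag))
    return paths


def leveled_schemas(documents, thresholds):
    """Map each threshold to the paths found in at least that fraction of documents.

    Assumption: a lower threshold yields a larger (extended) schema,
    a higher threshold yields a smaller (reduced) one.
    """
    counts = Counter()
    for doc in documents:
        # Each path is counted once per document, not once per occurrence.
        counts.update(element_paths(doc))
    total = len(documents)
    return {
        t: sorted(p for p, c in counts.items() if c / total >= t)
        for t in thresholds
    }


# Hypothetical sample documents, for illustration only.
docs = [
    "<film><title>A</title><year>1999</year><director>X</director></film>",
    "<film><title>B</title><year>2000</year></film>",
    "<film><title>C</title><award>Y</award></film>",
]
schemas = leveled_schemas(docs, thresholds=[1.0, 0.6, 0.3])
```

Under these assumptions, a query that returns too few results could be evaluated against the schema at a lower threshold (more paths), and one that returns too many could use a higher threshold (fewer paths).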
Keywords
XML; document; schema extraction; frequency;