Browse > Article

An Efficient Index Structure for Semantic-based XML Keyword Search  

Lee, Hyung-Dong (서울대학교 컴퓨터공학부)
Kim, Sung-Jin (서울대학교 컴퓨터공학부)
Kim, Hyoung-Joo (서울대학교 컴퓨터공학부)
Abstract
Search results of XML keyword search are defined generally as the most specific elements containing all query keywords in the literature. The labels of XML elements and semantic information such as ontology, conceptual model, thesaurus, and so on, are used to improve the preciseness of the search results. This paper presents a hierarchical index for an efficient XML keyword query processing on the condition that returnable search concepts are defined and users' query concepts can be interpreted with the help of the semantic information. The hierarchical index separately stores the XML elements containing a keyword on the basis of the hierarchical relations of the concepts that the XML elements belong to, and makes it possible to obtain least common ancestors, which are candidates for the search results, with selectively reading the elements belonging to the concepts relevant to query concepts and without considering all the combinations of the elements having been read. This paper deals with how to organize the hierarchical index and how to process XML keyword queries with the index. In our experiment with the DBLP XML document and the XML documents in the INEX2003 test set, the hierarchical index worked well.
Keywords
XML(extensible Markup Language); Keyword Search; Hierarchical Index;
Citations & Related Records
연도 인용수 순위
  • Reference
1 http://inex.is.informatik.uni-duisburg.de:2003/, INEX (2003): INitiative for the Evaluation of Xml retrieval
2 http://www.sleepycat.com/, BerkeleyDB
3 M. Ronaldo dos Santos and A. H. Carlos: 'A Rule-Based Conversion of a DTD to a Conceptual Schema,' Proceedings of the 20th International Conference on Conceptual Modeling: Conceptual Modeling, 2001
4 I. F. Cruz, H. Xiao and F. Hsu: 'An OntologyBased Framework for XML Semantic Integration', IDEAS, 2004   DOI
5 S. Cohen: 'XSEarch: A semantic search engine for XML,' VLDB, 2003
6 L. Bernadette Farias, sio, S. Ana Carolina, R. Luciano do and G. go: 'Conceptual modeling of XML schernas,' Proceedings of the 5th ACM international workshop on Web information and data management, 2003
7 D. Carlson, Modeling XML Applications with UML: Practical e-Business Applications, Addison-Wesley, 2001
8 P. Giuseppe: 'ERX: a conceptual model for XML documents,' Proceedings of the 2000 ACM symposium on Applied computing-Volume 2, 2000
9 F. Norbert, G. Kai and johann, 'XIRQL: An XML query language based on information retrieval concepts,' ACM Trans. Inf. Syst, 2004   DOI   ScienceOn
10 http://www.urnl.org/, The Unified Modeling Language
11 Y. Li, C. Yu and H. V. Jagadish: 'Schema-Free XQuery,' VLDB, 2004
12 F. Norbert, G. Kai and johann: 'XIRQL: a query language for information retrieval in XML documents,' Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, 2001
13 C. David, S. M. Y oelle, M. Matan, M. Y osi and S. Aya: 'Searching XML documents via XML fragments,' Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, 2003   DOI
14 M. C. A. Klein: 'Interpreting XML Documents via an RDF Schema Ontology,' DEXA, 2002
15 L. Guo, F. Shao, C. Botev and J. Shanmugasundararn: 'XRANK: Ranked keyword search over XML documents,' SIGMOD, 2003   DOI
16 Y. Xu and Y. Papakonstantinou: 'Efficient keyword search for smallest LCAs in XML databases,' SIGMOD, 2005   DOI
17 S. J. Kim, Lee, H., and Kim, H-J, 'Adaptive Partitioned Index for Efficient XML Keyword Search,' submitted for publication (Journal of Research and Practice in Information Technology) 2005
18 F. Daniela, K. Donald and M. Ioana: 'Integrating keyword search into XML query processing,' Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking, 2000