Browse > Article

PrimeFilter: An Efficient XML Data Filtering based on Prime Number Indexing  

Kim, Jae-Hoon (서강대학교 컴퓨터공학과)
Kim, Sang-Wook (삼성전자 정보통신총괄)
Park, Seog (서강대학교 컴퓨터공학과)
Abstract
Recently XML is becoming a de facto standard for online data exchange between heterogeneous systems and also the research of streaming XML data filtering comes into the spotlight. Since streaming XML data filtering technique needs rapid matching of queries with XML data, it is required that the query processing should be efficiently performed. Until now, most of researches focused only on partial sharing of path expressions or efficient predicate processing and they were work for time and space efficiency. However, if containment relationship between queries is previously calculated and the lowest level query is matched with XML data, we can easily get a result that high level queries can match with the XML data without any other processing. That is, using this containment technique can be another optimal solution for streaming XML data filtering. In this paper, we suggest an efficient XML data filtering based on prime number indexing and containment relationship between queries. Through some experimental results, we present that our suggested method has a better performance than the existing method. All experiments have shown that our method has a more than two times better performance even though each experiment has its own distinct test purpose.
Keywords
XML data filtering; query containment; prime number indexing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 K. S. Candan, W. Hsiung, S. Chen, J. Tatemura, D. Agrawal, "AFilter: Adaptable XML filtering with prefix-caching and suffix-clustering," Proc. 32th VLDB, Seoul, Korea, pp. 559-570, 2006
2 J. Kim and S. Park, "PosFilter: An efficient filtering technique of XML documents based on postfix sharing," Proc. 24th BNCOD, Glasgow, Scotland, pp. 70-81, 2007
3 P.Th. Eugster, P. Felber, R. Guerraoui, A. M. Kermarrec, "The Many Faces of Publish/Subscribe," ACM computing surveys, 2003
4 J. Clark, S. DeRose, "XML Path Language (XPath) Version 1.0," http://www.w3.org/TR/xpath, November, 1999
5 Y. Chen, S. Davidson, and Y. Zheng, "An efficient XPath query processor for XML streams," Proc. of the 22nd International Conference on Data Engineering (ICDE), pp. 77, 2006
6 M. Franklin, Y. Diao, S. Rizvi, A. Edakkunni, M. Altinel, P. Fischer, R. To, and P. Hwang, "YFilter 1.0 release," 2003, available at http://yfilter.cs.umass. edu/code_release.htm
7 C.-Y. Chan, P. Felber, M. Garofalakis, and R. Rastogi, "Efficient Filtering of XML Documents with XPath Expressions," Proc. of the 18th International Conference on Data Engineering (ICDE), pp. 235-244, Feb. 2002
8 J. Kwon, P. Rao, B. Moon, and S. Lee, "Fist: scalable XML document filtering by sequencing twig patterns," In Proc. of the 31th VLDB, pp. 217-228, 2005
9 X. Wu, M. L. Lee, W. Hsu, "A Prime Number Labeling Scheme for Dynamic Ordered XML Trees," Proc. of the 20th International Conference on Data Engineering (ICDE), pp. 66-78, 2004
10 Y. Diao, M. Altinel, M. J. Franklin, H. Zhang, and P. Fischer, "Path sharing and predicate evaluation for high-performance XML filtering," ACM Transactions on Database Systems, Vol.28, No.4, pp. 467-516, 2003   DOI   ScienceOn