Browse > Article

A Filtering Technique of Streaming XML Data based Postfix Sharing for Partial matching Path Queries  

Park Seog (서강대학교 컴퓨터학과)
Kim Young-Soo (삼성전자 정보통신총괄)
Abstract
As the environment with sensor network and ubiquitous computing is emerged, there are many demands of handling continuous, fast data such as streaming data. As work about streaming data has begun, work about management of streaming data in Publish-Subscribe system is started. The recent emergence of XML as a standard for information exchange on Internet has led to more interest in Publish - Subscribe system. A filtering technique of streaming XML data in the existing Publish- Subscribe system is using some schemes based on automata and YFilter, which is one of filtering techniques, is very popular. YFilter exploits commonality among path queries by sharing the common prefixes of the paths so that they are processed at most one and that is using the top-down approach. However, because partial matching path queries interrupt the common prefix sharing and don't calculate from root, throughput of YFilter decreases. So we use sharing of commonality among path queries with the common postfixes of the paths and use the bottom-up approach instead of the top-down approach. This filtering technique is called as PoSFilter. And we verify this technique through comparing with YFilter about throughput.
Keywords
Streaming XML data; Filtering; Postfix sharing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Yanlei Diao, Peter Fischer, Michael J. Franklin, Raymond To, 'YFilter: Efficient and scalable Filtering of XML Documents,' In Proceeding of the ICDE, 2002   DOI
2 Todd J. Green, Gerome Miklau, Makoto Onizuka, and Dan Suciu, 'Processing XML Streams with Deterministic Automata,' In Proceeding of the LNCS, 2003, pp. 173-189
3 Todd J. Green, Gerome Miklau, Makoto Onizuka, and Dan Suciu, 'Processing XML Streams with Deterministic Automata and Stream Indexes,' In Proceeding of the TODS, 2004
4 Albrecht Schmidt, Florian Waas, Martin Kersten, Michael J. Carey, Ioana Manolescu, Ralph Busse, 'XMark: A Benchmark for XMl Data Management,' In Proceeding of the VLDB, 2002, pp.974-985
5 Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim, 'APEX: An Adaptive Path Index for XML data,' In Proceeding of the SIGMOD, 2002, pp.121-132   DOI
6 B. Babcock, S. Babu, M. Datar, R. MotWani, J. Widom, 'Models and Issues In Data Stream Systems,' In Proceeding of the PODS, 2002, pp.1-16   DOI
7 S. Babu and J. Widom, 'Continuous Queries over data Streams,' In Proceeding of the SIGMOD, 2001, pp.109-120   DOI
8 Jianjun Chen, David J. DeWitt, Feng Tian, Yuan Wang, 'NiagaraCQ: A Scalable Continuous Query System for Internet databases,' In Proceeding of the SIGMOD, 2000, pp.379-390   DOI
9 Tim Furche, 'Optimizing multiple queries against XML streams,' http://www.pms.ifi.lmu. de/publikationen/diplomarbeiten/Tim.Furche/mqspex.pdf
10 P.Th. Eugster, P.Felber, R. Guerraoui, A. M. Kermarrec, 'The Many Faces of Publish/ Subscribe,' ACM Computing Serveys, 2003, pp.114-131   DOI   ScienceOn
11 Mehmet Altinel, Michael J. Franklin, 'Efficient Filtering of XML Documents for selective Dissemination of Information,' In Proceeding of the VLDB, 2000, pp.53-64
12 Yanlei Diao, Mehmet Altinel, Michael J. Franklin, Hao Zhang, Peter Fischer, 'Path Sharing and Predicate Evaluation for High-Performance XML Filtering,' ACM Transcations on Database Systems, 2003, pp. 467-516   DOI   ScienceOn
13 Yanlei Diao, Michael J. Franklin, 'High-Performance XML Filtering: an overview of YFilter,' Bulletin of the IEEE, 2003, pp.1-8