Browse > Article

SemFilter: A Simple and Efficient Semantic XML Message Filtering  

Kim, Jae-Hoon (서강대학교 컴퓨터공학과)
Park, Seog (서강대학교 컴퓨터공학과)
Abstract
Recent studies on XML filtering assume that all data sources follow a single global schema defined in a filtering system. However, beyond this simple assumption, a filtering system can provide a service that allows data publishers to have their own schema; hence, the data sources will become heterogeneous. The number of data sources is expected to be large in a filtering system and the data sources are frequently published, updated, and disappeared, that is, dynamic. In this paper, we introduce implementing a simple and efficient XPath query translation method for such a dynamic environment. The method is especially targeted for a query which is composed based only on users' knowledge and experience without a graphical guidance of the global schema. When a user queries a large number of heterogeneous data, there is a high possibility that the query is not consistent with the same local schema assumed by the user. Our query translation method also supports a function for this problem. Some experimental results for query translation performance have shown that our method has reasonable performance, and is more practical than the existing method.
Keywords
Semantic XML filtering system; Heterogeneous data; Data integration; XPath query translation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 S. Amer-Yahia, N. Koudas, A. Marian, D. Srivastava, and D. Toman, 'Structure and Content Scoring for XML,' In Proc. of 31th Inter. Conf on Very Large Data Bases (VLDB'05), pp. 361-372, 2005
2 E. Mena, A. Illarramendi, V. Kashyap, A. Sheth, 'OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies,' International journal on Distributed And Parallel Databases (DAPD), 8(2), pp. 223-271, 2000   DOI
3 M. Altinel and M. J. Franklin, 'Efficient filtering of XML documents for selective dissemination of information,' Proc. 26th VLDB, Cairo, Egypt, pp. 53-64, Sept. 2000
4 R. Agrawal, A. Borgida, and H. V. Jagadish, 'Efficient management of transitive relationships in large data and knowledge bases,' In Proc. of the SIGMOD Inter. Conf. on Management of Data, pp. 253-262, 1989
5 J. Kim and S. Park, 'PosFilter: An efficient filtering technique of XML documents based on postfix sharing,' Proc. 24th BNCOD, Glasgow, Scotland, pp. 70-81, 2007
6 XPath Version 1.0, http://www.w3.org/TR/xpath
7 Y. Kanza and S. Sagiv, 'Flexible queries over semistructured data,' In Proc. 20th Symposium on Principles of Database Systems, pp. 40-51, May 2001
8 RDF Primer, W3C Recommendation, http://www.w3.org/TR/rdf-primer/
9 OWL Web Ontology Language Overview, W3C Recommendation, http://www.w3.org/TR/owl-features/
10 A. Doan, J. Madhavan, P. Domingos, A. Halvey, 'Learning to map between ontologies on the semantic web,' In Proc. of the 11th International Conference on World Wide Web, pp. 662-673, 2002
11 Q. Li and B. Moon, 'Indexing and querying XML data for regular path expressions,' In Proc. of 27th Inter. Conf. on Very Large Data Bases (VLDB'02), pp. 361-370, 2001
12 K. S. Candan, W. Hsiung, S. Chen, J. Tatemura, D. Agrawal, 'AFilter: Adaptable XML filtering with prefix-caching and suffix-clustering,' Proc. 32th VLDB, Seoul, Korea, pp. 559-570, 2006
13 Y. Diao, M. Altinel, M. J. Franklin, H. Zhang, and P. Fischer, 'Path sharing and predicate evaluation for high-performance XML filtering,' ACM Transactions on Database Systems, Vol.28, No.4, pp. 467-516, 2003   DOI   ScienceOn
14 C. X. Chen, G. A. Mihaila, S. Padmanabhan, and I. M. Rouvellou, 'Query translation scheme for heterogeneous XML data sources,' Proc. 7th WIDM, pp. 31-38, Nov. 2005