Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2008.15-D.1.23

Mining of Frequent Structures over Streaming XML Data  

Hwang, Jeong-Hee (남서울대학교 컴퓨터학과)
Abstract
The basic research of context aware in ubiquitous environment is an internet technique and XML. The XML data of continuous stream type are popular in network application through the internet. And also there are researches related to query processing for streaming XML data. As a basic research to efficiently query, we propose not only a labeled ordered tree model representing the XML but also a mining method to extract frequent structures from streaming XML data. That is, XML data to continuously be input are modeled by a stream tree which is called by XFP_tree and we exactly extract the frequent structures from the XFP_tree of current window to mine recent data. The proposed method can be applied to the basis of the query processing and index method for XML stream data.
Keywords
Stream Data; XML Data; Frequent Structure Extraction; XML Mining;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom, 'Models and Issues in Data Stream Systems,' Invited paper in Proc. of PODS, 2002
2 V. Ganti, J. Gehrke, R. Ramakrishnan, 'DEMON: Mining and Monitoring Evolving Data,' TKDE 1391), pp.50-63, 2001
3 R. Nayak, R. Witt, A. Tonev, 'Data Mining and XML Documents,' International Conference on Internet Computing, 2002
4 M. Zaki, 'Efficiently Mining Frequent Tree in a Forest,' Proceedings of the ACM SIGKDD International Conference, 2002
5 T.Asai, K.Abe, S. Kawasoe, H.Sakamoto, et al., 'Online algorithms for mining semi-structured data stream,' In.Proc. ICDM, 2002
6 S. Babu, J. Widom, 'Continuous Queries over Data Stream,' SIG MOD Record 30(3), pp.109-120, 2001   DOI   ScienceOn
7 J. Chen, D. J. DeWitt, F. Tian, U. Wang, 'A Scalable Continuous Query System for Internet Database,' ACM SIGMOD, 2000
8 L.H. Yang, M.L. Lee, W. Hsu, 'Finding hot query patterns over an XQuery stream,' VLDB Journal Special Issue on Data Stream Processing, 2004
9 G. S. Manku, R. Motwani, 'Approximate Frequency Counts over Data Streams,' VLDB 2002
10 D. Braga, A. Campi, S. Ceri, M. Klemettinen, and P. Lanzi, 'A Tool for Extracting XML Association Rules from XML Documents,' Proceedings of IEEE-ICTAI 2002, USA, November, 2002
11 C. K. S. Leung Q. I. Khan, 'DSTree:A Tree Structure for the Mining of Frequent Sets from Data Streams,' In proc. ICDM 2006
12 J. Li. D. Maier, 'Semantics and Evaluation Techniques for Window Aggregates in Data Streams,' In Proc. of ACM SIGMOD International Conference on the Management of Data, 2005
13 장중혁, 이원석, '데이터 스트림에서 개방 데이터 마이닝 기반의 빈발 항목 탐색,' 정보처리학회논문지D, 제10-D권 제3호, 2003
14 김현규, 김철기, 김명호, '비순서화된 스트림 처리를 위한 슬라이딩 윈도우 기법,' 정보과학회, 제33권 제 6호, 2006
15 C. K. S. Leung Q. I. Khan, T. Hoque, 'CanTree:A Tree Structure for Efficient Incremental Mining of Frequent Pattern Sets,' In proc. ICDM 2005
16 김영현, 강현철, 'XML 스트림 데이터에 대한 적응력 있는 질의 처리 시스템,' 정보과학회, 제33권 제 3호, 2006
17 NIAGARA query engine. http://www.cs.wisc.edu/niagara/data.html
18 M.C. Hsieh, Y.H. Wu, A.L. Chen, 'Discovering Frequent Tree Patterns over Data Stream,' In Proc of SIAM International Conference on Data Mining, 2006