Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2006.13D.2.147

XML-based Modeling for Semantic Retrieval of Syslog Data  

Lee Seok-Joon (중앙대학교 대학원 정보시스템학과)
Shin Dong-Cheon (중앙대학교 정보시스템학과)
Park Sei-Kwon (중앙대학교 정보시스템학과)
Abstract
Event logging plays increasingly an important role in system and network management, and syslog is a de-facto standard for logging system events. However, due to the semi-structured features of Common Log Format data most studies on log analysis focus on the frequent patterns. The extensible Markup Language can provide a nice representation scheme for structure and search of formatted data found in syslog messages. However, previous XML-formatted schemes and applications for system logging are not suitable for semantic approach such as ranking based search or similarity measurement for log data. In this paper, based on ranked keyword search techniques over XML document, we propose an XML tree structure through a new data modeling approach for syslog data. Finally, we show suitability of proposed structure for semantic retrieval.
Keywords
Event Logging; Semantic Retrieval; XML;
Citations & Related Records
연도 인용수 순위
  • Reference
1 S. Cohen and J. Mamou and Y. Kanza, and Y. Sagiv, 'XSEarch : A Semantic Search Engine for XML', Proc. of 29th International Conference on Very Large Data Bases, pp.45-56, Sep., 2003
2 XML Interface to Syslog Messages, http://www.cisco.com, 2004
3 S. Cohen and Y. Kanza and Y Sagiv, 'Generating Relations from XML Documents', ICDT, Vol. 2572, pp.285-299, Jan., 2003
4 R. Vaarandi, 'A Breadth-First Algorithm for Mining Frequent Patterns from Event Logs', INTELLCOMM, Vol. 3283, pp.293-308, Nov., 2004
5 R. Vaarandi, 'A Data Clustering Algorithm for Mining Patterns From Event Logs', Proc. of the 2003 IEEE Workshop on IP Operations and Management, pp.119-126, 2003
6 R. Gerhards, 'The syslog Protocol', syslog Working Group, http://www.ietf.org, 2005
7 P. Berkhin, 'Survey of Clustering Data Mining Techniques', Accrue Software, http://www.accrue.com, 2002
8 R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval, Addison-Wesley Longman Publishing Company, 1999
9 L. Page and S. Brin and R. Motwani and T. Winograd, 'The PageRank Citation Ranking: Bringing Order to the Web', Stanford Digital Library Technologies Project, 1998
10 L. Guo and F. Shao and C. Botev and J. Shanmugasundaram, 'XRANK: Ranked Keyword Search over XML Documents', In Proc. 2003 ACM SIGMOD International Conference on Management of Data, pp.16-27, June, 2003   DOI
11 Lire Documentation, http://logreport.org/lire/, 2004
12 J. Punin and M. Krishnamoorthy and M. Zaki, 'LOGML-Log Markup Language for Web Usage Mining', Lecture Notes In Computer Science; Vol.2356, pp.88-112, 2001
13 L. Feng and E. Chang and T. Dillon, 'A Semantic Network-Based Design Methodology for XML Documents', ACM Transactions on Information Systems, (20, 4), pp.390-421, Oct., 2002   DOI   ScienceOn
14 J. Clark and S. DeRose, 'XML Path Language', W3C Recommendation, 1999
15 C. Lonvick, 'The BSD syslog Protocol', RFC3164, 2001
16 H. Mannila and H. Toivonen and A. I. Verkamo, 'Discovery of Frequent Episodes in Event Sequences', Data Mining and Knowledge Discovery, (1, 3), pp.259-289, 1997   DOI
17 J. Abela and T. Debeaupuis, 'Universal Format for Logger Messages', Herve Schauer Consultants, http:/www.hsc.fr/, 1999
18 B. Babcock and S. Babu and M. Datar and R. Motwani and J. Widom, 'Models and Issues in Data Stream Systems', ACM Symposium on Principles of Database Systems, pp. 1-16, June, 2002   DOI
19 XML-Logs : Analyse your logs using XML encoding, Herve Schauer Consultants, http://www.hsc.fr/ressources/outils/xml-logs/index.html.en, 2004