http://dx.doi.org/10.5909/JBE.2008.13.1.86

Effective and Statistical Quantification Model for Network Data Comparing  

Cho, Jae-Ik (CIST, Korea University)
Kim, Ho-In (CIST, Korea University)
Moon, Jong-Sub (CIST, Korea University)
Publication Information
Journal of Broadcast Engineering / v.13, no.1, 2008, pp. 86-91
Abstract
In network data analysis, it is essential to determine how well a sample (estimation) dataset reflects the population data from which it is drawn. This paper compares the well-known MIT Lincoln Laboratory (MIT/LL) network data, which consists of standard information collectable from a network, with the KDD CUP 99 dataset, which was derived from the MIT/LL data. The comparison is based on the protocol information of both datasets. Correspondence analysis is used for the analysis, singular value decomposition (SVD) for two-dimensional visualization, and weighted Euclidean distance for quantifying the network data.
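As a rough illustration of the quantification step, the sketch below computes a weighted Euclidean distance between the protocol distributions of two datasets. It assumes the chi-square weighting that is standard in correspondence analysis (each squared profile difference is divided by the pooled share of that protocol); the abstract does not state the exact weighting used in the paper, so this is an assumption. The function name `chi_square_distance` and all counts are hypothetical and are not taken from the MIT/LL or KDD CUP 99 data.

```python
from math import sqrt


def chi_square_distance(counts_a, counts_b):
    """Weighted Euclidean (chi-square) distance between two count profiles.

    Assumption: weights are the inverse pooled column masses, as in
    standard correspondence analysis. The paper may use other weights.
    """
    if len(counts_a) != len(counts_b):
        raise ValueError("both datasets must use the same protocol categories")

    total_a = sum(counts_a)
    total_b = sum(counts_b)
    if total_a == 0 or total_b == 0:
        raise ValueError("each dataset needs at least one record")
    grand_total = total_a + total_b

    squared = 0.0
    for a, b in zip(counts_a, counts_b):
        mass = (a + b) / grand_total      # pooled share of this protocol
        if mass == 0:
            continue                      # protocol absent from both datasets
        diff = a / total_a - b / total_b  # difference of the two profiles
        squared += diff * diff / mass
    return sqrt(squared)


# Hypothetical record counts per protocol, ordered as (tcp, udp, icmp).
population = [800, 150, 50]
proportional_sample = [80, 15, 5]  # same protocol mix as the population
skewed_sample = [50, 15, 35]       # over-represents icmp

print(chi_square_distance(population, proportional_sample))
print(chi_square_distance(population, skewed_sample))
```

A sample with the same protocol mix as the population gives a distance of zero, and the distance grows as the mix diverges. The two-dimensional map described in the abstract would come from an SVD of the standardized contingency table, which this sketch does not cover.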
Keywords
Data Set Comparing; Intrusion Detection; Evaluation Data Set; Data Set Composing; Correspondence Analysis;