Browse > Article
http://dx.doi.org/10.13089/JKIISC.2010.20.1.105

An Algorithm for Detecting Leak of Defaced Confidential Information Based on SVDD  

Ghil, Ji-Ho (Graduate School of Information Management Engineering, Korea University)
Nam, Ki-Hyo (WinnerDigm Inc.)
Kang, Hyung-Seok (Graduate School of Information Management Engineering, Korea University)
Kim, Seong-In (Graduate School of Information Management Engineering, Korea University)
Abstract
This paper proposes the algorithm which addresses the problem of detecting leak of defaced confidential documents from original confidential document. Generally, a confidential document is defaced into various forms by insiders and then they are trying to leak these defaced documents to outside. Traditional algorithms detecting leak of documents have low accuracy because they are based on similarity of two documents, which do not reflect various forms of defaced documents in detection. In order to overcome this problem, this paper proposes a novel v-SVDD algorithm which is based on SVDD, the novelty detection algorithm. The result of experiment shows that there is significant improvement m the accuracy of the v-SVDD in comparison with the traditional algorithms.
Keywords
SVDD; Data Loss Prevention; Contents Filtering;
Citations & Related Records
연도 인용수 순위
  • Reference
1 남기효, 강형석, 길지호, 김성인, "내부정보 유출방지(DLP) 기술동향," 정보통신산업진흥원, 주간기술동향, 제1413호, pp. 1-9, 2009년 9월.
2 C. James and H. Ray, "Tightening the net: A review of current and next generation spam filtering tools," Computers & Security, vol. 25, no. 8, pp. 566-578, Nov. 2006.   DOI   ScienceOn
3 F. Janssens, W. Glänzel, and B.D. Moore, "Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis," In Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 360-369, Aug. 2007.
4 P. Glenisson, W. Glänzel, F. Janssens, and B.D. Moore, "Combining full text and bibliometric information in mapping scientific disciplines," Information Processing & Management, vol. 41, no. 6, pp. 1548-1572, Dec. 2005.   DOI   ScienceOn
5 T. David and D. Robert, "Support vector data description," Machine Learning, vol. 54, no. 1, pp. 45-66, Jan. 2004.   DOI
6 Z. Yong, C.Z. Xian, and L.K. Qiu, "Fuzzy multi-class classifier based on Support vector data description and improved PCM," Expert System with Applications, vol. 36, no. 5, pp. 8714-8718, July 2009.   DOI   ScienceOn
7 A. Per and C. Cristian, "Document-document similarity approaches and science mapping: Experimental comparison of five approaches," Journal of Informetrics, vol. 3, no. 1, pp. 49-63, Jan. 2009.   DOI   ScienceOn
8 G. Vladutz and J. Cook, "Bibliographic coupling and subject relatedness," In Proceedings of the 47th ASIS annual meeting, pp. 204-207, Dec. 1984.
9 F. Janssens, J. Leta, W. Glänzel, and B.D. Moore, "Towards mapping library and information science," Information Processing & Management, vol. 42, no. 6, pp. 1614-1642, Dec. 2006.   DOI   ScienceOn