Browse > Article

A Parameter-Free Approach for Clustering and Outlier Detection in Image Databases  

Oh, Hyun-Kyo (Department of Electronics and Computer Engineering, Hanyang University)
Yoon, Seok-Ho (Department of Electronics and Computer Engineering, Hanyang University)
Kim, Sang-Wook (Department of Electronics and Computer Engineering, Hanyang University)
Publication Information
Abstract
As the volume of image data increases dramatically, its good organization of image data is crucial for efficient image retrieval. Clustering is a typical way of organizing image data. However, traditional clustering methods have a difficulty of requiring a user to provide the number of clusters as a parameter before clustering. In this paper, we discuss an approach for clustering image data that does not require the parameter. Basically, the proposed approach is based on Cross-Association that finds a structure or patterns hidden in data using the relationship between individual objects. In order to apply Cross-Association to clustering of image data, we convert the image data into a graph first. Then, we perform Cross-Association on the graph thus obtained and interpret the results in the clustering perspective. We also propose the method of hierarchical clustering and the method of outlier detection based on Cross-Association. By performing a series of experiments, we verify the effectiveness of the proposed approach. Finally, we discuss the finding of a good value of k used in k-nearest neighbor search and also compare the clustering results with symmetric and asymmetric ways used in building a graph.
Keywords
Cross-Association; parameter-free;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Gdalyahu, D. Weinshall, and M. Werman, "Self-Organization in Vision: Stochastic Clustering for Image Segmentation, Perceptual Grouping, and Image Database Organization," IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 23, No. 10, pp. 1053-1074, 2001.   DOI   ScienceOn
2 Y. Chen, J. Wang, and R. Krovetz, "CLUE: Cluster-Based Retrieval of Images by Unsupervised Learning," IEEE Trans. Image Processing, Vol. 14, No. 8, pp. 1187-1201, 2005.   DOI
3 J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2006.
4 D. Chakrabarti, "Autopart: Parameter-free graph partitioning and outlier detection," In Proc. of ECML PKDD, pages 112–124, 2004.
5 K. Beyer, J. Goldstein, R. Ramakrishnan, U. Shaft, "When Is Nearest Neighbor Meaningful?," In Proc. Int'l Conf. on Database Theory, pp. 217-235, 1999.
6 S. Guha, R. Rastogi, and K. Shim, "CURE: An Efficient Clustering Algorithm for Large Databases," In Proc. of ACM SIGMOD Int'l. Conf. on Management of Data, pp. 73-84, 1998.
7 T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: An Efficient Data Clustering Method for Very Large Databases," In Proc. of ACM SIGMOD Int'l. Conf. on Management of Data, pp. 103-114, 1996.
8 G. Karypis, E. H. Han, and V. Kumar, "Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling," IEEE Computer, Vol. 32, No. 8, pp. 68-75, 1999.   DOI   ScienceOn
9 P. Grunwald, A Tutorial Introduction To The Minimum Description Length Principle, MIT Press, 2005.
10 W. Niblack, R. Barber, W. Equitz, M. Flickner, E. H. Clasman, D. Petkovic, P. Yanker, C. Faloutsos, G. Taubin, "The QBIC Project: Querying Images by Content using Color, Texture, and Shape," In Proc. of Storage and Retrieval for Image and Video Databases, pp. 173-187, 1993.
11 D. Chakrabarti, S. Papadimitriou, D. S. Modha, C. Faloutsos, "Fully Automatic Crossssociations," In Proc. Int'l Conf. on Knowledge Discovery and Data Mining, pp. 79-88, 2004.
12 이 재호, 장 민희, 김 두열, 김 상욱, 김 민호, 최 진성, "Shader Space Navigator: 유사 쉐이더 검색 시스템," 대한전자공학회논문지, Vol. 45, No. 3, pp. 198-207, 2008년 5월.
13 S. Papadimitriou, J. Sun, P. S. Yu, C. Faloutsos, "Hierarchical, parameter-free community discovery," In Proc. of ECML PKDD, page 170-187, 2008.