DOI QR코드

DOI QR Code

A Survey of Advances in Hierarchical Clustering Algorithms and Applications

  • Munshi, Amr (Computer Engineering Department, Umm Al-Qura University)
  • Received : 2022.05.05
  • Published : 2022.05.30

Abstract

Hierarchical clustering methods have been proposed for more than sixty years and yet are used in various disciplines for relation observation and clustering purposes. In 1965, divisive hierarchical methods were proposed in biological sciences and have been used in various disciplines such as, and anthropology, ecology. Furthermore, recently hierarchical methods are being deployed in economy and energy studies. Unlike most clustering algorithms that require the number of clusters to be specified by the user, hierarchical clustering is well suited for situations where the number of clusters is unknown. This paper presents an overview of the hierarchical clustering algorithm. The dissimilarity measurements that can be utilized in hierarchical clustering algorithms are discussed. Further, the paper highlights the various and recent disciplines where the hierarchical clustering algorithms are employed.

Keywords

References

  1. A. K. Jain and R. C. Dubes, Algorithms for Clustering Data, ser. Prentice-Hall Advanced References series. Englewood Cliffs, NJ: Prentice-Hall, 1988.
  2. A. Vr. F. Edwards and, L.L. CavaIli-Sforza, A method for Cluster Analysis, Biometrics, 1965, pp. 362-375.
  3. C. Peebles, Monothetic-Divisive Analysis of Moundville Burials. Newsletter of Computer Archaeology, 1972.
  4. H. T. Clifford and W. Stephenson, An introduction to numerical classification. New York: Academic Press, 1975.
  5. P. H. A. Sneath and R. R. Sokal, Numerical taxonomy, San Francisco: W. H. Freeman, 1973.
  6. J. W. Beckstead, "Using Hierarchical Cluster Analysis in Nursing Research," Western Journal of Nursing Research, vol. 24, no. 3, pp. 307-319, 2002. https://doi.org/10.1177/01939450222045923
  7. T. Hastie, R. Tibshirani, J. Friedman, and J. Franklin, "The Elements of Statistical Learning: Data Mining: Inference and Prediction," The Mathematical Intelligencer, vol. 27, no. 2, pp. 83-85, 2005.
  8. E. A. Mooi, and M. Sarstedt, A Concise Guide to Market Research, Springer-Verlag Heudelberg, 2011.
  9. X. Rui and D. Wunsch, "Survey of clustering algorithms," Neural Networks, IEEE Transactions, vol. 16, no. 3, pp. 645-678, May 2005. https://doi.org/10.1109/TNN.2005.845141
  10. A. A. Munshi, and Y. A.-R. I. Mohamed, "Photovoltaic power pattern clustering based on conventional and swarm clustering methods," Solar Energy, vol. 124, pp. 39-56, 2016. https://doi.org/10.1016/j.solener.2015.11.010
  11. D. T. Larose, Discovering Knowledge in Data: An Introduction to Data Mining, John Wiley & Sons, Hoboken, NJ, USA, 2005.
  12. I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann, San Francisco, 2005.
  13. N. Rajalingam and K. Ranjini, "Hierarchical Clustering Algorithm - A Comparative Study," International Journal of Computer Applications, vol. 19, no. 3, April 2011.
  14. T. G. Nikolaou, D. S. Kolokotsa, G. S. Stavrakakis and I. D. Skias, "On the Application of Clustering Techniques for Office Buildings' Energy and Thermal Comfort Classification," Smart Grid, IEEE Transactions on , vol. 3, no. 4, pp. 2196-2210, Dec. 2012. https://doi.org/10.1109/TSG.2012.2215059
  15. G. C. Tseng, "A comparative review of gene clustering in expression profile," Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th, vol. 2, pp. 1320-1324 Dec. 2004
  16. M-H Tsai, C-H Lai, S-Jr Lu and S-F Su, "Performance Comparisons between Unsupervised Clustering Techniques for Microarray Data Analysis on Ovarian Cancer," Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference, vol. 5, pp. 3685-3690, Oct. 2006.
  17. G. Gatta and M. B. Lasota, et al., "Survival of European women with gynaecological tumours, during the period 1978-1989," Eur J Cancr, vol. 34, no. 14, pp. 2218-2225, 1998. https://doi.org/10.1016/S0959-8049(98)00326-8
  18. G. Chicco, R. Napoli and F. Piglione, "Application of clustering algorithms and Self Organising Maps to Classify Electricity Customers", Proc. IEEE Bologna PowerTech, June 2003.
  19. W-B. Chen, C. Zhang and H. Zhong, "An unsupervised protein sequences clustering algorithm using functional domain information," Information Reuse and Integration, 2008. IRI 2008, pp. 76-81, July 2008
  20. Z-X. Han, S. Feng, Y. Ye and Q. Jiang, "A Parameter-Free Hybrid Clustering Algorithm Used for Malware Categorization," Anti-counterfeiting, Security, and Identification in Communication, 2009. ASID 2009. 3rd International Conference, pp. 480-483, Aug. 2009.
  21. C-Y. Zhao, B-X. Shi, M-X. Zhang and Z-W. Shang, "Image retrieval based on improved hierarchical clustering algorithm," Wavelet Analysis and Pattern Recognition (ICWAPR), 2010 International Conference, pp.154-157, July 2010.
  22. D. Singh and A. Singh, "A New Framework for Texture Based Image Content with Comparative Analysis of Clustering Techniques," Computational Intelligence and Communication Networks (CICN), 2012 Fourth International Conference, pp. 232-236, Nov. 2012.
  23. L. Meng and A-H. Tan, "Semi-supervised hierarchical clustering for personalized web image organization," Neural Networks (IJCNN), The 2012 International Joint Conference, pp. 1-8, June 2012.
  24. LI Zhao-peng and LI Ken-l, "Parallel data preprocessing based on hierarchical clustering," Microelectronics & Computer, vol. 24, no. 10, 2007
  25. J. Senthilnath, S. N. Omkar, V. Mani, N. Tejovanth, P. G. Diwakar and B. A. Shenoy, "Hierarchical Clustering Algorithm for Land Cover Mapping Using Satellite Images," Selected Topics in Applied Earth Observations and Remote Sensing, vol. 5, no. 3, pp. 762-768, June 2012. https://doi.org/10.1109/JSTARS.2012.2187432