Browse > Article
http://dx.doi.org/10.7465/jkdi.2015.26.6.1239

Cluster analysis for Seoul apartment price using symbolic data  

Kim, Jaejik (Department of Statistics, Sungkyunkwan University)
Publication Information
Journal of the Korean Data and Information Science Society / v.26, no.6, 2015 , pp. 1239-1247 More about this Journal
Abstract
In this study, 64 administrative regions with high frequencies of apartment trade in Seoul, Korea are classified by the apartment sale price. To consider distributions of apartment price for each region as well as the mean of the price, the symbolic histogram-valued data approach is employed. Symbolic data include all types of data which have internal variation in themselves such as intervals, lists, histograms, distributions, and models, etc. As a result of the cluster analysis using symbolic histogram data, it is found that Gangnam, Seocho, and Songpa districts and regions near by those districts have relatively higher prices and larger dispersions. This result makes sense because those regions have good accessibility to downtown and educational environment.
Keywords
Apartment price; cluster analysis; symbolic histogram-valued data;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Cha, S. H. and Srihari, S. H. (2002). On measuring the distance between histograms. Pattern Recognition Letter, 35, 1355-1370.   DOI
2 Billard, L. and Diday, E. (2006). Symbolic data analysis: Conceptual statistics and data mining, John Wiley and Sons, England.
3 Bock, H. H. and Diday, E. (2000). Analysis of symbolic data: Exploratory methods for extracting statistical information from complex data, Springer-Verlag, Berlin.
4 Brito, P. and Chavent, M. (2012). Divisive monothetic clustering for interval and histogram-valued data. ICPRAM 2012, Portugal.
5 Diday, E. (1989). Introduction a l'approche symbolique en analyse des donnees. Recherche Operationnelle, 2, 193-236.
6 Irpino, A. and Verde, R. (2006). A new Wasserstein based distance for the hierarchical clustering of histogram symbolic data. IFCS 2006, 185-192.
7 Kim, J. (2009). Dissimilarity measures for histogram-valued data and divisive clustering of symbolic objects, Ph.D. Thesis, University of Georgia.
8 Kim, J. and Billard, L. (2013). Dissimilarity measures for histogram-valued observations. Communications in Statistics - Theory and Methods, 42, 283-303   DOI
9 Lim, S. S. (2014). A study on the forecasting models using housing price index. Journal of the Korean Data & Information Science Society, 25, 65-76.   DOI
10 Strelkov, V. V. (2008). A new similarity measure for histogram comparison and its application in time series analysis. Pattern Recognition Letter, 29, 1768-1774.   DOI
11 Woo, S. Y., Lee, J.W. and Jhun, M. (2014). Microarray data analysis using relative hierarchical clustering. Journal of the Korean Data & Information Science Society, 25, 999-1009.   DOI