Browse > Article
http://dx.doi.org/10.5351/KJAS.2007.20.1.061

Information Theory and Data Visualization Approach to Poll Analysis  

Huh, Moon-Yul (Department of Statistics, Sungkyunkwan University)
Cha, Woon-Ock (Department of Mutimedia Engineering, Hansung University)
Publication Information
The Korean Journal of Applied Statistics / v.20, no.1, 2007 , pp. 61-78 More about this Journal
Abstract
A method for poll analysis using information theory and data visualization is proposed in this paper. Questions of opinion poll consist of a target variable and many explanation variables. The type of explanation variables is either numerical or categorical. In this study, explanation variables of mixed types have been ranked according to the magnitude of their effect on target variable by using mutual information. Likewise, the order of explanation variables has been evaluated using data visualization. This is the first study to quantify the impact of specific explanation variable on the related target variable.
Keywords
Association measure; mutual information; data visualization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Tan, P. N., Kumar, V. and Srivastava, J. (2002). Selecting the right interestingness measure for association patterns, In Proceedings of the eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 32-41
2 Cang, S. and Partridge, D. (2004). Feature ranking and best feature subset using mutual information, Neural Computing & Applications, 13, 175-184   DOI
3 Cleveland, W. S. and McGill, M. E. (1988). Dynamic Graphics for Data Analysis, Wadsworth & Brooks/Cole
4 Cover, T. M. and Thomas, J. A. (2004). Elements of Information Theory, 2nd ed., John Wiley & Sons
5 Huh, M. Y. and Song, K. R. (2002). DAVIS: a Java-based data visualization system, Computational Statistics, 17, 411-423, http://stat.skku.ac.kr/myhuh/DAVIS.html   DOI
6 Hutter, M. and Zaffalon, M. (2005). Distribution of mutual information from complete and incomplete data, Computational Statistics & Data Analysis, 48, 633-657   DOI   ScienceOn
7 Lee, S. C. and Huh, M. Y. (2003). A measure of association for complex data, Computational Statistics & Data Analysis, 44, 211-222   DOI   ScienceOn
8 Parzen, E. (1962). On estimation of a probability density function and mode, Annals of Mathematical Statatistics, 33, 1065-1076   DOI