Browse > Article
http://dx.doi.org/10.5351/KJAS.2006.19.2.291

A Study on the Data Fusion Method using Decision Rule for Data Enrichment  

Kim S.Y. (Department of Statistical Informatics, Chonbuk National University)
Chung S.S. (Department of Statistical Informatics, Chonbuk National University)
Publication Information
The Korean Journal of Applied Statistics / v.19, no.2, 2006 , pp. 291-303 More about this Journal
Abstract
Data mining is the work to extract information from existing data file. So, the one of best important thing in data mining process is the quality of data to be used. In this thesis, we propose the data fusion technique using decision rule for data enrichment that one phase to improve data quality in KDD process. Simulations were performed to compare the proposed data fusion technique with the existing techniques. As a result, our data fusion technique using decision rule is characterized with low MSE or misclassification rate in fusion variables.
Keywords
Data fusion; Statistical matching; Data enrichment; Data Mining; k-Nearest Neighbor; Decision Tree; Recipient file; Donor file;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Ingram, D., O' Hare, J., Scheuren, F. and Turek, J. (2000). Statistical matching: a new validation case study, Proceedings of the Survey Research Methods Section, American Statistical Association
2 김순영 , 이기훈, 정성석 (2005), A Study on a Statistical Matching Method Using Clustering for Data Enrichment, <한국통계학회 논문집>, 12, 509-520   과학기술학회마을   DOI
3 정성석, 김순영, 김현진 (2004). 데이터 보강을 위한 데이터 통합기법에 관한 연구, <응용통계연구>, 17, 605-617
4 Rassler, S. (2002). Statistical Matching : A frequentist theory, practical applications, and alternative Bayesian approaches, Springer Verlag, New York
5 Saporta, G. (2002). Data fusion and data grafting, Computational Statistics & Data Analysis, 38, 465-473   DOI   ScienceOn
6 U.S. Department of Commerce, (1980). Report on exact and statistical matching techniques, Statistical Policy Working Paper 5. Washington, DC: Federal Committee on Statistical Methodology
7 van der Putten, P., Joost N. K. and Gupta, A. (2002). Why the Information explosion can be bad for data mining, and how data fusion provides a way out, Second SIAM International Conference on Data Mining, Arlington, April, 11-13
8 Yoshizoe, Y. and Araki, M. (1999). Use of statistical matching for household surveys in Japan, In 52nd Session of the International Statistical Institute, Helsinki, Finland