http://dx.doi.org/10.5351/KJAS.2008.21.1.095

An Imputation for Nonresponses in the Survey on the Rural Living Indicators  

Cho, Young-Sook (Rural Resource Development Institute)
Chun, Young-Min (Associate Research Fellow, Korea Employment Information Service)
Hwang, Dae-Yong (Rural Resource Development Institute)
Publication Information
The Korean Journal of Applied Statistics / v.21, no.1, 2008, pp. 95-107
Abstract
The Survey on the Rural Living Indicators is a statistic approved by the National Statistical Office and conducted by the Rural Resource Development Institute. This study used the raw data of the 2005 survey. After editing the raw data, we studied the 1,582 households that remained once cases containing nonresponses were eliminated, and imputed nonresponses for 15 items selected from the 146 items. The imputation methods and the efficiency measures used in the simulation differed by data type. For continuous data, we imputed nonresponses with mean imputation, regression imputation, and adjusted grey-based k-NN imputation (DU, DW, WU, WW), and compared the results by RMSE. For categorical data, we imputed nonresponses with the mode method, probability imputation, the conditional mode method, the conditional probability method, and hot-deck imputation, and compared the results by accuracy. The results indicate that regression imputation and adjusted grey-based k-NN imputation are appropriate for continuous data, and that hot-deck imputation is appropriate for categorical data.
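The evaluation design described in the abstract (delete known values from complete cases, impute them, and score the result against the truth) can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' implementation: all function names are hypothetical, the regression is assumed to be a simple least-squares line on a single auxiliary variable, the hot deck is assumed to draw a random donor within an imputation class, and the adjusted grey-based k-NN variants are not reproduced here.

```python
import random
import statistics


def rmse(true_vals, imputed_vals):
    """Root mean square error between true and imputed values."""
    n = len(true_vals)
    return (sum((t - p) ** 2 for t, p in zip(true_vals, imputed_vals)) / n) ** 0.5


def accuracy(true_vals, imputed_vals):
    """Share of imputed categories that match the true category."""
    hits = sum(t == p for t, p in zip(true_vals, imputed_vals))
    return hits / len(true_vals)


def mean_impute(obs_y, n_missing):
    """Replace every missing value with the respondent mean."""
    return [statistics.fmean(obs_y)] * n_missing


def regression_impute(obs_x, obs_y, miss_x):
    """Fit y = a + b*x on respondents (assumed model), predict for nonrespondents."""
    mx, my = statistics.fmean(obs_x), statistics.fmean(obs_y)
    sxx = sum((x - mx) ** 2 for x in obs_x)
    sxy = sum((x - mx) * (y - my) for x, y in zip(obs_x, obs_y))
    b = sxy / sxx
    a = my - b * mx
    return [a + b * x for x in miss_x]


def mode_impute(obs_c, n_missing):
    """Replace every missing category with the most frequent respondent category."""
    return [statistics.mode(obs_c)] * n_missing


def hot_deck_impute(obs_group, obs_c, miss_group, rng):
    """Random hot deck within classes; uses all donors if a class has none."""
    donors = {}
    for g, c in zip(obs_group, obs_c):
        donors.setdefault(g, []).append(c)
    return [rng.choice(donors.get(g, obs_c)) for g in miss_group]


def simulate(seed=0, n=200, miss_rate=0.2):
    """Delete values at random from synthetic complete data and compare RMSE."""
    rng = random.Random(seed)
    x = [rng.uniform(0, 10) for _ in range(n)]
    y = [3 + 2 * xi + rng.gauss(0, 1) for xi in x]  # synthetic, not survey data
    missing = set(rng.sample(range(n), int(n * miss_rate)))
    obs = [i for i in range(n) if i not in missing]
    mis = sorted(missing)
    truth = [y[i] for i in mis]
    obs_x, obs_y = [x[i] for i in obs], [y[i] for i in obs]
    return {
        "mean": rmse(truth, mean_impute(obs_y, len(mis))),
        "regression": rmse(truth, regression_impute(obs_x, obs_y, [x[i] for i in mis])),
    }
```

Calling simulate() returns the RMSE of each continuous method on the artificially deleted values; mode_impute and hot_deck_impute can be scored the same way with accuracy. A lower RMSE or a higher accuracy indicates the better imputation under this design.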
Keywords
Accuracy; imputation; nonresponses; RMSE (root mean square error)