Browse > Article
http://dx.doi.org/10.5392/JKCA.2022.22.03.354

Metabolic Diseases Classification Models according to Food Consumption using Machine Learning  

Hong, Jun Ho ((주)현대자동차 데이터인텔리전스팀)
Lee, Kyung Hee (충북대학교 경영정보학과)
Lee, Hye Rim (농촌진흥청 디지털농업추진단)
Cheong, Hwan Suk (농촌진흥청 디지털농업추진단)
Cho, Wan-Sup (충북대학교 경영정보학과)
Publication Information
Abstract
Metabolic disease is a disease with a prevalence of 26% in Korean, and has three of the five states of abdominal obesity, hypertension, hunger glycemic disorder, high neutral fat, and low HDL cholesterol at the same time. This paper links the consumer panel data of the Rural Development Agency(RDA) and the medical care data of the National Health Insurance Service(NHIS) to generate a classification model that can be divided into a metabolic disease group and a control group through food consumption characteristics, and attempts to compare the differences. Many existing domestic and foreign studies related to metabolic diseases and food consumption characteristics are disease correlation studies of specific food groups and specific ingredients, and this paper is logistic considering all food groups included in the general diet. We created a classification model using regression, a decision tree-based classification model, and a classification model using XGBoost. Of the three models, the high-precision model is the XGBoost classification model, but the accuracy was not high at less than 0.7. As a future study, it is necessary to extend the observation period for food consumption in the patient group to more than 5 years and to study the metabolic disease classification model after converting the food consumed into nutritional characteristics.
Keywords
Food Consumption; Metabolic Disease; Classification Model; Machine Learning; Data Linkage;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 홍남기, 박혜정, 이유미. "특집 : 당뇨병 및 내분비질환분야 머신러닝 활용," 당뇨병(JKD), Vol.21, No.3, pp.130-139, 2020.
2 W. Seo, Y. B. Lee, S. Lee, S. M. Jin, S. M. Park, "A machinelearning approach to predict postprandial hypoglycemia," BMC Med Inform Decis Mak, Vol.19, p.210, 2019.   DOI
3 조우균, "한국 남자 당뇨환자의 식품 섭취 실태 조사," 韓國食品營養學會誌, Vol.6, No.3, pp.143-157, 1993.
4 홍준호, 오민지, 조용빈, 이경희, 조완섭, "다차원 데이터의 군집분석을 위한 차원축소 방법: 주성분분석 및 요인분석 비교," 학술지명 삽입, Vol.5, No.2, pp.135-143, 2020.
5 N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of artificial intelligence research, Vol.16, pp.321-357, 2002.   DOI
6 B. Annie, P. Ann-Marie, R. Iwona, L. Simone, and C. Patrick, and V. Marie-Claude, "Associations between dietary patterns and gene expression profiles of healthy men and women: a cross-sectional study," Nutrition journal, Vol.1, No.12, pp.1-13, 2013.
7 정미미, 엄한주, "Two-way ANOVA 분석절차 및 사후검증방법의 이해," 한국체육측정평가학회지, Vol.13, No.2, pp.1-15, 2011.   DOI
8 Safari, Shariff, Kandiah, Rashidkhani, Fereidooni, "Dietary patterns and risk of colorectal cancer in Tehran Province: a case-control study," BMC Public Health, Vol.13, No.222, 2013. https://doi.org/10.1186/1471-2458/13/222
9 유소영, 홍혜숙, 이현숙, 최영주, 허갑범, 김화영, "제2형 당뇨병 환자에서 인슐린저항성과 심혈관질환 위험요인 및 식이요인과의 관계," 한국영양학회지, Vol.40, No.1, pp.31-40, 2007.
10 Andreas C. Muller and Sarah Guido, "Introduction to machine learning with Python: a guide for data scientists," O'Reilly Media, Inc, 2016.
11 홍준호, 머신러닝과 통계적 방법을 이용한 대사성 질환과 식품 소비와의 관계성 연구, 충북대학교, 석사학위논문, 2022.
12 T. Mizoue, T. Yamaji, S. Tabata, K. Yamaguchi, S. Ogawa, M. Mineshita, and S. Kono, "Dietary patterns and glucose tolerance abnormalities in Japanese men," The Journal of nutrition, Vol.136, No.5, pp.1352-1358, 2006.   DOI
13 원진기, 한두봉, "식품영양표시와 운동이 고혈압 집단의 식생활패턴에 미친 영향," 농촌경제, Vol.42, No.3, pp.55-84, 2019.   DOI
14 강지혜, 유리나, "비만성 염증/대사질환 제어를 위한 기능성 식품성분의 활용 가능성," 대한비만학회지, Vol.21, No.3, pp.132-139, 2012.
15 Czekajlo, Rozanska, Zatonska, Szuba, Regulska-Ilow, "Association between dietary patterns and metabolic syndrome in the selected population of Polish adults-results of the pure Poland study," European journal of public health, Vol.2, No.29, pp.335-340, 2019.
16 B. G. Choi, S. W. Rha, S. W. Kim, J. H. Kang, J. Y. Park, and Y. K. Noh, "Machine learning for the prediction of new-onset diabetes mellitus during 5-year follow-up in non-diabetic patients with cardiovascular risks," Yonsei Med J, Vol.60, pp.191-199, 2019.   DOI