Browse > Article
http://dx.doi.org/10.3745/KTSDE.2021.10.9.359

Classifying Sub-Categories of Apartment Defect Repair Tasks: A Machine Learning Approach  

Kim, Eunhye (성균관대학교 데이터사이언스융합학과)
Ji, HongGeun (성균관대학교 인공지능융합학과)
Kim, Jina (성균관대학교 인터랙션사이언스)
Park, Eunil (성균관대학교 인공지능융합학과)
Ohm, Jay Y. (한국과학기술원 기술경영전문대학원)
Publication Information
KIPS Transactions on Software and Data Engineering / v.10, no.9, 2021 , pp. 359-366 More about this Journal
Abstract
A number of construction companies in Korea invest considerable human and financial resources to construct a system for managing apartment defect data and for categorizing repair tasks. Thus, this study proposes machine learning models to automatically classify defect complaint text-data into one of the sub categories of 'finishing work' (i.e., one of the defect repair tasks). In the proposed models, we employed two word representation methods (Bag-of-words, Term Frequency-Inverse Document Frequency (TF-IDF)) and two machine learning classifiers (Support Vector Machine, Random Forest). In particular, we conducted both binary- and multi- classification tasks to classify 9 sub categories of finishing work: home appliance installation work, paperwork, painting work, plastering work, interior masonry work, plaster finishing work, indoor furniture installation work, kitchen facility installation work, and tiling work. The machine learning classifiers using the TF-IDF representation method and Random Forest classification achieved more than 90% accuracy, precision, recall, and F1 score. We shed light on the possibility of constructing automated defect classification systems based on the proposed machine learning models.
Keywords
Apartment; Defect; Repair Tasks; Sub Category; Finishing Works; Machine Learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Glemaitre. Imbalanced-Learn [Internet], https://github.com/scikit-learn-contrib/imbalanced-learn/tree/master/imblearn.
2 W. McKinney, "Data structures for statistical computing in python," Proceedings of the 9th Python in Science Conference, Vol.445, 2010.
3 Kim, Daenyeon, Housing Survey Statistical Report (2019) [Internet], http://stat.molit.go.kr/portal/cate/statFileView.do?hRsId=327&hFormId=
4 D. S. Watt, "Building pathology: Principles and practice," John Wiley & Sons, 2009.
5 Housing Construction Supply Division, Apartment Defect Dispute Mediation Committee [Internet] http://www.adc.go.kr
6 Jin, Dongyeong, Apartment defect application, 62 times' explosion' in 10 years [Internet], https://www.sedaily.com/NewsView/1Z8YOBPOY1.
7 The Housing Policy Division. Housing act [Internet], https://www.law.go.kr/LSW/eng/engLsSc.do?menuId=2§ion=lawNm&query=16006.
8 T. Joachims, "A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization," Carnegie-mellon Univ Pittsburgh Pa Dept of Computer Science, 1996.
9 S. H. Lee, S. H. Lee, and J. J. Kim, "Evaluating the impact of defect risks in residential buildings at the occupancy phase," Sustainability, Vol.10, No.12, pp.4466, 2018.   DOI
10 S. Y. Park, Y. H. Ahn, and S. H. Lee, "Analyzing the finishing works service life pattern of public housing in South Korea by probabilistic approach," Sustainability, Vol.10, No.12, pp.4469, 2018.   DOI
11 F. Pedregosa, et al., "Scikit-learn: Machine learning in Python," The Journal of Machine Learning Research, Vol.12, pp.2825-2830, 2011.
12 B. Kim, Y. H. Ahn, and S. H. Lee, "LDA-based model for defect management in residential buildings," Sustainability, Vol.11, No.24, pp.7201, 2019.   DOI