• Title/Abstract/Keywords: Decision Tree Based Machine Learning

Predictive of Osteoporosis by Tree-based Machine Learning Model in Post-menopause Woman

  • 이인자;이준호
    • 대한방사선기술학회지:방사선기술과학
    • /
    • Vol. 43, No. 6
    • /
    • pp.495-502
    • /
    • 2020
  • This study predicted the prevalence of osteoporosis from 10 independent variables, such as age, weight, and alcohol consumption, using four tree-based machine learning models, and compared the performance of each model. The best-performing model was then re-evaluated while independent variables were removed, with the area under the curve (AUC) used as the performance measure. The AUC was 0.663 for the decision tree, 0.704 for random forest, 0.702 for GBM, and 0.710 for XGBoost, and the most important variables were, in order, age, weight, and family history. With XGBoost, the best-performing model, removing independent variables gave the highest AUC, 0.750, with 7 independent variables. These results suggest that the method can be applied to predicting not only osteoporosis but also various other diseases. It is also expected to serve as basic data for big data research in the health care field.
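
The area under the ROC curve that this abstract reports can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The following is a minimal sketch of that pairwise definition; the function name `roc_auc` and the toy labels and scores are illustrative assumptions, not data or code from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) definition."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    # Count positive/negative pairs that are ranked correctly; ties count as half.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data, not taken from the study.
y_true = [0, 0, 1, 1]
y_score = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(y_true, y_score))  # 0.75
```

The pairwise form is quadratic in the number of samples, so it suits small illustrations; library implementations typically sort the scores instead.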

Sales Forecasting Model for Apparel Products Using Machine Learning Technique - A Case Study on Forecasting Outerwear Items -

  • 채진미;김은희
    • 한국의류산업학회지
    • /
    • Vol. 23, No. 4
    • /
    • pp.480-490
    • /
    • 2021
  • Sales forecasting is crucial for many retail operations. For apparel retailers, an accurate sales forecast for the next season is critical to managing inventory properly and planning supply chains. This is especially challenging because apparel products are always new for the next season and have numerous variations, short life cycles, long lead times, and seasonal trends. In this study, a sales forecasting model is proposed for apparel products using machine learning techniques. Four years of sales data on outerwear items were collected from a Korean sports brand and filtered to remove outliers. Subsequently, the data were standardized by removing the effects of exogenous variables. The sales patterns of outerwear items were clustered by applying K-means clustering, and the outerwear attributes associated with each sales-pattern type were determined by using a decision tree classifier. Six types of sales pattern clusters were derived and classified using a hybrid model of clustering and a decision tree algorithm, and finally, the relationship between outerwear attributes and sales patterns was revealed. Each sales pattern can be used to predict stock-keeping-unit-level sales based on item attributes.

Default Prediction of Automobile Credit Based on Support Vector Machine

  • Chen, Ying;Zhang, Ruirui
    • Journal of Information Processing Systems
    • /
    • Vol. 17, No. 1
    • /
    • pp.75-88
    • /
    • 2021
  • The automobile credit business has developed rapidly in recent years, and defaults have become correspondingly frequent. Credit default brings great losses to automobile financial institutions, so successfully predicting automobile credit default is of great significance. First, records with missing values are deleted; then random forest is used for feature selection, and the sample data are randomly grouped. Finally, six prediction models are constructed: support vector machine (SVM), random forest, k-nearest neighbor (KNN), logistic regression, decision tree, and artificial neural network (ANN). The results show that all six machine learning models can be used to predict automobile credit default. Among them, the decision tree has the highest accuracy, 0.79, but SVM has the best overall performance. Random grouping also improves the efficiency of model operation to a certain extent, especially for SVM.

Re-anonymization Technique for Dynamic Data Using Decision Tree Based Machine Learning

  • 김영기;홍충선
    • 정보과학회 논문지
    • /
    • Vol. 44, No. 1
    • /
    • pp.21-26
    • /
    • 2017
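  • With the introduction of new technologies such as the Internet of Things, cloud computing, and big data, the variety and volume of data being processed have grown, and the security issue of leaking individuals' sensitive information has become increasingly important. To protect sensitive information, anonymization techniques are used that delete part of the personal information contained in the data, or transform it into an unrecognizable form, before the data are released or distributed. However, existing methods that perform anonymization by arranging the generalization levels of quasi-identifiers into a hierarchy require a higher generalization level whenever records are added to or deleted from the data table and k-anonymity is no longer satisfied. The resulting information loss is unavoidable and reduces the utility of the data. This paper therefore proposes an anonymization technique that applies decision tree based machine learning to minimize the information loss of existing anonymization methods and thereby improve data utility.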
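
The k-anonymity condition mentioned in this abstract requires every combination of quasi-identifier values to be shared by at least k records. The sketch below checks that condition and shows how adding a single record can break it, which is the situation the abstract describes for dynamic data. The function name, column names, and table contents are hypothetical, and the sketch does not implement the paper's decision tree based method.

```python
from collections import Counter


def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every quasi-identifier combination occurs at least k times."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())


# Hypothetical, already-generalized table; column names are illustrative.
table = [
    {"age": "20-29", "zip": "130**", "disease": "flu"},
    {"age": "20-29", "zip": "130**", "disease": "cold"},
    {"age": "30-39", "zip": "148**", "disease": "flu"},
    {"age": "30-39", "zip": "148**", "disease": "asthma"},
]
print(is_k_anonymous(table, ["age", "zip"], k=2))  # True

# One added record forms a group of size 1, so 2-anonymity no longer holds.
table.append({"age": "40-49", "zip": "130**", "disease": "flu"})
print(is_k_anonymous(table, ["age", "zip"], k=2))  # False
```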

An Attribute Weighting Approach for Naive Bayesian based on Very Fast Decision Tree

  • 김세준;유승언;이병준;김경태;윤희용
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회, Proceedings of the 58th Summer Conference (2018), Vol. 26, No. 2
    • /
    • pp.139-140
    • /
    • 2018
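  • This paper proposes a new technique that assigns weights to data attributes in order to improve the classification accuracy of the Naive Bayesian (NB) algorithm, a supervised machine learning algorithm. A method that assigns weights using the depth of a Decision Tree (DT) has been proposed previously, but because building a DT incurs a large overhead, it is difficult to apply to real-time data analysis or resource-constrained environments. To address this, the paper proposes a weighting technique based on the Very Fast Decision Tree (VFDT) algorithm, which builds a DT quickly from a minimal amount of data, thereby improving NB accuracy with little overhead.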
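
For orientation, one common way to write an attribute-weighted naive Bayes decision rule is shown below. It is a general form, not necessarily the exact rule used in this paper: the symbols $w_i$ (weight of attribute $i$), $x_i$ (value of attribute $i$), and $c$ (class) are assumptions introduced here, and how each $w_i$ is derived from the tree is not specified in the abstract.

```latex
\hat{c} = \arg\max_{c} \left[ \log P(c) + \sum_{i=1}^{n} w_i \log P(x_i \mid c) \right]
```

Setting every $w_i = 1$ recovers standard naive Bayes, and $w_i = 0$ removes attribute $i$ from the decision.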

Machine Learning based Seismic Response Prediction Methods for Steel Frame Structures

  • 이승혜;이재홍
    • 한국공간구조학회논문집
    • /
    • Vol. 24, No. 2
    • /
    • pp.91-99
    • /
    • 2024
  • In this paper, machine learning models were applied to predict the seismic response of steel frame structures. Both geometric and material nonlinearities were considered in the structural analysis, and nonlinear inelastic dynamic analysis was performed. The ground acceleration response of the El Centro earthquake was applied to obtain the displacement of the top floor, which was used as the dataset for the machine learning methods. Learning was performed using two methods: Decision Tree and Random Forest, and their efficiency was demonstrated through application to 2-story and 6-story 3-D steel frame structure examples.

A Comparative Study on Collision Detection Algorithms based on Joint Torque Sensor using Machine Learning

  • 조성현;권우경
    • 로봇학회논문지
    • /
    • Vol. 15, No. 2
    • /
    • pp.169-176
    • /
    • 2020
  • This paper studies collision detection for robot manipulators to enable safe collaboration in human-robot interaction. In sensor-based collision detection, the external torque is isolated by subtracting the robot dynamics from the joint torque measurement. To detect collisions from joint torque sensor data, a comparative study was conducted using data-driven machine learning algorithms. Data were collected from a real 3-degree-of-freedom (DOF) robot manipulator and labeled by thresholding and by hand. Using support vector machine (SVM), decision tree, and k-nearest neighbors (KNN) methods, the optimal parameters of each algorithm were derived and the collision classification performance was compared. The simulation results were analyzed for each method, and a collision-status detection model with high prediction accuracy was confirmed.
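
The threshold-based labeling step in this abstract can be illustrated with a simple residual rule: estimate the external torque at each joint as the measured torque minus the torque predicted by the dynamics model, then flag a collision when any joint exceeds a threshold. This is only a sketch under stated assumptions. The function name, torque values, and threshold are hypothetical, and the paper's actual classifiers (SVM, decision tree, KNN) are not reproduced here.

```python
def detect_collision(measured_torque, model_torque, threshold):
    """Flag a collision when any joint's external torque estimate exceeds the threshold.

    Returns a (collision_flag, external_torque_estimates) pair.
    """
    external = [m - d for m, d in zip(measured_torque, model_torque)]
    return any(abs(t) > threshold for t in external), external


# Hypothetical readings for a 3-DOF arm (units assumed to be N*m).
free_motion, _ = detect_collision([1.2, 0.5, -0.3], [1.0, 0.5, -0.25], threshold=0.5)
contact, _ = detect_collision([2.0, 0.5, -0.3], [1.0, 0.5, -0.25], threshold=0.5)
print(free_motion, contact)  # False True
```

Labels produced this way could serve as training targets for a classifier, which is one plausible reading of how threshold labeling and machine learning are combined in the study.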

The Evaluation of a Plastic Material Classification System using Near Field IR (NIR) Spectrum and Decision Tree based Machine Learning

  • 국중진
    • 반도체디스플레이기술학회지
    • /
    • Vol. 21, No. 3
    • /
    • pp.92-97
    • /
    • 2022
  • Plastics are classified into 7 types, namely PET (PETE), HDPE, PVC, LDPE, PP, PS, and Other, for separation and recycling. Recently, large corporations advocating ESG management have been replacing them with bioplastics. Disposing of plastic waste by incineration and landfill contributes to air pollution and destruction of the ecosystem. Because it is not easy to classify plastic materials accurately with the naked eye, studies on automated screening systems using various sensor technologies and AI-based software have been conducted. This paper presents an NIR scanning device that exploits the NIR wavelength characteristics, which differ for each plastic material, and a system that identifies the type of plastic by learning from the NIR spectrum data collected with the device. The accuracy of plastic material identification was evaluated using a decision tree-based SVM model for multiclass classification on NIR spectral datasets for 8 types of plastic samples, including biodegradable plastic.

Machine Learning Approach to Classifying Fatal and Non-Fatal Accidents in Industries

  • 강성식;장성록;서용윤
    • 한국안전학회지
    • /
    • Vol. 36, No. 5
    • /
    • pp.52-60
    • /
    • 2021
  • As the prevention of fatal accidents is considered an essential part of social responsibility, both governments and individuals have devoted efforts to mitigating the unsafe conditions and behaviors that facilitate accidents. Several studies have analyzed the factors that cause fatal accidents and compared them to those of non-fatal accidents. However, studies on mathematical and systematic analysis techniques for identifying the features of fatal accidents are rare. Recently, various industrial fields have employed machine learning algorithms. This study aimed to apply machine learning algorithms to the classification of fatal and non-fatal accidents based on the features of each accident. These features were obtained by text mining the literature on accidents. The classification was performed using four machine learning algorithms that are widely used in industrial fields: logistic regression, decision tree, neural network, and support vector machine. The results revealed that the machine learning algorithms classified accidents into the two categories with high accuracy. In addition, the importance of comparing similar cases between fatal and non-fatal accidents was discussed. This study presented a method for classifying accidents using machine learning algorithms based on reports from previous studies on accidents.

Classification of Brain Magnetic Resonance Images using 2 Level Decision Tree Learning

  • 김형일;김용욱
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • Vol. 34, No. 1
    • /
    • pp.18-29
    • /
    • 2007
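  • This paper proposes an image classification system that applies the decision tree algorithm at two levels to classify brain magnetic resonance images. Two kinds of information can be obtained from an image. One consists of low-level features that can be obtained directly from the image, such as size, color, texture, and contour; the other consists of high-level features that can be obtained by interpreting the segmented images, such as the presence or absence of a specific object and the spatial relationships among regions. To classify images by meaning, learning and classification must be based on high-level features. The proposed system applies decision tree learning to each level separately and extracts high-level features from the low-level classification results. Classification experiments were carried out on a set of brain magnetic resonance images containing tumors, and several experimental results confirmed the effectiveness of the proposed system.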