• 제목/요약/키워드: decision tree

검색결과 1,629건 처리시간 0.024초

이명증에 대한 보중익기탕과 반하백출천마탕의 비용효과 분석 연구 (Cost-effectiveness Analysis of Bojungikgitang and Banhabaekchulchonmatang in Chronic Tinnitus Patients)

  • 김남권;오용열;서은성;이동효
    • 한방안이비인후피부과학회지
    • /
    • 제23권1호
    • /
    • pp.260-269
    • /
    • 2010
  • Background : Bojungikgitang(BJT) and Banhabaekchulchonmatang(BBT) are known to treat the tinnitus patients, which were registered Korean National Health Insurance coverage lists. Objective : Few studies have evaluated economic benefits of both herbal medicines. This research is to investigate the cost-effectiveness of Bojungikgitang(BJT) and Banhabaekchulchonmatang(BBT) in chronic tinnitus patients over nineteen years old. Method : We built the decision tree model of chronic tinnitus and executed the deterministic analysis and threshold sensitivity analysis based on randomized clinical trial. Effectiveness was measured in quality-adjusted life-years(QALYs), and costs were in 2009 KRW(South Korean Currency). The perspective is societal, time horizon is 10 weeks, and Korean willingness to pay threshold is assumed to 20,000,000KRW. Results : In the base case analysis, BJT treatment resulted is better outcomes as low cost, so BJT is dominant medicine and BBT is dominated. But both cost per QALYs (BJT is 3,120,339KWN per QALY, BBT is 3,505,780KWN per QALY) are lower than the threshold, that could be covered by Korean National Health Insurance(KNHI). Conclusion : This study results showed that BJT was more cost-effective than BBT treating tinnitus patients for 10 weeks, and the cost per QALYs of both alternatives were lower than Korean national threshold.

A QoS-aware Web Services Selection for Reliable Web Service Composition

  • Nasridinov, Aziz;Byun, Jeongyong
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2012년도 춘계학술발표대회
    • /
    • pp.586-589
    • /
    • 2012
  • Web Services have been utilized in a wide variety of applications and have turned into a key technology in developing business operations on the Web. Originally, Web Services can be exploited in an isolated form, however when no single Web Service can satisfy the functionality required by a user, there should be a possibility to compose existing services together in order to fulfill the user requirement. However, since the same service may be offered by different providers with different non-functional Quality of Service (QoS), the task of service selection for Web Service composition is becoming complicated. Also, as Web Services are inherently unreliable, how to deliver reliable Web Services composition over unreliable Web Services should be considered while composing Web Services. In this paper, we propose an approach on a QoS-aware Web Service selection for reliable Web Service composition. In our approach, we select and classify Web Services using Decision Tree based on QoS attributes provided by the client. Service classifier will improve selection of relevant Web Services early in the composition process and also provide flexibility to replace a failed Web Services with a redundant alternative Web Services, resulting in high availability and reliability of Web Service composition. We will provide an implementation of our proposed approach along with efficiency measurements through performance evaluation.

입력자료 군집화에 따른 앙상블 머신러닝 모형의 수질예측 특성 연구 (The Effect of Input Variables Clustering on the Characteristics of Ensemble Machine Learning Model for Water Quality Prediction)

  • 박정수
    • 한국물환경학회지
    • /
    • 제37권5호
    • /
    • pp.335-343
    • /
    • 2021
  • Water quality prediction is essential for the proper management of water supply systems. Increased suspended sediment concentration (SSC) has various effects on water supply systems such as increased treatment cost and consequently, there have been various efforts to develop a model for predicting SSC. However, SSC is affected by both the natural and anthropogenic environment, making it challenging to predict SSC. Recently, advanced machine learning models have increasingly been used for water quality prediction. This study developed an ensemble machine learning model to predict SSC using the XGBoost (XGB) algorithm. The observed discharge (Q) and SSC in two fields monitoring stations were used to develop the model. The input variables were clustered in two groups with low and high ranges of Q using the k-means clustering algorithm. Then each group of data was separately used to optimize XGB (Model 1). The model performance was compared with that of the XGB model using the entire data (Model 2). The models were evaluated by mean squared error-ob servation standard deviation ratio (RSR) and root mean squared error. The RSR were 0.51 and 0.57 in the two monitoring stations for Model 2, respectively, while the model performance improved to RSR 0.46 and 0.55, respectively, for Model 1.

정교한 데이터 분류를 위한 방법론의 고찰 (A Review of the Methodology for Sophisticated Data Classification)

  • 김승재;김성환
    • 통합자연과학논문집
    • /
    • 제14권1호
    • /
    • pp.27-34
    • /
    • 2021
  • 전 세계적으로 인공지능(AI)을 구현하려는 움직임이 많아지고 있다. AI구현에서는 많은 양의 데이터, 목적에 맞는 데이터의 분류 등 데이터의 중요성을 뺄 수 없다. 이러한 데이터를 생성하고 가공하는 기술에는 사물인터넷(IOT)과 빅데이터(Big-data) 분석이 있으며 4차 산업을 이끌어 가는 원동력이라 할 수 있다. 또한 이러한 기술은 국가와 개인 차원에서 많이 활용되고 있으며, 특히나 특정분야에 집결되는 데이터를 기준으로 빅데이터 분석에 활용함으로써 새로운 모델을 발견하고, 그 모델로 새로운 값을 추론하고 예측함으로써 미래비전을 제시하려는 시도가 많아지고 있는 추세이다. 데이터 분석을 통한 결론은 데이터가 가지고 있는 정보의 정확성에 따라 많은 변화를 가져올 수 있으며, 그 변화에 따라 잘못된 결과를 발생시킬 수도 있다. 이렇듯 데이터의 분석은 데이터가 가지는 정보 또는 분석 목적에 맞는 데이터 분류가 매우 중요하다는 것을 알 수 있다. 또한 빅데이터 분석결과 통계량의 신뢰성과 정교함을 얻기 위해서는 각 변수의 의미와 변수들 간의 상관관계, 다중공선성 등을 고려하여 분석해야 한다. 즉, 빅데이터 분석에 앞서 분석목적에 맞도록 데이터의 분류가 잘 이루어지도록 해야 한다. 이에 본 고찰에서는 AI기술을 구현하는 머신러닝(machine learning, ML) 기법에 속하는 분류분석(classification analysis, CA) 중 의사결정트리(decision tree, DT)기법, 랜덤포레스트(random forest, RF)기법, 선형분류분석(linear discriminant analysis, LDA), 이차선형분류분석(quadratic discriminant analysis, QDA)을 이용하여 데이터를 분류한 후 데이터의 분류정도를 평가함으로써 데이터의 분류 분석률 향상을 위한 방안을 모색하려 한다.

중학교 여학생의 스마트폰 장시간 사용 관련요인 및 고위험군 특성 (The Factors related to Long Hours of Smartphone Usage and the Characteristics of High-risk Group in Female Middle School Students)

  • 박성희;이지선
    • 한국학교보건학회지
    • /
    • 제31권3호
    • /
    • pp.135-145
    • /
    • 2018
  • Purpose: The study aimed to investigate the factors associated with long hours of smartphone usage and to identify the characteristics of the high-risk group among female middle school students in South Korea. Methods: The study analyzed the data of 13,648 female middle school students using their own smartphone extracted from the 13th Youth Health Behavior Online Survey (2017). The factors related to using smartphones for a long time was analyzed by binomial logistic regression. The characteristics of the high-risk group was defined by a decision tree analysis. Results: The average hours spent on smartphone usage was 269.54 minutes per day. The significant factors associated with the long hours of smartphone usage were grade, living with parents, perceived household economic status, perceived academic achievement, stress, sadness and hopelessness, the main purpose of smartphone usage, drinking, body mass index, breakfast, and satisfaction with sleep quality. The subjects showing low academic performance and having breakfast four times a week or less were more likely to use their smartphone for a long time. Conclusion: Based on the results of the research, we need to establish intervention strategies focusing on the factors influencing long-time usage of smartphone. Particularly, the subjects who show poor academic performance and skip breakfast frequently should be considered as the high-risk group for spending long hours on smartphone usage.

A Review of Machine Learning Algorithms for Fraud Detection in Credit Card Transaction

  • Lim, Kha Shing;Lee, Lam Hong;Sim, Yee-Wai
    • International Journal of Computer Science & Network Security
    • /
    • 제21권9호
    • /
    • pp.31-40
    • /
    • 2021
  • The increasing number of credit card fraud cases has become a considerable problem since the past decades. This phenomenon is due to the expansion of new technologies, including the increased popularity and volume of online banking transactions and e-commerce. In order to address the problem of credit card fraud detection, a rule-based approach has been widely utilized to detect and guard against fraudulent activities. However, it requires huge computational power and high complexity in defining and building the rule base for pattern matching, in order to precisely identifying the fraud patterns. In addition, it does not come with intelligence and ability in predicting or analysing transaction data in looking for new fraud patterns and strategies. As such, Data Mining and Machine Learning algorithms are proposed to overcome the shortcomings in this paper. The aim of this paper is to highlight the important techniques and methodologies that are employed in fraud detection, while at the same time focusing on the existing literature. Methods such as Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), naïve Bayesian, k-Nearest Neighbour (k-NN), Decision Tree and Frequent Pattern Mining algorithms are reviewed and evaluated for their performance in detecting fraudulent transaction.

사망사고와 부상사고의 산업재해분류를 위한 기계학습 접근법 (Machine Learning Approach to Classifying Fatal and Non-Fatal Accidents in Industries)

  • 강성식;장성록;서용윤
    • 한국안전학회지
    • /
    • 제36권5호
    • /
    • pp.52-60
    • /
    • 2021
  • As the prevention of fatal accidents is considered an essential part of social responsibilities, both government and individual have devoted efforts to mitigate the unsafe conditions and behaviors that facilitate accidents. Several studies have analyzed the factors that cause fatal accidents and compared them to those of non-fatal accidents. However, studies on mathematical and systematic analysis techniques for identifying the features of fatal accidents are rare. Recently, various industrial fields have employed machine learning algorithms. This study aimed to apply machine learning algorithms for the classification of fatal and non-fatal accidents based on the features of each accident. These features were obtained by text mining literature on accidents. The classification was performed using four machine learning algorithms, which are widely used in industrial fields, including logistic regression, decision tree, neural network, and support vector machine algorithms. The results revealed that the machine learning algorithms exhibited a high accuracy for the classification of accidents into the two categories. In addition, the importance of comparing similar cases between fatal and non-fatal accidents was discussed. This study presented a method for classifying accidents using machine learning algorithms based on the reports on previous studies on accidents.

Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries

  • Otoom, Mohammad Mahmood
    • International Journal of Computer Science & Network Security
    • /
    • 제21권1호
    • /
    • pp.220-225
    • /
    • 2021
  • Human population growth rate is an important parameter for real-world planning. Common approaches rely upon fixed parameters like human population, mortality rate, fertility rate, which is collected historically to determine the region's population growth rate. Literature does not provide a solution for areas with no historical knowledge. In such areas, machine learning can solve the problem, but a multitude of machine learning algorithm makes it difficult to determine the best approach. Further, the missing feature is a common real-world problem. Thus, it is essential to compare and select the machine learning techniques which provide the best and most robust in the presence of missing features. This study compares 17 machine learning techniques (base learners and ensemble learners) performance in predicting the human population growth rate of the country. Among the 17 machine learning techniques, random forest outperformed all the other techniques both in predictive performance and robustness towards missing features. Thus, the study successfully demonstrates and compares machine learning techniques to predict the human population growth rate in settings where historical data and feature information is not available. Further, the study provides the best machine learning algorithm for performing population growth rate prediction.

Risk Factors for Sarcopenia, Sarcopenic Obesity, and Sarcopenia Without Obesity in Older Adults

  • Kim, Seo-hyun;Yi, Chung-hwi;Lim, Jin-seok
    • 한국전문물리치료학회지
    • /
    • 제28권3호
    • /
    • pp.177-185
    • /
    • 2021
  • Background: Muscle undergoes change continuously with aging. Sarcopenia, in which muscle mass decrease with aging, is associated with various diseases, the risk of falling, and the deterioration of quality of life. Obesity and sarcopenia also have a synergy effect on the disease of the older adults. Objects: This study examined the risk factors for sarcopenia, sarcopenic obesity, and sarcopenia without obesity and developed prediction models. Methods: This machine-learning study used the 2008-2011 Korea National Health and Nutrition Examination Surveys in the analysis. After data curation, 5,563 older participants were selected, of whom 1,169 had sarcopenia, 538 had sarcopenic obesity, and 631 had sarcopenia without obesity; the remaining 4,394 were normal. Decision tree and random forest models were used to identify risk factors. Results: The risk factors for sarcopenia chosen by both methods were body mass index (BMI) and duration of moderate physical activity; those for sarcopenic obesity were sex, BMI, and duration of moderate physical activity; and those for sarcopenia without obesity were BMI and sex. The areas under the receiver operating characteristic curves of all prediction models exceeded 0.75. BMI could predict sarcopenia-related disease. Conclusion: Risk factors for sarcopenia-related diseases should be identified and programs for sarcopenia-related disease prevention should be developed. Data-mining research using population data should be conducted to enhance the effectiveness of early treatment for people with sarcopenia-related diseases through predictive models.

머신러닝(Machine Learning) 기법을 활용한 제주국제공항의 운항 지연과의 상관관계 분석 및 지연 여부 예측모형 개발 - 기상을 중심으로 - (Development of a Prediction Model and Correlation Analysis of Weather-induced Flight Delay at Jeju International Airport Using Machine Learning Techniques)

  • 이충섭;;여혜민;김동신;백호종
    • 한국항공운항학회지
    • /
    • 제29권4호
    • /
    • pp.1-20
    • /
    • 2021
  • Due to the recent rapid increase in passenger and cargo air transport demand, the capacity of Jeju International Airport has been approaching its limit. Even though in COVID-19 crisis which has started from Nov 2019, Jeju International Airport still suffers from strong demand in terms of air passenger and cargo transportation. However, it is an undeniable fact that the delay has also increased in Jeju International Airport. In this study, we analyze the correlation between weather and delayed departure operation based on both datum collected from the historical airline operation information and aviation weather statistics of Jeju International Airport. Adopting machine learning techniques, we then analyze weather condition Jeju International Airport and construct a delay prediction model. The model presented in this study is expected to play a useful role to predict aircraft departure delay and contribute to enhance aircraft operation efficiency and punctuality in the Jeju International Airport.