• Title/Summary/Keyword: Decision Tree analysis

Search Result 725, Processing Time 0.031 seconds

Computational Methods for Traditional Korean Medicine : A survey (한의 정보의 계산적 방법 조사)

  • Kim, Sang-Kyun;Jang, Hyun-Chul;Kim, Jin-Hyun;Kim, Chul;Yea, Sang-Jun;Song, Mi-Young
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.25 no.5
    • /
    • pp.894-899
    • /
    • 2011
  • Traditional Korean Medicine (TKM) has been actively researched through various approaches, including computational methods. This paper aims at providing an overview of domestic studies using the computational techniques in TKM field. A literature search was conducted in Korean publications using OASIS system, and major studies of data mining in TKM were identified. A review was presented in six diagnosis fields, including sasang constitution diagnosis, eight constitution diagnosis, tongue diagnosis, pattern diagnosis for stroke, diagnosis based on ontology, diagnosis for cause of disease. They collect clinical data themselves for experiments and primarily applied a algorithm of decision tree, SVM, neural network, case-based reasoning, ontology reasoning, discriminant analysis. In the future, there needs to identify which algorithm is suitable to diagnosis or other fields of TKM.

Design and Implementation of a User Activity Auto-recognition System based on Multimodal Sensor in Ubiquitous Computing Environment (유비쿼터스 컴퓨팅환경에서의 Multimodal Sensor 기반의 Health care를 위한 사용자 행동 자동인식 시스템 - Multi-Sensor를 이용한 ADL(activities of daily living) 지수 자동 측정 시스템)

  • Byun, Sung-Ho;Jung, Yu-Suk;Kim, Tae-Su;Kim, Hyun-Woo;Lee, Seung-Hwan;Cho, We-Duke
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.21-26
    • /
    • 2009
  • A sensor system capable of automatically recognize activities would allow many potential Ubiquitous applications. This paper presents a new system for recognizing the activities of daily living(ADL) like walking, running, standing, sitting, lying etc. The system based on the state-dependent motion analysis using Tri-Accelerometer and Zigbee tag. Two accelerometers are used for the classification of body and hand activities. Classification of the environment and instrumental activities is performed based on the hand interaction with an object ID using.

  • PDF

A Machine Learning-based Customer Classification Model for Effective Online Free Sample Promotions (온라인 무료 샘플 판촉의 효과적 활용을 위한 기계학습 기반 고객분류예측 모형)

  • Won, Ha-Ram;Kim, Moo-Jeon;Ahn, Hyunchul
    • The Journal of Information Systems
    • /
    • v.27 no.3
    • /
    • pp.63-80
    • /
    • 2018
  • Purpose The purpose of this study is to build a machine learning-based customer classification model to promote customer expansion effect of the free sample promotion. Specifically, the proposed model classifies potential target customers who are expected to purchase the products included in the free sample promotion after receiving the free samples. Design/methodology/approach This study proposes to build a customer classification model for determining customers suitable for providing free samples by using various machine learning techniques such as logistic regression, multiple discriminant analysis, case-based reasoning, decision tree, artificial neural network, and support vector machine. To validate the usefulness of the proposed model, we apply it to a real-world free sample-based target marketing case of a Korean major cosmetic retail company. Findings Experimental results show that a machine learning-based customer classification model presents satisfactory accuracy ranging from 70% to 75%. In particular, support vector machine is found to be the most effective machine learning technique for free sample-based target marketing model. Our study sheds a light on customer relationship management strategies using free sample promotions.

Developing the administrative model using the data mining technique for injury in National Health Insurance (데이터마이닝 기법을 활용한 국민건강보험 상해상병 관리모형 개발)

  • Park, Il-Su;Han, Jun-Tae;Sohn, Hae-Sook;Kang, Suk-Bok
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.3
    • /
    • pp.467-476
    • /
    • 2011
  • We developed the hybrid model coupled with predictive model and business rule model for administration of injury by utilizing medical data of the National Health Insurance in Korea. We performed decision tree analysis using data mining methodology and used SAS Enterprise Miner 4.1. We also investigated under several business rule for benefits (expense paid by insurer) and claims of injury in National Health Insurance Corporation. We can see that the proposed hybrid model provides a quite efficient plausible results.

Effective R & D Management using Data Mining Classification Techniques (데이터마이닝 분류기법을 이용한 효과적인 연구관리에 관한 연구)

  • 황석해;문태수;이준한
    • Journal of Information Technology Application
    • /
    • v.3 no.2
    • /
    • pp.1-24
    • /
    • 2001
  • This purpose of this study is to drive important criteria for improving customer relationship of R institute using data mining techniques. The focus of this research is to consider patterns and interactions of research variables from research management database of R institute, and to classify the outside organizations and the inside organizations for research contract organizations, and to decide the directions of customer relationship management through analyzing the research type and research cost of research topics. In order to drive criteria variables through pattern analysis of the research database, decision tree algorithm is employed. The results show that determinant variables of 17 input variables are research period, overhead cost, R & D cost as variables to classify the outside and inside contract organization.

  • PDF

Efficient Acoustic Echo Cancellation System for Distant-Talking Automatic Speech Recognition (원거리 음성 인식을 위한 효율적인 에코제거 시스템)

  • Kim, Ki-Beom;Kim, Sang-Yoon;Lee, Woo-Jung;Kwon, Min-Seok;Ko, Byeong-Seob
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2014.10a
    • /
    • pp.150-155
    • /
    • 2014
  • 본 논문에서는, 원거리 음성인식을 위한 서브밴드 필터링 기반의 빠르고 효율적인 에코제거 시스템을 제안한다. 제안하는 에코제거 시스템은 우선 채널간 유사도 (correlation) 가 높을 경우 적응필터가 오작동하는 것을 방지하기 위해 spatial decorrelation 을 적용하게 된다. 그리고 tree 형태를 가지는 IIR filterbank 기반의 subband 구조를 채택함으로써, 적은 차수로도 효과적인 analysis, synthesis 필터링을 수행할 수 있도록 한다. 이 과정에서 불가피하게 발생하는 서브 밴드간 spectral aliasing은 notch filter를 적용해 해결할 수 있다. 또한 적응 필터로는 improved proportionate normalized least-mean-square (IP-NLMS) 알고리즘을 사용해 수렴속도 및 에코제거 성능에서 우수함을 확인하였다. 마지막으로 decision-directed estimation 기반의 residual echo suppressor를 적용해 잔여 에코를 제거하게 된다. 본 논문에서는 각 단계를 구성하게 된 이론적인 배경을 소개하고, 실제 에코가 존재하는 환경에서 ERLE, 원거리 음성 인식률, computational complexity를 통해 제안하는 에코제거 시스템의 효과를 입증하도록 한다.

  • PDF

SVM based Stock Price Forecasting Using Financial Statements (SVM 기반의 재무 정보를 이용한 주가 예측)

  • Heo, Junyoung;Yang, Jin Yong
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.3
    • /
    • pp.167-172
    • /
    • 2015
  • Machine learning is a technique for training computers to be used in classification or forecasting. Among the various types, support vector machine (SVM) is a fast and reliable machine learning mechanism. In this paper, we evaluate the stock price predictability of SVM based on financial statements, through a fundamental analysis predicting the stock price from the corporate intrinsic values. Corporate financial statements were used as the input for SVM. Based on the results, the rise or drop of the stock was predicted. The SVM results were compared with the forecasts of experts, as well as other machine learning methods such as ANN, decision tree and AdaBoost. SVM showed good predictive power while requiring less execution time than the other machine learning schemes.

The Factors related to Long Hours of Smartphone Usage and the Characteristics of High-risk Group in Female Middle School Students (중학교 여학생의 스마트폰 장시간 사용 관련요인 및 고위험군 특성)

  • Park, Sung Hee;Yi, Jee Seon
    • Journal of the Korean Society of School Health
    • /
    • v.31 no.3
    • /
    • pp.135-145
    • /
    • 2018
  • Purpose: The study aimed to investigate the factors associated with long hours of smartphone usage and to identify the characteristics of the high-risk group among female middle school students in South Korea. Methods: The study analyzed the data of 13,648 female middle school students using their own smartphone extracted from the 13th Youth Health Behavior Online Survey (2017). The factors related to using smartphones for a long time was analyzed by binomial logistic regression. The characteristics of the high-risk group was defined by a decision tree analysis. Results: The average hours spent on smartphone usage was 269.54 minutes per day. The significant factors associated with the long hours of smartphone usage were grade, living with parents, perceived household economic status, perceived academic achievement, stress, sadness and hopelessness, the main purpose of smartphone usage, drinking, body mass index, breakfast, and satisfaction with sleep quality. The subjects showing low academic performance and having breakfast four times a week or less were more likely to use their smartphone for a long time. Conclusion: Based on the results of the research, we need to establish intervention strategies focusing on the factors influencing long-time usage of smartphone. Particularly, the subjects who show poor academic performance and skip breakfast frequently should be considered as the high-risk group for spending long hours on smartphone usage.

Machine Learning Approach to Classifying Fatal and Non-Fatal Accidents in Industries (사망사고와 부상사고의 산업재해분류를 위한 기계학습 접근법)

  • Kang, Sungsik;Chang, Seong Rok;Suh, Yongyoon
    • Journal of the Korean Society of Safety
    • /
    • v.36 no.5
    • /
    • pp.52-60
    • /
    • 2021
  • As the prevention of fatal accidents is considered an essential part of social responsibilities, both government and individual have devoted efforts to mitigate the unsafe conditions and behaviors that facilitate accidents. Several studies have analyzed the factors that cause fatal accidents and compared them to those of non-fatal accidents. However, studies on mathematical and systematic analysis techniques for identifying the features of fatal accidents are rare. Recently, various industrial fields have employed machine learning algorithms. This study aimed to apply machine learning algorithms for the classification of fatal and non-fatal accidents based on the features of each accident. These features were obtained by text mining literature on accidents. The classification was performed using four machine learning algorithms, which are widely used in industrial fields, including logistic regression, decision tree, neural network, and support vector machine algorithms. The results revealed that the machine learning algorithms exhibited a high accuracy for the classification of accidents into the two categories. In addition, the importance of comparing similar cases between fatal and non-fatal accidents was discussed. This study presented a method for classifying accidents using machine learning algorithms based on the reports on previous studies on accidents.

Risk Factors for Sarcopenia, Sarcopenic Obesity, and Sarcopenia Without Obesity in Older Adults

  • Kim, Seo-hyun;Yi, Chung-hwi;Lim, Jin-seok
    • Physical Therapy Korea
    • /
    • v.28 no.3
    • /
    • pp.177-185
    • /
    • 2021
  • Background: Muscle undergoes change continuously with aging. Sarcopenia, in which muscle mass decrease with aging, is associated with various diseases, the risk of falling, and the deterioration of quality of life. Obesity and sarcopenia also have a synergy effect on the disease of the older adults. Objects: This study examined the risk factors for sarcopenia, sarcopenic obesity, and sarcopenia without obesity and developed prediction models. Methods: This machine-learning study used the 2008-2011 Korea National Health and Nutrition Examination Surveys in the analysis. After data curation, 5,563 older participants were selected, of whom 1,169 had sarcopenia, 538 had sarcopenic obesity, and 631 had sarcopenia without obesity; the remaining 4,394 were normal. Decision tree and random forest models were used to identify risk factors. Results: The risk factors for sarcopenia chosen by both methods were body mass index (BMI) and duration of moderate physical activity; those for sarcopenic obesity were sex, BMI, and duration of moderate physical activity; and those for sarcopenia without obesity were BMI and sex. The areas under the receiver operating characteristic curves of all prediction models exceeded 0.75. BMI could predict sarcopenia-related disease. Conclusion: Risk factors for sarcopenia-related diseases should be identified and programs for sarcopenia-related disease prevention should be developed. Data-mining research using population data should be conducted to enhance the effectiveness of early treatment for people with sarcopenia-related diseases through predictive models.