• Title/Summary/Keyword: 의사결정나무 분석 (decision tree analysis)

1D CNN and Machine Learning Methods for Fall Detection (1D CNN과 기계 학습을 사용한 낙상 검출)

  • Kim, Inkyung; Kim, Daehee; Noh, Song; Lee, Jaekoo
    • KIPS Transactions on Software and Data Engineering / v.10 no.3 / pp.85-90 / 2021
  • This paper considers fall detection for older people using individual wearable devices. To design a low-cost wearable device for reliable fall detection, we present a comprehensive analysis of two representative models. One is a machine learning model composed of a decision tree, a random forest, and a Support Vector Machine (SVM). The other is a deep learning model relying on a one-dimensional (1D) Convolutional Neural Network (CNN). We also evaluate the validity of the considered models with respect to the data segmentation, preprocessing, and feature extraction methods applied to the input data. Simulation results verify the efficacy of the deep learning model, which shows improved overall performance.
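
The data segmentation and feature extraction steps mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the window length, overlap, or feature set, so the values below, the function names `sliding_windows` and `window_features`, and the sample signal are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def sliding_windows(signal, size, step):
    """Split a 1D signal into fixed-length, possibly overlapping windows."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, step)]


def window_features(window):
    """Compute simple hand-crafted features for one window."""
    return {
        "mean": mean(window),
        "std": pstdev(window),
        "min": min(window),
        "max": max(window),
        "range": max(window) - min(window),
    }


# Hypothetical accelerometer magnitudes (in g); the spike mimics a fall-like impact.
signal = [1.0, 1.0, 1.1, 0.9, 1.0, 3.2, 0.4, 1.0, 1.0, 1.0]

# Assumed window of 4 samples with 50% overlap (step of 2 samples).
windows = sliding_windows(signal, size=4, step=2)
features = [window_features(w) for w in windows]

for w, f in zip(windows, features):
    print(w, round(f["range"], 2))
```

Feature dictionaries like these would typically feed a decision tree, random forest, or SVM, whereas a 1D CNN would usually take the raw windows as input.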

A Study on Segmentation of Preferred Characteristics of Rural Tourists after COVID-19 Using Decision Tree Analysis (의사결정나무분석을 활용한 코로나19 이후 농촌관광객의 선호 특성 세분화 연구)

  • Seung-Hun Lee
    • Asia-Pacific Journal of Business / v.14 no.1 / pp.411-426 / 2023
  • Purpose - The purpose of this study was to explore and diagnose the characteristics and behavioural patterns of rural tourists after COVID-19 using decision tree analysis to classify and identify key segmentation groups. Design/methodology/approach - The CHAID algorithm was used as the analysis technique for the decision tree. The explanatory variables used in the analysis of each decision tree model were demographic variables and rural tourism usage behaviour and perception variables, and the target variables were the preferences of rural tourists' activities after COVID-19. From the Rural Tourism 2020 survey data, 614 samples with rural tourism experience were extracted and used in the analysis. Findings - The variables that significantly explained the preference for each type of rural tourism activity after COVID-19 were rural tourism safety perception, repeated visits to the region, rural tourism priority activity, rural tourism accommodation experience, gender, age group, marital status, occupation, and education level. Among them, rural tourism safety perception was the most important explanatory variable in each analysis model. Research implications or Originality - Overall, to promote rural tourism, it is necessary to enhance the safety image of rural tourism, strengthen loyalty programs for repeat visitors, and develop customized products that reflect the preferred trends of rural tourism.
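
CHAID chooses splits using chi-square tests of association between each explanatory variable and the target. The sketch below computes only the Pearson chi-square statistic for two candidate variables and picks the larger one. It omits the category merging and Bonferroni-adjusted p-values that full CHAID uses, and comparing raw statistics is only reasonable here because both tables have the same degrees of freedom. The counts are hypothetical and are not taken from the study.

```python
def chi_square(table):
    """Pearson chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts: rows are predictor categories,
# columns are the preferred activity type (A, B).
candidates = {
    "safety_perception": [[40, 10], [15, 35]],
    "gender": [[28, 22], [27, 23]],
}

scores = {name: chi_square(table) for name, table in candidates.items()}
best = max(scores, key=scores.get)
print(best, round(scores[best], 2))
```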

Risk factors of alcohol use disorder in Korean adults based on the decision tree analysis (의사결정나무분석을 이용한 성인의 알코올사용장애 위험요인)

  • Mi Young Kwon; Ji In Kim
    • The Journal of Korean Society for School & Community Health Education / v.24 no.1 / pp.47-59 / 2023
  • Objectives: The aim of this study was to identify risk factors for alcohol use disorder among Korean adults. Methods: This cross-sectional exploratory study used data from the 6th Korea National Health and Nutrition Examination Survey, collected in 2015. Of the 3,248 participants, 2,558 were normal drinkers and 690 had alcohol use disorder. Decision tree analysis was used to examine socio-demographic and health-related factors that predict alcohol use disorder. Results: The decision tree analysis yielded a predictive model for factors related to alcohol use disorder in Korean adults with 8 pathways. The significant predictors of alcohol use disorder were age, gender, smoking, marital status, and household income. Male smokers whose household income was 'high' or 'low' were the most vulnerable to alcohol use disorder. Conclusions: This study indicates that health behavior and household income need to be considered when implementing prevention policies and health education for alcohol use disorder.

A Prediction Model for the Development of Cataract Using Random Forests (Random Forests 기법을 이용한 백내장 예측모형 - 일개 대학병원 건강검진 수검자료에서 -)

  • Han, Eun-Jeong; Song, Ki-Jun; Kim, Dong-Geon
    • The Korean Journal of Applied Statistics / v.22 no.4 / pp.771-780 / 2009
  • Cataract is the main cause of blindness and visual impairment; in particular, age-related cataract accounts for about half of the 32 million cases of blindness worldwide. As life expectancy rises and the elderly population expands, the number of cataract cases increases as well, causing a serious economic and social problem throughout the country. However, the incidence of cataract can be reduced dramatically through early diagnosis and prevention. In this study, we developed a prediction model of cataract for early diagnosis using hospital data from 3,237 subjects who first received a screening test and later visited the medical center for cataract check-ups between 1994 and 2005. To develop the prediction model, we used random forests and compared its predictive performance with that of other common discriminant models (logistic regression, discriminant analysis, decision tree, and naive Bayes) and of two popular ensemble models, bagging and arcing. The accuracy of random forests was 67.16% and the sensitivity was 72.28%; the main factors included in the model were age, diabetes, WBC, platelet, triglyceride, BMI, and so on. The results showed that the model could predict about 70% of cataract cases from the screening test alone, without any information from a direct eye examination by an ophthalmologist. We expect that our model may contribute to diagnosing cataract and help prevent it in its early stages.
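
For reference, accuracy and sensitivity as reported above are derived from the counts of a binary confusion matrix. The sketch below shows the standard definitions. The counts are hypothetical round numbers chosen for illustration and are not the study's data.

```python
def accuracy(tp, fp, tn, fn):
    """Share of all cases that were classified correctly."""
    return (tp + tn) / (tp + fp + tn + fn)


def sensitivity(tp, fn):
    """Share of actual positives that the model detected (true positive rate)."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Share of actual negatives that the model correctly rejected."""
    return tn / (tn + fp)


# Hypothetical confusion-matrix counts (not from the paper).
tp, fn, tn, fp = 72, 28, 62, 38

acc = accuracy(tp, fp, tn, fn)
sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"accuracy={acc:.2%} sensitivity={sens:.2%} specificity={spec:.2%}")
```

In a screening setting, sensitivity is often the figure of most interest, because a missed case (false negative) never reaches the follow-up examination.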

Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

  • Park, Hee Jin; Jang, Kyoung Ye; Lee, Youn Ho; Kim, Woo Je; Kang, Pil Sung
    • KIPS Transactions on Software and Data Engineering / v.4 no.11 / pp.509-520 / 2015
  • The College Scholastic Ability Test (CSAT) is a primary test for evaluating the academic achievement of high-school students and is used by most universities in South Korea for admission decisions. Because its level of difficulty is a significant issue for both students and universities, the government makes a huge effort to keep the difficulty level consistent every year. However, the actual levels of difficulty have fluctuated significantly, which causes many problems with university admission. In this paper, unlike traditional methods that depend on experts' judgments, we build two types of data-driven prediction models from accumulated CSAT test data to predict the correct answer rate and to identify significant factors for the CSAT English test. First, we derive candidate question-specific factors that can influence the correct answer rate, such as position, EBS-relation, and readability, from the annual CSAT practice tests and the CSAT over 10 years. In addition, we derive context-specific factors by employing topic modeling, which identifies the underlying topics in the text. Then, the correct answer rate is predicted by multiple linear regression and the level of difficulty is predicted by a classification tree. The experimental results show that the level-of-difficulty (difficult/easy) classification model achieves 90% accuracy, whereas the error rate for the correct answer rate is below 16%. Points and problem category are found to be critical for predicting the correct answer rate. The correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of the expected correct answer rate at both the question level and the entire-test level, which will help CSAT examiners control the level of difficulty.

A Study of Factors Associated with Software Developers Job Turnover (데이터마이닝을 활용한 소프트웨어 개발인력의 업무 지속수행의도 결정요인 분석)

  • Jeon, In-Ho; Park, Sun W.; Park, Yoon-Joo
    • Journal of Intelligence and Information Systems / v.21 no.2 / pp.191-204 / 2015
  • According to the '2013 Performance Assessment Report on the Financial Program' from the National Assembly Budget Office, the unfilled recruitment ratio of Software (SW) developers in South Korea was 25% in the 2012 fiscal year. Moreover, the unfilled recruitment ratio of highly-qualified SW developers reaches almost 80%. This phenomenon is more intense in small and medium enterprises with fewer than 300 employees. Young job-seekers in South Korea increasingly avoid becoming SW developers, and even current SW developers want to change careers, which hinders the national development of IT industries. The Korean government has recently recognized the problem and implemented policies to foster young SW developers. As a result of this effort, it has become easier to find young, beginning-level SW developers. However, it is still hard for many IT companies to recruit highly-qualified SW developers, because long-term experience is important for becoming an SW development expert. Thus, improving the job continuity intentions of current SW developers is more important than fostering new SW developers. Therefore, this study surveyed the job continuity intentions of SW developers and analyzed the factors associated with them. We carried out a survey from September 2014 to October 2014, targeting 130 SW developers who were working in IT industries in South Korea. We gathered the demographic information and characteristics of the respondents, the work environments of the SW industry, and the social positions of SW developers. Afterward, a regression analysis and a decision tree method were applied to analyze the data. These two methods are widely used data mining techniques that offer explanatory ability and are mutually complementary. We first performed a linear regression to find the important factors associated with the job continuity intention of SW developers.
The results showed that the 'expected age' until which one can work as a SW developer was the most significant factor associated with the job continuity intention. We supposed that the major cause of this phenomenon is the structural problem of IT industries in South Korea, which requires SW developers to move from development to management as they are promoted. Also, the 'motivation' to become a SW developer and the 'personality (introverted tendency)' of a SW developer are highly important factors associated with the job continuity intention. Next, the decision tree method was applied to extract the characteristics of highly motivated developers and of less motivated ones. We used the well-known C4.5 algorithm for the decision tree analysis. The results showed that 'motivation', 'personality', and 'expected age' were also important factors influencing job continuity intentions, which was similar to the results of the regression analysis. In addition, the 'ability to learn' new technology was a crucial factor in the decision rules for job continuity. In other words, a person with a high ability to learn new technology tends to work as a SW developer for a longer period of time. The decision rules also showed that the 'social position' of SW developers and the 'prospect' of the SW industry were minor factors influencing job continuity intentions. On the other hand, 'type of employment (regular position/non-regular position)' and 'type of company (ordering company/service providing company)' did not affect the job continuity intention in either method. In this research, we investigated the job continuity intentions of SW developers who were actually working at IT companies in South Korea, and we analyzed the factors associated with them. These results can be used for human resource management in many IT companies when recruiting or fostering highly-qualified SW experts.
They can also help to build SW developer fostering policies and to solve the problem of unfilled recruitment of SW developers in South Korea.
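
C4.5 selects the splitting attribute by gain ratio, that is, information gain divided by the split information of the attribute. The sketch below illustrates that criterion for categorical attributes only. The records and the attribute names (`motivation`, `employment`, `stays`) are hypothetical, loosely modeled on the factors named in the abstract, and are not the survey data.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of categorical labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain_ratio(values, labels):
    """C4.5 gain ratio of splitting `labels` by the categorical attribute `values`."""
    total = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical toy records (not the survey data).
motivation = ["high", "high", "high", "low", "low", "low"]
employment = ["regular", "temp", "regular", "temp", "regular", "temp"]
stays = ["yes", "yes", "yes", "no", "no", "no"]

ratio_motivation = gain_ratio(motivation, stays)
ratio_employment = gain_ratio(employment, stays)
print(round(ratio_motivation, 3), round(ratio_employment, 3))
```

In this toy data `motivation` separates the classes perfectly while `employment` carries almost no information, so a C4.5-style tree would split on `motivation` first.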

Forming Shop Analysis with Adaptive Systems Approach (적응시스템 접근법을 이용한 조선소 가공공장 분석)

  • Dong-Hun Shin; Jong-Hun Woo; Jang-Hyun Lee; Jong-Gye Shin
    • Journal of the Society of Naval Architects of Korea / v.39 no.3 / pp.75-80 / 2002
  • In these days of intense competition for survival, the world has shifted toward a global and digital era. Enterprises try to introduce new management and production systems to adapt to this change. However, if new technologies are simply applied to an enterprise without a thorough analysis of its manufacturing, failure follows as a logical consequence. Hence, an enterprise must analyze its manufacturing system thoroughly and needs new methodologies to mitigate risk. This study proposes a systems approach for process improvement, organized into systems analysis, systems diagnosis, and systems verification. Systems analysis examines manufacturing systems with the object-oriented UML (Unified Modeling Language) methodology from the product, process, and resource views. Systems diagnosis identifies the constraints for optimizing the system through scientific management or TOC (Theory of Constraints). Systems verification demonstrates the solution by applying a virtual manufacturing technique to the core problem that emerged from systems diagnosis. This research presents the artifacts for improving productivity obtained by applying the above methodology to a forming shop. UML provides a definite tool for analysis and offers reusability, so that the model adapts easily to its environment. The logical tree of TOC serves as a logical tool for optimizing the forming shop. The discrete event simulator QUEST provides a decision-making tool for verifying the optimized forming shop.

Exploring On-line Consumption Tendency of Sports 4.0 Market Consumer: Focused on Sports Goods Consumption by Generation of Working Age Population (스포츠 4.0 시장 소비자의 온라인 소비성향 탐색: 생산 가능인구의 세대별 스포츠 용품 소비를 중심으로)

  • Jin-Ho Shin
    • Journal of the Korean Applied Science and Technology / v.40 no.1 / pp.24-34 / 2023
  • This study explored the online consumption propensity for sports goods by generation of the working-age population, with the aim of providing basic data for predicting the future consumption market by segmenting online consumers in the sports 4.0 market. To this end, a survey was conducted among consumers of sports goods in the generation-specific groups (Generation Y and above, Z) of the working-age population, and data from a total of 478 people were used in the final analysis. The data were processed with SPSS Statistics (ver. 21.0) using frequency analysis, exploratory factor analysis, correlation analysis of re-examination reliability, reliability analysis, and decision tree analysis. The results on online consumption propensity by generation show that a consumer is highly likely to be classified into the Generation Z group if the leisure, joy, and environment factors are high. The classification accuracy of this model was 69.7%.

A Study on Eco-Efficiency in Public Sector Using Decision Tree and DEA Analysis (의사결정나무와 자료포락 분석을 이용한 공공기관 유형별 환경효율성에 대한 연구)

  • Lim, Mi Sun; Kim, Jinhwa; Choi, Soon Jae
    • Journal of the Korean Operations Research and Management Science Society / v.40 no.1 / pp.91-116 / 2015
  • This study aims to provide public-sector institutions with eco-efficiency information. To this end, the environmental and economic variables of eco-efficiency were identified through a decision tree model, and the relative eco-efficiencies of 243 public-sector institutions were then evaluated through an input-oriented DEA (Data Envelopment Analysis) model. Specifically, the amount of public purchasing per staff member and the amount of energy use per staff member were considered as input factors, and sales per staff member was considered as the output factor. The results show that most of the institutions (94.2%) were evaluated as "inefficient". The average values by type were 0.501 for market-based public corporations, 0.288 for local public corporations, 0.28 for quasi-market-based public corporations, 0.269 for fund-management-based quasi-governmental institutions, 0.09 for non-classified public institutions, and 0.078 for commissioned-service-based quasi-governmental institutions. Furthermore, a plan for internal eco-efficiency improvement can be established based on the information in the reference set. To improve eco-efficiency in the public sector in the long term, measures addressing the environmental impacts of overall public-sector operations (e.g., energy saving, water saving, waste reduction, and purchasing of green products) need to be properly proposed in consideration of the BSC (Balanced Scorecard) indicators of the public sector.

Soil Moisture Estimation Using CART Algorithm and Ancillary Data (CART기법과 보조자료를 이용한 토양수분 추정)

  • Kim, Gwang-Seob; Park, Han-Gyun
    • Journal of Korea Water Resources Association / v.43 no.7 / pp.597-608 / 2010
  • In this study, a method for soil moisture estimation is proposed to obtain a nationwide soil moisture distribution map using on-site soil moisture observations, rainfall, surface temperature, NDVI, land cover, effective soil depth, and the CART (Classification And Regression Tree) algorithm. The method was applied to the Yong-dam dam basin because the soil moisture data (4 sites) of the basin were reliable. Soil moisture observations from 3 sites (Bu-gui, San-jeon, Cheon-cheon2) were used for training the algorithm, and 1 site (Gye-buk2) was used for validation. The correlation coefficient between the observed and estimated soil moisture at the validation site is about 0.737. The results show that, despite the limitation that reliable soil moisture observations are lacking for various land use, soil type, and topographic conditions, soil moisture estimation using ancillary data and the CART algorithm can be a reasonable approach, since the algorithm provided a fairly good estimate of the soil moisture distribution for the study area.
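
For a continuous target such as soil moisture, a CART regression tree chooses each binary split to minimize the summed squared error of the two resulting groups. The sketch below shows that criterion for a single predictor. The rainfall and moisture values are hypothetical, and a full tree would repeat this search recursively across all ancillary variables.

```python
def sse(values):
    """Sum of squared errors of the values around their own mean."""
    if not values:
        return 0.0
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)


def best_split(x, y):
    """Return (threshold, total_sse) for the binary split on x that minimizes the SSE of y."""
    pairs = sorted(zip(x, y))
    best = (None, sse(y))  # baseline: no split at all
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate identical predictor values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [target for _, target in pairs[:i]]
        right = [target for _, target in pairs[i:]]
        total = sse(left) + sse(right)
        if total < best[1]:
            best = (threshold, total)
    return best


# Hypothetical data: rainfall (mm) as the predictor, soil moisture (%) as the target.
rainfall = [0, 2, 5, 20, 25, 30]
moisture = [10.0, 11.0, 12.0, 30.0, 31.0, 32.0]

threshold, error = best_split(rainfall, moisture)
print(threshold, error)
```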