• Title/Summary/Keyword: 2-stage decision tree analysis

Search Result 9, Processing Time 0.106 seconds

Two-Stage Decision Tree Analysis for Diagnosis of Personal Sasang Constitution Medicine Type (사상체질 판별을 위한 2단계 의사결정 나무 분석)

  • Jin, Hee-Jeong;Lee, Hae-Jung;Kim, Myoung-Geun;Kim, Hong-Gie;Kim, Jong-Yeol
    • Journal of Sasang Constitutional Medicine
    • /
    • v.22 no.3
    • /
    • pp.87-97
    • /
    • 2010
  • 1. Objectives: In SCM, a personal Sasang constitution must be determined accurately before any Sasang treatment. The purpose of this study is to develop an objective method for classification of Sasang constitution. 2. Methods: We collected samples from 5 centers where SCM is practiced, and applied two-stage decision tree analysis on these samples. We recruited samples from 5 centers. The collected data were from subjects whose response to herbal medicine was confirmed according to Sasang constitution. 3. Results: The two-stage decision tree model shows higher classification power than a simple decision tree model. This study also suggests that gender must be considered in the first stage to improve the accuracy of classification. 4. Conclusions: We identified important factors for classifying Sasang constitutions through two-stage decision tree analysis. The two-stage decision tree model shows higher classification power than a simple decision tree model.

CANCER CLASSIFICATION AND PREDICTION USING MULTIVARIATE ANALYSIS

  • Shon, Ho-Sun;Lee, Heon-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.706-709
    • /
    • 2006
  • Cancer is one of the major causes of death; however, the survival rate can be increased if discovered at an early stage for timely treatment. According to the statistics of the World Health Organization of 2002, breast cancer was the most prevalent cancer for all cancers occurring in women worldwide, and it account for 16.8% of entire cancers inflicting Korean women today. In order to classify the type of breast cancer whether it is benign or malignant, this study was conducted with the use of the discriminant analysis and the decision tree of data mining with the breast cancer data disclosed on the web. The discriminant analysis is a statistical method to seek certain discriminant criteria and discriminant function to separate the population groups on the basis of observation values obtained from two or more population groups, and use the values obtained to allow the existing observation value to the population group thereto. The decision tree analyzes the record of data collected in the part to show it with the pattern existing in between them, namely, the combination of attribute for the characteristics of each class and make the classification model tree. Through this type of analysis, it may obtain the systematic information on the factors that cause the breast cancer in advance and prevent the risk of recurrence after the surgery.

  • PDF

Preliminary Study to Establish a Decision Support System in Sasang Constitutional Medicine with Clinical Data (사장체질 의사결정시스템 구축을 위한 체질 진단 자료를 이용한 예비연구)

  • Jin, Hee-Jeong;Moon, Jin-Seok;Go, Seong-Ho;Ku, Im-Hoi;Lee, Si-Woo;Lee, Do-Heon;Song, Mi-Young;Kim, Jong-Yeol
    • Korean Journal of Oriental Medicine
    • /
    • v.13 no.2 s.20
    • /
    • pp.75-81
    • /
    • 2007
  • The need for the study of the revealing Sasang constitution at scientific term is increasing as the application of this discipline to the patient produces more accurate result. To obtain scientific evidence of Sasang constitution, it is crucial to analyze accumulated clinical information and associate them to the biological indices that may classify Sasang constitution. Thus, the analysis of clinical information is the most important stepping stone to go toward to the stage of developing model and decision support system (DSS) for classifying Sasang constitution. This study is a preliminary analysis of 1,109 samples collected with 171 clinical indices. To find meaningful clinical indices for classifying Sasang constitutional medicine, we applied decision tree model for them. The skin of 66.5% within whole Taeeumin is thick and non feeble. In the case of 69.8% within whole Soyangin, the skin is non feeble and slippery. In the case of 64.4% within whole Soeumin. they have feeble skin. Therefore, the property of skin can be suggested to be more important than any other index for the classification of Sasang constitution.

  • PDF

Development of the Computer-Assisted HACCP System Program and Developing HACCP-Based Evaluation Tools of Sanitation for Institutional Foodservice Operations (단체급식의 HACCP 전산프로그램 및 위생관리 평가도구 개발)

  • 이정숙;홍희정;곽동경
    • Korean Journal of Community Nutrition
    • /
    • v.3 no.4
    • /
    • pp.655-667
    • /
    • 1998
  • The Computer-assisted Hazard Analysis and Critical Control Point(HACCP) program has been developed for a systematic implementation of HACCP principles in identifying, assessing and controlling hazards in institutional foodservics operations. The HACCP-based sanitation evaluation tool has been developed, based on the results of the computerized assisted HACCP program in 4 service sites of C contracted foodservice company, including 2 general hospitals with 650-beds, one office operation of 400 meals per day, and one factory foodservice of 1,000 meals per day. All database files and processing programs were created by using Unify Vision tool with Windows 95 of user environments. The results of this study can be summarized as follows : 1. This program consists of the pre-stage for HACCP study and the implementation stage of the HACCP system. 1) The pre-stage for HACCP study includes the selection of menu items, the development of the HACCP recipe, the construction of product flow diagrams, and printing the HACCP recipes and product flow diagrams. 2) The implementation of the HACCP system includes the identification of microbiological hazards, the determination of critical control points based on the decision tree base files. 3) The HACCP-based sanitation evaluation tool consisted of 3 dimensions of time-temperature relationship, personal hygiene, and equipment-facility sanitation. The Cronbach's alphas calculation indicated that the tool was reliable. The results showed that the focus groups rated the mean of importance in time-temperature relationship, personal hygiene, and equipment-facility sanitation as 4.57, 4.59 and 4.55 respectively. Based on the results, this HACCP-based sanitation evaluation tool was considered as an effective tool for assuring product quality. This program will assist foodservice managers to encourage a standardized approach in the HACCP study and to maintain a systematic approach for ensuring that the HACCP principles are applied correctly.

  • PDF

Design and Implementation of an Intelligent Medical Expert System for TMA(Tissue Mineral Analysis) (TMA 분석을 위한 지능적 의학 전문가 시스템의 설계 및 구현)

  • 조영임;한근식
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.2
    • /
    • pp.137-152
    • /
    • 2004
  • Assesment of 30 nutritional minerals and 8 toxic elements in hair are very important not only for determining adequacy, deficiencies and unbalance, but also for assessing their relative relationships in the body. A test has been developed that serves this purpose exceedingly well. This test is known as tissue mineral analysis(TMA). TMA is very popular method in hair mineral analysis for health care professionals in over 46 countries' medical center. However, there are some problems. First, they do not have database which is suitable for korean to do analyze. Second, as the TMA results from TEI-USA is composed of english documents and graphic files prohibited to open, its usability is very low. Third, some of them has low level database which is related to TMA, so hairs are sent to TEI-USA for analyzing and medical services. it bring about an severe outflow of dollars. Finally, TMA results are based on the database of american health and mineral standards, it is possibly mislead korean mineral standards. The purposes of this research is to develope the first Intelligent Medical Expert System(IMES) of TMA, in Korea, which makes clear the problems mentioned earlier IMES can analyze the tissue mineral data with multiple stage decision tree classifier. It is also constructed with multiple fuzzy rule base and hence analyze the complex data from Korean database by fuzzy inference methods. Pilot test of this systems are increased of business efficiency and business satisfaction 86% and 92% respectively.

ITS : Intelligent Tissue Mineral Analysis Medical Information System (ITS : 지능적 Tissue Mineral Analysis 의료 정보 시스템)

  • Cho, Young-Im
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.2
    • /
    • pp.257-263
    • /
    • 2005
  • There are some problems in TMA. There are no databases in Korea which can be independently and specially analyzed the TMA results. Even there are some medical databases, some of them are low level databases which are related to TMA, so they can not serve medical services to patients as well as doctors. Moreover, TMA results are based on the database of american health and mineral standards, it is possibly mislead oriental, especially korean, mineral standards. The purposes of this paper is to develope the first Intelligent TMA Information System(ITS) which makes clear the problems mentioned earlier ITS can analyze TMA data with multiple stage decision tree classifier. It is also constructed with multiple fuzzy rule base and hence analyze the complex data from Korean database by fuzzy inference methods.

Weighted Hot-Deck Imputation in Farm and Fishery Household Economy Surveys (농어가경제조사에서 가중핫덱 무응답 대체법의 활용)

  • Kim Kyu-Seong;Lee Kee-Jae;Kim Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.311-328
    • /
    • 2005
  • This paper deals with a treatment of nonresponse in farm and fishery household economy surveys in Korea. Since the samples in two surveys were selected by stratified multi-stage sampling and weighted sample means has been used to estimate the population means, we choose a weighted hot-deck imputation method as an appropriate method for two surveys. We investigate the procedure of the weighted hot-deck as well as an adjusted jackknife method for variance estimation. Through an empirical study we found that the method worked very well in both mean and variance estimation in two surveys. In addition, we presented a procedure of forming imputation class and formed four imputation classes for each survey and then compared them with analysis. As a result, we presented two most efficient imputation classes for two surveys.

Visualizing the Results of Opinion Mining from Social Media Contents: Case Study of a Noodle Company (소셜미디어 콘텐츠의 오피니언 마이닝결과 시각화: N라면 사례 분석 연구)

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.89-105
    • /
    • 2014
  • After emergence of Internet, social media with highly interactive Web 2.0 applications has provided very user friendly means for consumers and companies to communicate with each other. Users have routinely published contents involving their opinions and interests in social media such as blogs, forums, chatting rooms, and discussion boards, and the contents are released real-time in the Internet. For that reason, many researchers and marketers regard social media contents as the source of information for business analytics to develop business insights, and many studies have reported results on mining business intelligence from Social media content. In particular, opinion mining and sentiment analysis, as a technique to extract, classify, understand, and assess the opinions implicit in text contents, are frequently applied into social media content analysis because it emphasizes determining sentiment polarity and extracting authors' opinions. A number of frameworks, methods, techniques and tools have been presented by these researchers. However, we have found some weaknesses from their methods which are often technically complicated and are not sufficiently user-friendly for helping business decisions and planning. In this study, we attempted to formulate a more comprehensive and practical approach to conduct opinion mining with visual deliverables. First, we described the entire cycle of practical opinion mining using Social media content from the initial data gathering stage to the final presentation session. Our proposed approach to opinion mining consists of four phases: collecting, qualifying, analyzing, and visualizing. In the first phase, analysts have to choose target social media. Each target media requires different ways for analysts to gain access. There are open-API, searching tools, DB2DB interface, purchasing contents, and so son. Second phase is pre-processing to generate useful materials for meaningful analysis. If we do not remove garbage data, results of social media analysis will not provide meaningful and useful business insights. To clean social media data, natural language processing techniques should be applied. The next step is the opinion mining phase where the cleansed social media content set is to be analyzed. The qualified data set includes not only user-generated contents but also content identification information such as creation date, author name, user id, content id, hit counts, review or reply, favorite, etc. Depending on the purpose of the analysis, researchers or data analysts can select a suitable mining tool. Topic extraction and buzz analysis are usually related to market trends analysis, while sentiment analysis is utilized to conduct reputation analysis. There are also various applications, such as stock prediction, product recommendation, sales forecasting, and so on. The last phase is visualization and presentation of analysis results. The major focus and purpose of this phase are to explain results of analysis and help users to comprehend its meaning. Therefore, to the extent possible, deliverables from this phase should be made simple, clear and easy to understand, rather than complex and flashy. To illustrate our approach, we conducted a case study on a leading Korean instant noodle company. We targeted the leading company, NS Food, with 66.5% of market share; the firm has kept No. 1 position in the Korean "Ramen" business for several decades. We collected a total of 11,869 pieces of contents including blogs, forum contents and news articles. After collecting social media content data, we generated instant noodle business specific language resources for data manipulation and analysis using natural language processing. In addition, we tried to classify contents in more detail categories such as marketing features, environment, reputation, etc. In those phase, we used free ware software programs such as TM, KoNLP, ggplot2 and plyr packages in R project. As the result, we presented several useful visualization outputs like domain specific lexicons, volume and sentiment graphs, topic word cloud, heat maps, valence tree map, and other visualized images to provide vivid, full-colored examples using open library software packages of the R project. Business actors can quickly detect areas by a swift glance that are weak, strong, positive, negative, quiet or loud. Heat map is able to explain movement of sentiment or volume in categories and time matrix which shows density of color on time periods. Valence tree map, one of the most comprehensive and holistic visualization models, should be very helpful for analysts and decision makers to quickly understand the "big picture" business situation with a hierarchical structure since tree-map can present buzz volume and sentiment with a visualized result in a certain period. This case study offers real-world business insights from market sensing which would demonstrate to practical-minded business users how they can use these types of results for timely decision making in response to on-going changes in the market. We believe our approach can provide practical and reliable guide to opinion mining with visualized results that are immediately useful, not just in food industry but in other industries as well.

Verification Test of High-Stability SMEs Using Technology Appraisal Items (기술력 평가항목을 이용한 고안정성 중소기업 판별력 검증)

  • Jun-won Lee
    • Information Systems Review
    • /
    • v.20 no.4
    • /
    • pp.79-96
    • /
    • 2018
  • This study started by focusing on the internalization of the technology appraisal model into the credit rating model to increase the discriminative power of the credit rating model not only for SMEs but also for all companies, reflecting the items related to the financial stability of the enterprises among the technology appraisal items. Therefore, it is aimed to verify whether the technology appraisal model can be applied to identify high-stability SMEs in advance. We classified companies into industries (manufacturing vs. non-manufacturing) and the age of company (initial vs. non-initial), and defined as a high-stability company that has achieved an average debt ratio less than 1/2 of the group for three years. The C5.0 was applied to verify the discriminant power of the model. As a result of the analysis, there is a difference in importance according to the type of industry and the age of company at the sub-item level, but in the mid-item level the R&D capability was a key variable for discriminating high-stability SMEs. In the early stage of establishment, the funding capacity (diversification of funding methods, capital structure and capital cost which taking into account profitability) is an important variable in financial stability. However, we concluded that technology development infrastructure, which enables continuous performance as the age of company increase, becomes an important variable affecting financial stability. The classification accuracy of the model according to the age of company and industry is 71~91%, and it is confirmed that it is possible to identify high-stability SMEs by using technology appraisal items.