• 제목/요약/키워드: CHAID Technique

검색결과 16건 처리시간 0.018초

CHAID 알고리즘을 이용한 산업재해 특성분석 (A Feature Analysis of Industrial Accidents Using CHAID Algorithm)

  • 임영문;황영섭
    • 대한안전경영과학회지
    • /
    • 제7권5호
    • /
    • pp.59-67
    • /
    • 2005
  • The main objective of the statistical analysis about industrial accidents is to find out what is the dangerous factor in its own industrial field so that it is possible to prevent or decrease the number of the possible accidents by educating those who work in the fields for safety tools. However, so far, there is no technique of quantitative evaluation on danger. Almost all previous researches as to industrial accidents have only relied on the frequency analysis such as the analysis of the constituent ratio on accidents. As an application of data mining technique, this paper presents analysis on the efficiency of the CHAID algorithm to classify types of industrial accidents data and thereby identifies potential weak points in accident risk grouping.

CHAID 技法에 의한 都市機能의 試論的 硏究 (An introductory study on the urban functions using CHAID technique)

  • 양순정
    • 대한지리학회지
    • /
    • 제29권3호
    • /
    • pp.360-368
    • /
    • 1994
  • 地理學에서는 地域의 特性을 규명하고자 수많은 計量的 分析手法을 사용하여 왔다. 본 고에서는 일종의 判別分析技法으로 최근에 도입된 CHAID技法을 사용하여 都市와 都市 機能에 관한 통계처리를 시도하였다. 2종류의 자료를 가지고 두 차례 처리를 실시하였는데, 하나는 인구 25만명 이상의 도시 20개를 예측변수로 하고, 行政, 市場, 金融機能 그리고 生 産機能을 반응변수로 하여 도시의 기능을 분류해 내었다. 두번째 처리에서는 앞서 언급한 행정, 시장, 금융, 생산기능 이외에 交通, 敎育, 의료, 文化, 그리고 運送機能의 9가지를 예측 변수로 선정하고, 수도권, 부산권, 대구권, 광주권, 충청권의 5개 권역을 반응변수로 하여 각 권역에서 탁월한 기능을 판별, 분류해 내었다. 이상에서 CHAID기법은 큰 양의 범주형 자료 를 처리할 수 있고, 樹形圖로 결과를 산출하여 해석이 용이하므로 地域을 分類하거나 특성 을 判別하는데 유용한 또 하나외 새로운 분석틀로 여겨진다.

  • PDF

데이터마이닝 기법(CHAID)을 이용한 효과적인 데이터베이스 마케팅에 관한 연구 (A Study on the Effective Database Marketing using Data Mining Technique(CHAID))

  • 김신곤
    • 정보기술과데이타베이스저널
    • /
    • 제6권1호
    • /
    • pp.89-101
    • /
    • 1999
  • Increasing number of companies recognize that the understanding of customers and their markets is indispensable for their survival and business success. The companies are rapidly increasing the amount of investments to develop customer databases which is the basis for the database marketing activities. Database marketing is closely related to data mining. Data mining is the non-trivial extraction of implicit, previously unknown and potentially useful knowledge or patterns from large data. Data mining applied to database marketing can make a great contribution to reinforce the company's competitiveness and sustainable competitive advantages. This paper develops the classification model to select the most responsible customers from the customer databases for telemarketing system and evaluates the performance of the developed model using LIFT measure. The model employs the decision tree algorithm, i.e., CHAID which is one of the well-known data mining techniques. This paper also represents the effective database marketing strategy by applying the data mining technique to a credit card company's telemarketing system.

  • PDF

데이터 마이닝을 활용한 병원 재방문도 영향요인 분석 : 외래환자의 만족도를 중심으로 (On the Determination of Outpatient's Revisit using Data Mining)

  • 이견직
    • 보건행정학회지
    • /
    • 제13권3호
    • /
    • pp.21-34
    • /
    • 2003
  • Patient revisit to used hospital is a key factor in determining a health care organization's competitive advantage and survival. This article examines the relationship between customer's satisfaction and his/her revisit associated with three different methods which are the Chi Square Automatic Interaction Detection(CHAID) for segmenting the outpatient group, logistic regression and neural networks for addressing the outpatient's revisit. The main findings indicate that the important factors on outpatient's revisit are physician's kindness, nurse's skill, overall level of satisfaction, hospital reputation, recommendation, level of diagnoses and outpatient's age. Among these ones, physician's kindness is the most important factor as guidelines for decision of their revisit. The decision maker of hospital should select the strategy containing the variable amount of the level of revisit and size of outpatient's group under the constraint on the hospital's time, budget and manpower given. Finally, this study shows that neural networks, as non-parametric technique, appear to more correctly predict revisit than does logistic regression as a parametric estimation technique.

웹 기반의 산업재해 예측시스템 개발에 관한 연구 (A Study on Development of A Web-Based Forecasting System of Industrial Accidents)

  • 임영문;황영섭;최요한
    • 대한안전경영과학회:학술대회논문집
    • /
    • 대한안전경영과학회 2007년도 추계학술대회
    • /
    • pp.269-274
    • /
    • 2007
  • Ultimate goal of this research is to develop a web-based forecasting system of industrial accidents. As an initial step for the purpose of this study, this paper provides a comparative analysis of 4 kinds of algorithms including CHAID, CART, C4.5, and QUEST. In addition, this paper presents the logical process for development of a forecasting system. Decision tree algorithm is utilized to predict results using objective and quantified data as a typical technique of data mining. The sample for this work was chosen from 10,536 data related to manufacturing industries during three years(2002$^{\sim}$2004) in korea.

  • PDF

SUPPORT Applications for Classification Trees

  • Lee, Sang-Bock;Park, Sun-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • 제15권3호
    • /
    • pp.565-574
    • /
    • 2004
  • Classification tree algorithms including as CART by Brieman et al.(1984) in some aspects, recursively partition the data space with the aim of making the distribution of the class variable as pure as within each partition and consist of several steps. SUPPORT(smoothed and unsmoothed piecewise-polynomial regression trees) method of Chaudhuri et al(1994), a weighted averaging technique is used to combine piecewise polynomial fits into a smooth one. We focus on applying SUPPORT to a binary class variable. Logistic model is considered in the caculation techniques and the results are shown good classification rates compared with other methods as CART, QUEST, and CHAID.

  • PDF

청각장애인의 취업결정요인 분석 연구 -데이터마이닝 기법(Exhaustive CHAID)의 적용 (Analyzing vocational outcomes of people with hearing impairments : A data mining approach)

  • 신현욱
    • 디지털융복합연구
    • /
    • 제13권11호
    • /
    • pp.449-459
    • /
    • 2015
  • 본 연구의 목적은 청각장애인의 취업결정요인을 데이터마이닝 기법을 적용하여 분석 제공함으로서, 장애인의 취업 성공률을 높임과 동시에 직업재활 개입의 효율성을 극대화할 수 있는 방안을 제시하는데 있다. 자료 분석을 위해 2013년 장애인고용패널조사의 제6차년도 자료를 이용하여, 전체 패널 데이터 중 청각장애인이면서 전체연령 20세 이상 65세 미만의 422명을 의사결정나무 기법의 하나인 Exhaustive CHAID 알고리즘을 적용하여 분석하였다. 본 연구를 통해서 얻어진 주요한 사실의 하나는 국민기초생활수급여부, 일상생활 도움필요 여부, 그리고 자격증 고용서비스 요인간의 상호작용(interaction)에 관한 패턴 분석이 청각장애인의 취업 예측에 주요한 역할을 할 수 있다는 것으로, 향후 직업재활 개입의 효과성을 높이기 위해 효과적인 취업결정요인, 즉 높은 학력 수준, 자격증 보유, 높은 일상생활 독립성을 가지고 있는 장애인을 적극적으로 발굴하여 집중적인 재활 서비스를 제공할 필요가 있을 것으로 사료된다.

제조업에서의 산업재해 예방을 위한 전문가 시스템 개발 (Development of an Expert System for Prevention of Industrial Accidents in Manufacturing Industries)

  • 임영문;최요한
    • 대한안전경영과학회지
    • /
    • 제8권1호
    • /
    • pp.53-64
    • /
    • 2006
  • Many researches and analyses have been focused on industrial accidents in order to predict and reduce them. As a similar endeavor, this paper is to develop an expert system for prevention of industrial accidents. Although various previous studies have been performed to prevent industrial accidents, these studies only provide managerial and educational policies using frequency analysis and comparative analysis based on data from past industrial accidents. As an initial step for the purpose of this study, this paper provides a comparative analysis of 4 kinds of algorithms including CHAID, CART, C4.5, and QUEST. Decision tree algorithm is utilized to predict results using objective and quantified data as a typical technique of data mining. Enterprise Miner of SAS and Answer Tree of SPSS will be used to evaluate the validity of the results of the four algorithms. The sample for this work was chosen from 10,536 data related to manufacturing industries during three years$(2002\sim2004)$ in korea. The initial sample includes a range of different businesses including the construction and manufacturing industries, which are typically vulnerable to industrial accidents.

데이터마이닝 기법을 이용한 전공이탈자 분류를 위한 성능평가 (Evaluation on Performance for Classification of Students Leaving Their Majors Using Data Mining Technique)

  • 임영문;유창현
    • 대한안전경영과학회:학술대회논문집
    • /
    • 대한안전경영과학회 2006년도 추계공동학술대회
    • /
    • pp.293-297
    • /
    • 2006
  • Recently most universities are suffering from students leaving their majors. In order to make a countermeasure for reducing major separation rate, many universities are trying to find a proper solution. As a similar endeavor, this paper uses decision tree algorithm which is one of the data mining techniques which conduct grouping or prediction into several sub-groups from interested groups. This technique can analyze a feature of type on students leaving their majors. The dataset consists of 5,115 features through data selection from total data of 13,346 collected from a university in Kangwon-Do during seven years(2000.3.1 $\sim$ 2006.6.30). The main objective of this study is to evaluate performance of algorithms including CHAID, CART and C4.5 for classification of students leaving their majors with ROC Chart, Lift Chart and Gains Chart. Also, this study provides values about accuracy, sensitivity, specificity using classification table. According to the analysis result, CART showed the best performance for classification of students leaving their majors.

  • PDF

데이터마이닝 기법을 이용한 전공이탈자 예측모형 (Predicting Model of Students Leaving Their Majors Using Data Mining Technique)

  • 임영문;유창현
    • 대한안전경영과학회지
    • /
    • 제8권5호
    • /
    • pp.17-25
    • /
    • 2006
  • Nowadays most colleges are confronting with a serious problem because many students have left their majors at the colleges. In order to make a countermeasure for reducing major separation rate, many universities are trying to find a proper solution. As a similar endeavor, the objective of this paper Is to find a predicting model of students leaving their majors. The sample for this study was chosen from a university in Kangwon-Do during seven years(2000.3.1 $\sim$ 2006. 6.30). In this study, the ratio of training sample versus testing sample among partition data was controlled as 50% : 50% for a validation test of data division. Also, this study provides values about accuracy, sensitivity, specificity about three kinds of algorithms including CHAID, CART and C4.5. In addition, ROC chart and gains chart were used for classification of students leaving their majors. The analysis results were very informative since those enable us to know the most important factors such as semester taking a course, grade on cultural subjects, scholarship, grade on majors, and total completion of courses which can affect students leaving their majors.