• Title/Summary/Keyword: datamining method

Search Result 37, Processing Time 0.028 seconds

An application of datamining approach to CQI using the discharge summary (퇴원요약 데이터베이스를 이용한 데이터마이닝 기법의 CQI 활동에의 황용 방안)

  • 선미옥;채영문;이해종;이선희;강성홍;호승희
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2000.11a
    • /
    • pp.289-299
    • /
    • 2000
  • This study provides an application of datamining approach to CQI(Continuous Quality Improvement) using the discharge summary. First, we found a process variation in hospital infection rate by SPC (Statistical Process Control) technique. Second, importance of factors influencing hospital infection was inferred through the decision tree analysis which is a classification method in data-mining approach. The most important factor was surgery followed by comorbidity and length of operation. Comorbidity was further divided into age and principal diagnosis and the length of operation was further divided into age and chief complaint. 24 rules of hospital infection were generated by the decision tree analysis. Of these, 9 rules with predictive prover greater than 50% were suggested as guidelines for hospital infection control. The optimum range of target group in hospital infection control were Identified through the information gain summary. Association rule, which is another kind of datamining method, was performed to analyze the relationship between principal diagnosis and comorbidity. The confidence score, which measures the decree of association, between urinary tract infection and causal bacillus was the highest, followed by the score between postoperative wound disruption find postoperative wound infection. This study demonstrated how datamining approach could be used to provide information to support prospective surveillance of hospital infection. The datamining technique can also be applied to various areas fur CQI using other hospital databases.

  • PDF

Two-Step Filtering Datamining Method Integrating Case-Based Reasoning and Rule Induction

  • Park, Yoon-Joo;Chol, En-Mi;Park, Soo-Hyun
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2007.05a
    • /
    • pp.329-337
    • /
    • 2007
  • Case-based reasoning (CBR) methods are applied to various target problems on the supposition that previous cases are sufficiently similar to current target problems, and the results of previous similar cases support the same result consistently. However, these assumptions are not applicable for some target cases. There are some target cases that have no sufficiently similar cases, or if they have, the results of these previous cases are inconsistent. That is, the appropriateness of CBR is different for each target case, even though they are problems in the same domain. Thus, applying CBR to whole datasets in a domain is not reasonable. This paper presents a new hybrid datamining technique called two-step filtering CBR and Rule Induction (TSFCR), which dynamically selects either CBR or RI for each target case, taking into consideration similarities and consistencies of previous cases. We apply this method to three medical diagnosis datasets and one credit analysis dataset in order to demonstrate that TSFCR outperforms the genuine CBR and RI.

  • PDF

A Trade Strategy in Stock Market using Market Basket Analysis (장바구니분석을 이용한 주식투자전략 수립 방안)

  • 주영진
    • Journal of Information Technology Applications and Management
    • /
    • v.9 no.4
    • /
    • pp.65-78
    • /
    • 2002
  • We propose a new application method of the datamining technique that might help building an efficient trade strategy in the stock market, where the analysis of the huge database is essential. The proposed method utilizes the association rules among the price changes of individual stock from the market basket analysis (a datamining technique typically used in the Marketing field) in building the strategy We also apply the proposed method to the daily stock prices in Korean stock market, from Jan. 2000 to Dec. 2001. The application results show that the proposed method gives an significantly higher yield rate than the actual stock chage rate.

  • PDF

Practical Utilization of Engineering Data based on Evolutionary Computation Method (진화연산에 의한 공학 데이터의 활용)

  • Lee Kyung-Ho;Yeon Yun-Seog;Yang Young-Soon
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 2005.04a
    • /
    • pp.317-324
    • /
    • 2005
  • Korean shipyards have accumulated a great amount of data. But they do not have appropriate tools to utilize the data in practical works. Engineering data contains experts' experience and know-how In its own. It is very useful to extract knowledge or information from the accumulated existing data by using datamining technique. This paper treats an evolutionary computation method based on genetic programming (GP), which can be one of the components to realize datamining.

  • PDF

An Effective Recruits' Assignment Method for Early Job Adaptation of Air-munition Maintenance Airmen Using Datamining Technique (데이터마이닝을 이용한 공군 무기정비병의 조기 숙달을 위한 배속방안 연구)

  • Kang, Kew-Young;Yoon, Bong-Kyoo
    • Journal of the military operations research society of Korea
    • /
    • v.37 no.1
    • /
    • pp.147-159
    • /
    • 2011
  • Recently, the military service period has been shortened continuously. Meanwhile, more skilled airmen are needed as the complexity of weapon systems increase. This phenomenon could lead to a disastrous result such as deteriorating the level of the readiness and the fighting power. We suggest a method to improve recruit's maintenance capability rapidly by assigning airmen to jobs appropriate to their characteristics using Datamining methods (K-menas and CART). We focus on the assigning method for air force's air-munition maintenance airmen since they are requested more skilled than other airmen. Grouping airmen with k-means method and devising classification rule with CART algorithm, we found that airmen's proficiency arrival period could be shortened by 1.79 months when they are assigned in the suggested way.

An Empirical Study on Telemarketing Business(L Insurance Case)

  • Kim, Yon-Hyong;Lee, Seok-Won
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.3
    • /
    • pp.877-891
    • /
    • 2008
  • The purpose in this datamining modeling is to maximize the number of L insurance' new customer selected from the S corp.'s customers through the telemarketing. We demonstrated the superiority of this method by comparing the existing marketing method and campaign result. The used software in this analysis is SAS 9.1 and so on.

  • PDF

Flood Forecasting Study using Neural Network Theory and Hydraulic Routing (신경망 이론과 수리학적 홍수추적에 의한 홍수예측에 관한 연구)

  • Jee, Hong Kee;Choo, Yeon Moon
    • Journal of Korea Water Resources Association
    • /
    • v.47 no.2
    • /
    • pp.207-221
    • /
    • 2014
  • Recently, due to global warming, climate change has affected short time concentrated local rain and unexpected heavy rain which is increasingly causing life and property damage. Therefore, this paper studies the characteristic of localized heavy rain and flash flood in Nakdong basin study area by applying Data Mining method to predict flood and constructing water level predicting model. For the verification neural network from Data Mining method and hydraulic flood routing was used for flood from July 1989 to September 1999 in Nakdong point and Iseon point was used to compare flood level change between observed water level and SAM (Slope Area Method). In this research, the study area was divided into three cases in which each point's flood discharge, water level was considered to construct the model for hydraulic flood routing and neural network based on artificial intelligence which can be made from simple input data used for comparison analysis and comparison evaluation according to actual water level and from the model.

Principal Components Logistic Regression based on Robust Estimation (로버스트추정에 바탕을 둔 주성분로지스틱회귀)

  • Kim, Bu-Yong;Kahng, Myung-Wook;Jang, Hea-Won
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.3
    • /
    • pp.531-539
    • /
    • 2009
  • Logistic regression is widely used as a datamining technique for the customer relationship management. The maximum likelihood estimator has highly inflated variance when multicollinearity exists among the regressors, and it is not robust against outliers. Thus we propose the robust principal components logistic regression to deal with both multicollinearity and outlier problem. A procedure is suggested for the selection of principal components, which is based on the condition index. When a condition index is larger than the cutoff value obtained from the model constructed on the basis of the conjoint analysis, the corresponding principal component is removed from the logistic model. In addition, we employ an algorithm for the robust estimation, which strives to dampen the effect of outliers by applying the appropriate weights and factors to the leverage points and vertical outliers identified by the V-mask type criterion. The Monte Carlo simulation results indicate that the proposed procedure yields higher rate of correct classification than the existing method.

Structuring of unstructured big data and visual interpretation (부산지역 교통관련 기사를 이용한 비정형 빅데이터의 정형화와 시각적 해석)

  • Lee, Kyeongjun;Noh, Yunhwan;Yoon, Sanggyeong;Cho, Youngseuk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1431-1438
    • /
    • 2014
  • We analyzed the articles from "Kukje Shinmun" and "Busan Ilbo", which are two local newpapers of Busan Metropolitan City. The articles cover from January 1, 2013 to December 31, 2013. Meaningful pattern inherent in 2889 articles of which the title includes "Busan" and "Traffic" and related data was analyzed. Textmining method, which is a part of datamining, was used for the social network analysis (SNA). HDFS and MapReduce (from Hadoop ecosystem), which is open-source framework based on JAVA, were used with Linux environment (Uubntu-12.04LTS) for the construction of unstructured data and the storage, process and the analysis of big data. We implemented new algorithm that shows better visualization compared with the default one from R package, by providing the color and thickness based on the weight from each node and line connecting the nodes.

Prediction of commitment and persistence in heterosexual involvements according to the styles of loving using a datamining technique (데이터마이닝을 활용한 사랑의 형태에 따른 연인관계 몰입수준 및 관계 지속여부 예측)

  • Park, Yoon-Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.69-85
    • /
    • 2016
  • Successful relationship with loving partners is one of the most important factors in life. In psychology, there have been some previous researches studying the factors influencing romantic relationships. However, most of these researches were performed based on statistical analysis; thus they have limitations in analyzing complex non-linear relationships or rules based reasoning. This research analyzes commitment and persistence in heterosexual involvement according to styles of loving using a datamining technique as well as statistical methods. In this research, we consider six different styles of loving - 'eros', 'ludus', 'stroge', 'pragma', 'mania' and 'agape' which influence romantic relationships between lovers, besides the factors suggested by the previous researches. These six types of love are defined by Lee (1977) as follows: 'eros' is romantic, passionate love; 'ludus' is a game-playing or uncommitted love; 'storge' is a slow developing, friendship-based love; 'pragma' is a pragmatic, practical, mutually beneficial relationship; 'mania' is an obsessive or possessive love and, lastly, 'agape' is a gentle, caring, giving type of love, brotherly love, not concerned with the self. In order to do this research, data from 105 heterosexual couples were collected. Using the data, a linear regression method was first performed to find out the important factors associated with a commitment to partners. The result shows that 'satisfaction', 'eros' and 'agape' are significant factors associated with the commitment level for both male and female. Interestingly, in male cases, 'agape' has a greater effect on commitment than 'eros'. On the other hand, in female cases, 'eros' is a more significant factor than 'agape' to commitment. In addition to that, 'investment' of the male is also crucial factor for male commitment. Next, decision tree analysis was performed to find out the characteristics of high commitment couples and low commitment couples. In order to build decision tree models in this experiment, 'decision tree' operator in the datamining tool, Rapid Miner was used. The experimental result shows that males having a high satisfaction level in relationship show a high commitment level. However, even though a male may not have a high satisfaction level, if he has made a lot of financial or mental investment in relationship, and his partner shows him a certain amount of 'agape', then he also shows a high commitment level to the female. In the case of female, a women having a high 'eros' and 'satisfaction' level shows a high commitment level. Otherwise, even though a female may not have a high satisfaction level, if her partner shows a certain amount of 'mania' then the female also shows a high commitment level. Finally, this research built a prediction model to establish whether the relationship will persist or break up using a decision tree. The result shows that the most important factor influencing to the break up is a 'narcissistic tendency' of the male. In addition to that, 'satisfaction', 'investment' and 'mania' of both male and female also affect a break up. Interestingly, while the 'mania' level of a male works positively to maintain the relationship, that of a female has a negative influence. The contribution of this research is adopting a new technique of analysis using a datamining method for psychology. In addition, the results of this research can provide useful advice to couples for building a harmonious relationship with each other. This research has several limitations. First, the experimental data was sampled based on oversampling technique to balance the size of each classes. Thus, it has a limitation of evaluating performances of the predictive models objectively. Second, the result data, whether the relationship persists of not, was collected relatively in short periods - 6 months after the initial data collection. Lastly, most of the respondents of the survey is in their 20's. In order to get more general results, we would like to extend this research to general populations.