• Title/Summary/Keyword: Association rule mining

Search Result 351, Processing Time 0.025 seconds

A study of association rule by considering the frequency (발생빈도를 고려한 연관성분석 연구)

  • Lim, Je-Soon;Lee, Kyeong-Jun;Cho, Young-Seuk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1061-1069
    • /
    • 2010
  • In data mining, association rule is a popular and well researched method for discovering interesting relations between variables. There are three measures for association rule, support, confidence and lift. But there are some problem in them. They don't consider the frequency of variable in case. So, we need the new association rule which consider the frequency.In this paper, we proposed the new association rule. We compared the proposed association rule with the original association rule from example data. As a result, we knew our function was better than the original function in terms of sensitivity.

Trend Analysis of News Articles Regarding Sungnyemun Gate using Text Mining (텍스트마이닝을 활용한 숭례문 관련 기사의 트렌드 분석)

  • Kim, Min-Jeong;Kim, Chul Joo
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.3
    • /
    • pp.474-485
    • /
    • 2017
  • Sungnyemun Gate, Korea's National Treasure No.1, was destroyed by fire on February 10, 2008 and has been re-opened to the public again as of May 4, 2013 after a reconstruction work. Sungnyemun Gate become a national issue and draw public attention to be a major topic on news or research. In this research, text mining and association rule mining techniques were used on keyword of newspaper articles related to Sungnyemun Gate as a cultural heritage from 2002 to 2016 to find major keywords and keyword association rule. Next, we analyzed some typical and specific keywords that appear frequently and partially depending on before and after the fire and newpaper companies. Through this research, the trends and keywords of newspapers articles related to Sungnyemun Gate could be understood, and this research can be used as fundamental data about Sungnyemun Gate to information producer and consumer.

Recommender System using Association Rule and Collaborative Filtering (연관 규칙과 협력적 여과 방식을 이용한 추천 시스템)

  • 이기현;고병진;조근식
    • Journal of Intelligence and Information Systems
    • /
    • v.8 no.2
    • /
    • pp.91-103
    • /
    • 2002
  • A collaborative filtering which supports personalized services of users has been common use in existing web sites for increasing the satisfaction of users. A collaborative filtering is demanded that items are estimated more than specified number. Besides, it tends to ignore information of other users as recommending them on the basis of information of partial users who have similar inclination. However, there are valuable hidden information into other users' one. In this paper, we use Association Rule, which is common wide use in Data Mining, with collaborative filtering for the purpose of discovering those information. In addition, this paper proved that Association Rule applied to Recommender System has a effects to recommend users by the relation between groups. In other words, Association Rule based on the history of all users is derived from. and the efficiency of Recommender System is improved by using Association Rule with collaborative filtering.

  • PDF

Weighted association rules considering item RFM scores (항목 알에프엠 점수를 고려한 가중 연관성 규칙)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1147-1154
    • /
    • 2010
  • One of the important goals in data mining is to discover and decide the relationships between different variables. Association rules are required for this technique and it find meaningful rules by quantifying the relationship between two items based on association measures such as support, confidence, and lift. In this paper, we presented the evaluation criteria of weighted association rule considering item RFM scores as importance of items. Original RFM technique has been used most widely applied method using customer information to find the most profitable customers. And then we compared general association rule technique with weighted association rule technique through the simulation data.

Public Satisfaction Analysis of Weather Forecast Service by Using Twitter (Twitter를 활용한 기상예보서비스에 대한 사용자들의 만족도 분석)

  • Lee, Ki-Kwang
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.41 no.2
    • /
    • pp.9-15
    • /
    • 2018
  • This study is intended to investigate that it is possible to analyze the public awareness and satisfaction of the weather forecast service provided by the Korea Meteorological Administration (KMA) through social media data as a way to overcome limitations of the questionnaire-based survey in the previous research. Sentiment analysis and association rule mining were used for Twitter data containing opinions about the weather forecast service. As a result of sentiment analysis, the frequency of negative opinions was very high, about 75%, relative to positive opinions because of the nature of public services. The detailed analysis shows that a large portion of users are dissatisfied with precipitation forecast and that it is needed to analyze the two kinds of error types of the precipitation forecast, namely, 'False alarm' and 'Miss' in more detail. Therefore, association rule mining was performed on negative tweets for each of these error types. As a result, it was found that a considerable number of complaints occurred when preventive actions were useless because the forecast predicting rain had a 'False alarm' error. In addition, this study found that people's dissatisfaction increased when they experienced inconveniences due to either unpredictable high winds and heavy rains in summer or severe cold in winter, which were missed by weather forecast. This study suggests that the analysis of social media data can provide detailed information about forecast users' opinion in almost real time, which is impossible through survey or interview.

Interrelationship Analysis between Causal Factors of Construction Defect Using Association Rule Mining

  • Lee, Sang-Deok;Han, Sang-Won;Hyun, Chang-Taek
    • International conference on construction engineering and project management
    • /
    • 2015.10a
    • /
    • pp.627-628
    • /
    • 2015
  • Construction defect which can causes economic damage such as schedule delay, cost overrun is a considerably important factor in construction industry. In general, a construction defect features a difficulty to find out causes precisely because it occurs when several interrelated causes combine. Yet, studies have tried to understand the interrelationships between factors are limited. In addition, despite of a tremendous amount of construction data, it's not still enough to analyze them, but tends to depend on experience or know-how of practitioners. Thus, it is necessary to identify underlying causes in influential factors by utilizing related data. This paper analyses Interrelationships between causal factors using Association Rule Mining to discover root causes of construction defects. Confidence and Lift that can be used for presenting the interrelationships of the causes were extracted from 1241 cases in 30 projects in Korea. It is expected that this paper allows the construction managers to discover key factors and make right decisions to reduce occurrence of construction defects. Furthermore, analysis of interrelationships can improve understanding of structural patterns of construction defects.

  • PDF

Design and implementation of data mining tool using PHP and WEKA (피에이치피와 웨카를 이용한 데이터마이닝 도구의 설계 및 구현)

  • You, Young-Jae;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.2
    • /
    • pp.425-433
    • /
    • 2009
  • Data mining is the method to find useful information for large amounts of data in database. It is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. We need a data mining tool to explore a lot of information. There are many data mining tools or solutions; E-Miner, Clementine, WEKA, and R. Almost of them are were focused on diversity and general purpose, and they are not useful for laymen. In this paper we design and implement a web-based data mining tool using PHP and WEKA. This system is easy to interpret results and so general users are able to handle. We implement Apriori algorithm of association rule, K-means algorithm of cluster analysis, and J48 algorithm of decision tree.

  • PDF

Association Rule Mining and Collaborative Filtering-Based Recommendation for Improving University Graduate Attributes

  • Sheta, Osama E.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.6
    • /
    • pp.339-345
    • /
    • 2022
  • Outcome-based education (OBE) is a tried-and-true teaching technique based on a set of predetermined goals. Program Educational Objectives (PEOs), Program Outcomes (POs), and Course Outcomes (COs) are the components of OBE. At the end of each year, the Program Outcomes are evaluated, and faculty members can submit many recommended measures which dependent on the relationship between the program outcomes and its courses outcomes to improve the quality of program and hence the overall educational program. When a vast number of courses are considered, bad actions may be proposed, resulting in unwanted and incorrect decisions. In this paper, a recommender system, using collaborative filtering and association rules algorithms, is proposed for predicting the best relationship between the program outcomes and its courses in order to improve the attributes of the graduates. First, a parallel algorithm is used for Collaborative Filtering on Data Model, which is designed to increase the efficiency of processing big data. Then, a parallel similar learning outcomes discovery method based on matrix correlation is proposed by mining association rules. As a case study, the proposed recommender system is applied to the Computer Information Systems program, College of Computer Sciences and Information Technology, Al-Baha University, Saudi Arabia for helping Program Quality Administration improving the quality of program outcomes. The obtained results revealed that the suggested recommender system provides more actions for boosting Graduate Attributes quality.

Comparison of confidence measures useful for classification model building (분류 모형 구축에 유용한 신뢰도 측도 간의 비교)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.365-371
    • /
    • 2014
  • Association rule of the well-studied techniques in data mining is the exploratory data analysis for understanding the relevance among the items in a huge database. This method has been used to find the relationship between each set of items based on the interestingness measures such as support, confidence, lift, similarity measures, etc. By typical association rule technique, we generate association rule that satisfy minimum support and confidence values. Support and confidence are the most frequently used, but they have the drawback that they can not determine the direction of the association because they have always positive values. In this paper, we compared support, basic confidence, and three kinds of confidence measures useful for classification model building to overcome this problem. The result confirmed that the causal confirmed confidence was the best confidence in view of the association mining because it showed more precisely the direction of association.

Association rule thresholds of similarity measures considering negative co-occurrence frequencies (동시 비 발생 빈도를 고려한 유사성 측도의 연관성 규칙 평가 기준 활용 방안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.6
    • /
    • pp.1113-1121
    • /
    • 2011
  • Recently, a variety of data mining techniques has been applied in various fields like healthcare, insurance, and internet shopping mall. Association rule mining is a popular and well researched method for discovering interesting relations among large set of data items. Association rule mining is the method to quantify the relationship between each set of items in very huge database based on the association thresholds. There are three primary quality measures for association rules; support and confidence and lift. In this paper we consider some similarity measures with negative co-occurrence frequencies which is widely used in cluster analysis or multi-dimensional analysis as association thresholds. The comparative studies with support, confidence and some similarity measures are shown by numerical example.