• Title/Summary/Keyword: Frequent Item

Search Result 117, Processing Time 0.023 seconds

Discovering Association Rules using Item Clustering on Frequent Pattern Network (빈발 패턴 네트워크에서 아이템 클러스터링을 통한 연관규칙 발견)

  • Oh, Kyeong-Jin;Jung, Jin-Guk;Ha, In-Ay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.14 no.1
    • /
    • pp.1-17
    • /
    • 2008
  • Data mining is defined as the process of discovering meaningful and useful pattern in large volumes of data. In particular, finding associations rules between items in a database of customer transactions has become an important thing. Some data structures and algorithms had been proposed for storing meaningful information compressed from an original database to find frequent itemsets since Apriori algorithm. Though existing method find all association rules, we must have a lot of process to analyze association rules because there are too many rules. In this paper, we propose a new data structure, called a Frequent Pattern Network (FPN), which represents items as vertices and 2-itemsets as edges of the network. In order to utilize FPN, We constitute FPN using item's frequency. And then we use a clustering method to group the vertices on the network into clusters so that the intracluster similarity is maximized and the intercluster similarity is minimized. We generate association rules based on clusters. Our experiments showed accuracy of clustering items on the network using confidence, correlation and edge weight similarity methods. And We generated association rules using clusters and compare traditional and our method. From the results, the confidence similarity had a strong influence than others on the frequent pattern network. And FPN had a flexibility to minimum support value.

  • PDF

An Efficient Algorithm for Updating Discovered Association Rules in Data Mining (데이터 마이닝에서 기존의 연관규칙을 갱신하는 효율적인 앨고리듬)

  • 김동필;지영근;황종원;강맹규
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.21 no.45
    • /
    • pp.121-133
    • /
    • 1998
  • This study suggests an efficient algorithm for updating discovered association rules in large database, because a database may allow frequent or occasional updates, and such updates may not only invalidate some existing strong association rules, but also turn some weak rules into strong ones. FUP and DMI update efficiently strong association rules in the whole updated database reusing the information of the old large item-sets. Moreover, these algorithms use a pruning technique for reducing the database size in the update process. This study updates strong association rules efficiently in the whole updated database reusing the information of the old large item-sets. An updating algorithm that is suggested in this study generates the whole candidate item-sets at once in an incremental database in view of the fact that it is difficult to find the new set of large item-sets in the whole updated database after an incremental database is added to the original database. This method of generating candidate item-sets is different from that of FUP and DMI. After generating the whole candidate item-sets, if each item-set in the whole candidate item-sets is large at an incremental database, the original database is scanned and the support of each item-set in the whole candidate item-sets is updated. So, the whole large item-sets in the whole updated database is found out. An updating algorithm that is suggested in this study does not use a pruning technique for reducing the database size in the update process. As a result, an updating algoritm that is suggested updates fast and efficiently discovered large item-sets.

  • PDF

The Design Characteristics of Modern Children's Clothes - Focus on Children's Clothes for Girls From The 2006S/S-2010S/S Collections - (현대 아동복의 디자인 특성 분석 - 2006S/S~2010S/S 컬렉션의 여자 아동복를 중심으로 -)

  • Kong, Mi-Ran
    • Fashion & Textile Research Journal
    • /
    • v.14 no.3
    • /
    • pp.347-362
    • /
    • 2012
  • The analysis results of the characteristics of children's clothes for school girls from the 2006S/S-2010S/S collections reveal that the most frequent type of item composition was 2'PS & 3'PS. T-shirts were the popular kind of item for upper garments and pants for lower garments. The major silhouettes were H and A line. The predominant neckline was U, which was followed by camisole and V. The major collar styles were stand-up and flat. Chromatic colors were used more than achromatic ones and after white the most popular colors were blue, pink, red, black, navy, and green. The dominant color scheme was a two-color or three-color scheme. The percentage of solid and patterned material garments was the highest and the most popular pattern was flowers. The most popular style of garment had no decorative detail that was followed by one-item, two-item, and three-item decoration. The major kinds of decorative detail were frills, pleats, and prints.

A Study on Improvement of the Quality Management for Fire Doors (방화문의 현장품질관리 개선방안에 관한 연구)

  • Choi, Dong-ho
    • Proceedings of the Korean Institute of Building Construction Conference
    • /
    • 2019.05a
    • /
    • pp.93-94
    • /
    • 2019
  • When the fire door is recently installed in the field, there are frequent cases where the fire door is manufactured with fire door having low quality or different structure compared to the performance that the fire door producer has confirmed in the performance test or the construction specification. In order to improve the on-site quality management of the fire door, we comprehensively classify the quality management items of the fire door according to the management subject and the step by stage and set the field quality management procedure, the field quality management inspection item, regulations and standardized checklists were presented.

  • PDF

Border-based HSFI Algorithm for Hiding Sensitive Frequent Itemsets (민감한 빈발항목집합을 숨기기 위한 경계기반 HSFI 알고리즘)

  • Lee, Dan-Young;An, Hyoung-Keun;Koh, Jae-Jin
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.10
    • /
    • pp.1323-1334
    • /
    • 2011
  • This paper suggests the border based HSFI algorithm to hide sensitive frequent itemsets. Node formation of FP-Tree which is different from the previous one uses the border to minimize the impacts of nonsensitive frequent itemsets in hiding process, including the organization of sensitive and border information, and all transaction as well. As a result of applying HSFI algorithms, it is possible to be the example transaction database, by significantly reducing the lost items, it turns out that HSFI algorithm is more effective than the existing algorithm for maintaining the quality of more improved database.

A New Method for Efficiently Generating of Frequent Items by IRG in Data Mining (데이터 마이닝에서 IRG에 의한 효율적인 빈발항목 생성방법)

  • 허용도;이광형
    • Journal of Korea Multimedia Society
    • /
    • v.5 no.1
    • /
    • pp.120-127
    • /
    • 2002
  • The common problems found in the data mining methods current in use have following problems. First: It is ineffective in searching for frequent items due to changing of minimal support values. Second: It is not adaptable to occurring of unuseful relation rules. Third: It is very difficult to re-use preceding results while adding new transactions. In this paper, we introduce a new method named as SPM-IRG(Selective Patters Mining using item Relation Graph), that is designed to solve above listed problems. SPM-IRG method creates a frequent items using minimal support values obtained by investigating direct or indirect relation of all items in transaction. Moreover, the new method can minimize inefficiency of existing method by constructing frequent items using only the items that we are interested.

  • PDF

I-Tree: A Frequent Patterns Mining Approach without Candidate Generation or Support Constraint

  • Tanbeer, Syed Khairuzzaman;Sarkar, Jehad;Jeong, Byeong-Soo;Lee, Young-Koo;Lee, Sung-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.31-33
    • /
    • 2007
  • Devising an efficient one-pass frequent pattern mining algorithm has been an issue in data mining research in recent past. Pattern growth algorithms like FP-Growth which are found more efficient than candidate generation and test algorithms still require two database scans. Moreover, FP-growth approach requires rebuilding the base-tree while mining with different support counts. In this paper we propose an item-based tree, called I-Tree that not only efficiently mines frequent patterns with single database scan but also provides multiple mining scopes with multiple support thresholds. The 'build-once-mine-many' property of I-Tree allows it to construct the tree only once and perform mining operation several times with the variation of support count values.

  • PDF

An Extended Frequent Pattern Tree for Hiding Sensitive Frequent Itemsets (민감한 빈발 항목집합 숨기기 위한 확장 빈발 패턴 트리)

  • Lee, Dan-Young;An, Hyoung-Geun;Koh, Jae-Jin
    • The KIPS Transactions:PartD
    • /
    • v.18D no.3
    • /
    • pp.169-178
    • /
    • 2011
  • Recently, data sharing between enterprises or organizations is required matter for task cooperation. In this process, when the enterprise opens its database to the affiliates, it can be occurred to problem leaked sensitive information. To resolve this problem it is needed to hide sensitive information from the database. Previous research hiding sensitive information applied different heuristic algorithms to maintain quality of the database. But there have been few studies analyzing the effects on the items modified during the hiding process and trying to minimize the hided items. This paper suggests eFP-Tree(Extended Frequent Pattern Tree) based FP-Tree(Frequent Pattern Tree) to hide sensitive frequent itemsets. Node formation of eFP-Tree uses border to minimize impacts of non sensitive frequent itemsets in hiding process, by organizing all transaction, sensitive and border information differently to before. As a result to apply eFP-Tree to the example transaction database, the lost items were less than 10%, proving it is more effective than the existing algorithm and maintain the quality of database to the optimal.

A Study on Fitness of Middle School Girls Uniform Size in Consideration of Growth -Focusing on Jacket and Skirt- (여중생 성장을 고려한 최적 교복치수 선정 -자켓과 스커트를 중심으로-)

  • 김덕하;김인숙
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.25 no.2
    • /
    • pp.315-326
    • /
    • 2001
  • This study is to suggest data for setting school uniform size with the size satisfaction degree through finding out the physical body change and uniform size problems following an increase in age during middle school girl days. For this purpose, by analyzing the physical body size data of 13~15 age of girls among $\ulcorner$human body size data of the youth for product design$\lrcorner$publicized under sponsorship of National Technology Quality Institute in 1999 the physical body size change by part following an increase in age during middle school girl days was found out and the most frequent physical body size by grade was suggested. Questionnaire about uniform production status and product size at the object of uniform makers were measured directly, a school uniform wearing status and size satisfaction degree by part were at the abject of middle school girls were examined by means of questionnaire and unsatisfactory factors in uniform size were found out. Based on collected data the most optimum product size in each part by item were suggested. The method of suggesting the most optimum size suitability by item was decided based on the result of survey into corresponding title and product size by maker and that of survey into the size satisfaction degree of middle school girls by maker, and the product size of maker showing the highest size satisfaction degree was selected as the most optimum product size.

  • PDF

Efficient Dynamic Weighted Frequent Pattern Mining by using a Prefix-Tree (Prefix-트리를 이용한 동적 가중치 빈발 패턴 탐색 기법)

  • Jeong, Byeong-Soo;Farhan, Ahmed
    • The KIPS Transactions:PartD
    • /
    • v.17D no.4
    • /
    • pp.253-258
    • /
    • 2010
  • Traditional frequent pattern mining considers equal profit/weight value of every item. Weighted Frequent Pattern (WFP) mining becomes an important research issue in data mining and knowledge discovery by considering different weights for different items. Existing algorithms in this area are based on fixed weight. But in our real world scenarios the price/weight/importance of a pattern may vary frequently due to some unavoidable situations. Tracking these dynamic changes is very necessary in different application area such as retail market basket data analysis and web click stream management. In this paper, we propose a novel concept of dynamic weight and an algorithm DWFPM (dynamic weighted frequent pattern mining). Our algorithm can handle the situation where price/weight of a pattern may vary dynamically. It scans the database exactly once and also eligible for real time data processing. To our knowledge, this is the first research work to mine weighted frequent patterns using dynamic weights. Extensive performance analyses show that our algorithm is very efficient and scalable for WFP mining using dynamic weights.