Parallel Data Mining with Distributed Frequent Pattern Trees

;;

Proceedings of the IEEK Conference (대한전자공학회:학술대회논문집)

2003.07c
/
Pages.2561-2564
/
2003

The Institute of Electronics and Information Engineers (대한전자공학회)

Parallel Data Mining with Distributed Frequent Pattern Trees

분산형 FP트리를 활용한 병렬 데이터 마이닝

조두산 (고려대학교 전기공학과) ;
김동승 (고려대학교 전기공학과)

Published : 2003.07.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Data mining is an effective method of the discovery of useful information such as rules and previously unknown patterns existing in large databases. The discovery of association rules is an important data mining problem. We have developed a new parallel mining called Distributed Frequent Pattern Tree (abbreviated by DFPT) algorithm on a distributed shared nothing parallel system to detect association rules. DFPT algorithm is devised for parallel execution of the FP-growth algorithm. It needs only two full disk data scanning of the database by eliminating the need for generating the candidate items. We have achieved good workload balancing throughout the mining process by distributing the work equally to all processors. We implemented the algorithm on a PC cluster system, and observed that the algorithm outperformed the Improved Count Distribution scheme.

Proceedings of the IEEK Conference (대한전자공학회:학술대회논문집)

Parallel Data Mining with Distributed Frequent Pattern Trees

분산형 FP트리를 활용한 병렬 데이터 마이닝

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)