Fast Distributed Algorithm of Mining Global Frequent Itemsets

Article Preview

Abstract:

Most distributed algorithms of mining global frequent itemsets worked on net structure network and adopted Apriori-like algorithm. Whereas there were some problems in these algorithms: a lot of candidate itemsets and heavy communication traffic. Aiming at these problems, this paper proposed a fast distributed algorithm of mining global frequent itemsets, namely, FDMGFI algorithm, which set centre node. FDMGFI algorithm made computer nodes compute local frequent itemsets independently with FP-growth algorithm, then the centre node exchanged data with other computer nodes and combined, finally, global frequent itemsets were gained. FDMGFI algorithm required far less communication traffic by the searching strategies of top-down and bottom-up. Theoretical analysis and experimental results suggest that FDMGFI algorithm is fast and effective.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 219-220)

Pages:

191-194

Citation:

Online since:

March 2011

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Chen ZB, Han H, Wang JX. Data Warehouse and Data Mining[M]. Beijing: Tsinghua University Press, (2009).

Google Scholar

[2] Agrawal, R. Shafer, J.C. Parallel mining of association rules[C]. IEEE Transaction on Knowledge and Data Engineering, 1996, 962-969.

DOI: 10.1109/69.553164

Google Scholar

[3] Cheung, D.W., Han, J.W., Ng, W.T., Tu, Y.J.A fast distributed algorithm for mining association rules[C]. In: Proceedings of IEEE 4th International Conference on Management of Data, Miami Beach, Florida,1996, 31-34.

Google Scholar

[4] Han, J.W., Pei, J., Yin, Y. Mining frequent patterns without Candidate Generation[C]. In: Proceedings of the 2000 ACM SIGMOD international conference on Management of data, Dallas, Texas, United States,2000,1-12.

DOI: 10.1145/342009.335372

Google Scholar

[5] Bo H, Yue W, Yang W and Yuan C. Fast Algorithm for Mining Global Frequent Itemsets Based on Distributed Database [C]. Rough Sets and Knowledge Technology, Chongqing, 2006, 415-420.

DOI: 10.1007/11795131_60

Google Scholar