A Frequent Item Sets Computing Algorithm Based on MapReduce

Article Preview

Abstract:

Traditional method mining for association rules between items in large and grand data sets is inefficient. In this paper we present an efficient method called BPMRA which is based on mapreduce and partition. We have compared BPMRA algorithm based multi-node and partition based single node method and performed some experiments. It turns out that BPMRA possesses high parallelism good stability and scalability, especially suitable for mining for association rules in large and grand data sets.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1701-1704

Citation:

Online since:

August 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] AGRAWAL R, IMIELINSKI T, SWAMI A. Mining Association Rules Between Sets of Items in Large Databases[C]/In Proceedings of 1993 ACM SIGMOD International Conference on Management of Data. Washington, DC: ACM Press, 1993: 207-216.

DOI: 10.1145/170036.170072

Google Scholar

[2] JEFFREY DEAN, SANJAY GHEMAWAT. MapReduce: Simplified Data Processing on Large Clusters [C]/In Proceedings of the 6th Symposium on Operating System Design and Implementation. San Francisco: Google Inc., 2004: 10-23.

Google Scholar

[3] SARVASERE A, OMIECINSKY E, NAVATHE S. An Efficient Algorithm for Mining Association Rules in Large Databases[C]/In Proceedings of the 21st International Conference on Very Large Databases. San Francisco: [s. n. ], 1995: 432-444.

Google Scholar

[4] Hadoop project [EB/OL]. (2009). http: /hadoop. apache. org.

Google Scholar

[5] TOM WHITE. Hadoop: The Definitive Guide [M]. California:O'Reilly Media, Inc. ,(2009).

Google Scholar

[6] JIAWEI HAN, MICHELINE KAMBER. Data Mining Concepts and techniques, Second Edition [M], (2008).

Google Scholar