The Research of Generation Algorithm of Frequent Itemsets in High-Dimensional Data

Article Preview

Abstract:

In the mining of association rules, the generation of frequent itemsets is a key factor that influence the efficiency and performance of the algorithm. With the increase of data dimension, it is obvious that the traditional association rules mining algorithm can’t meet the demand of high dimensional data mining. On the basis of Apriori algorithm, we put forward Split Mtrix _Apriori algorithm in this paper. By generating the Boolean matrix of the database, Split Mtrix _Apriori algorithm decreased the times of scanning database when generating the frequent itemsets. With adopting grouping processing strategy in the Boolean matrix, the algorithm can still keep high efficiency in dealing with high-dimensional data.So Split Mtrix _Apriori improved the efficiency of association rule mining significantly.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

127-131

Citation:

Online since:

January 2015

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Lloyd, S. Wang, W. Shao. Least squares quantization in PCM [C]. IEEE Transactions onInformation Theory, Switzerland, 1982, 129 137.

DOI: 10.1109/tit.1982.1056489

Google Scholar

[2] Hamerly, G. and Elkan, C. Alternatives to the k-means algorithm that find better clusterings[C]. Proceedings of the eleventh international conference on Information and knowledge management (CIKM), 2002, 141 182.

DOI: 10.1145/584792.584890

Google Scholar

[3] Rajani Baburajan. The Rising Cloud Storage Market Opportunity Strengthens Vendors. http: /www. thectoforum. com/content/converged-infrastructure-0, Dec. 02, (2011).

Google Scholar

[4] PeterMell Timothy Grance. The NIST Definition of Cloud Computing. http: /csrc. nist. gov/publications/nistpubs/800-145/SP800-145. pdf, July. 24, (2011).

Google Scholar