An Improved Apriori Algorithm Based on Similarity

Article Preview

Abstract:

The disadvantages of Apriori algorithm are firstly discussed. Then, a new measure of cosine similarity is proposed and treated as an interest threshold. Furthermore, an improved Apriori algorithm called Sim-Apriori is proposed based on similarity. It cannot only accurately find the relations between different products in transactions databases and reduce the useless rules but also handle the negative rules. Experiments have been carried out to verify the effectiveness of the algorithm. A Sim-Apriori system is developed by ASP.NET and SQL SERVER in B/S mode. The result shows that the algorithm is effective at discovering the association rules in sales management related ERP.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 532-533)

Pages:

1825-1829

Citation:

Online since:

June 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] R. Srikant, R. Agrawal: Mining generalized association rules. In: Proc of 21th Int'l Conf on Very Large Data Bases. Zurich, Switzerland: Morgan Kaufmann, 1995. 407-419.

Google Scholar

[2] A. Savasere, E. Omiecinski, S. Navathe: Mining for strong negative associations in a large database of customer transactions. In: Proc of the 14th Int'l Conf on Data Engineering. Orlando, Florida, USA: IEEE Computer Society Press, 1998. 494-502.

DOI: 10.1109/icde.1998.655812

Google Scholar

[3] R. Agrawal, T. Imielinski, and A. Swami: Mining association rules between set of items in large databases. Proceedings of the 1993 ACMSIGMOD Conference on Management of Data, Washington, D. C., 1993. 207-216.

DOI: 10.1145/170036.170072

Google Scholar

[4] F.J. Shao, Z.Q. Yu, J.L. Wang, R.C. Sun, in: Principle and Algorithm of Data Mining. Science Press, 2009. 92-99.

Google Scholar

[5] J. A. Michael, S. L. Gordon, in: Data Mining Technology For Marketing, Sales, and Customer Relationship Management. Machinery Industry Press, 2006, 340-350.

Google Scholar