A Web Data Mining Algorithm Based on Weighted Association Rules

Xiao Lv; Yong Jie Li; Xu Lu

doi:10.4028/www.scientific.net/KEM.467-469.1386

Paper Titles

A Human-Attention Driven Video Capture Method for Wearable Visual Computing
p.1361

Prediction of the Plastic Fracture Mechanical Properties on Metallurgical Composite Bimetallic Tube with Axial Crack
p.1367

The Numerical Simulation on the Temperature Field of Laser Cladding
p.1372

An Algorithm of Complete Coverage Path Planning for Autonomous Underwater Vehicles
p.1377

A Web Data Mining Algorithm Based on Weighted Association Rules
p.1386

Virus Evolution Based Gene Expression Programming for Classification Rules Mining
p.1392

Study on Forecasting of Gold Price Based on Varying-Coefficient Regression Model
p.1398

Effects of Chlorine on Aluminum Refining: A Review and an Ab Initio Molecular Dynamics Study
p.1404

A New Method of Wireless Health Monitoring for SMA Composite Structure Based on Micro-Capacitance Measurement
p.1410

HomeKey Engineering MaterialsKey Engineering Materials Vols. 467-469A Web Data Mining Algorithm Based on Weighted...

A Web Data Mining Algorithm Based on Weighted Association Rules

Abstract:

Association rules mining is attracting much attention in research community due to its broad applications. Existing web data mining methods suffer the problems that 1) the large number of candidate itemsets, which are hard to be pruned, should be pruned in advance. 2) the time of scanning the database, which are needed to scan transactional database repeatedly, should be reduced. In this paper, a new association rules mining model is introduced for overcoming above two problems. We develop an efficient algorithm-WARDM(Weighted Association Rules Data Mining) for mining the candidate itemsets. The algorithm discusses the generation of candidate-1 itemset, candidate-2 itemset and candidate-k itemset(k>2),which can avoid missing weighted frequent itemsets. And the transactional database are scanned only once and candidate itemsets are pruned twice, which can reduce the amount of candidate itemsets. Theoretical analysis and experimental results show the space and time complexity is relatively good, Meanwhile the algorithm decreases the number of candidate itemsets, enhances the execution efficiency.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Key Engineering Materials (Volumes 467-469)

Pages:

1386-1391

DOI:

https://doi.org/10.4028/www.scientific.net/KEM.467-469.1386

Citation:

Cite this paper

Online since:

February 2011

Authors:

Xiao Lv, Yong Jie Li, Xu Lu

Keywords:

New-Apriori, WARDM, Web Data Mining, Weighted Association Rules

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] Usama Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth. Knowledge discovery and data mining: towards a unifying framework. In proc. of the 2nd int'1 corf. on knowledge discovery and data mining, 2007: 505-509PR.W. Thomas, L. A. Dasilva, and A.B. MacKenzie. Cognitive Networks. Proc. of the First IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks (DySPAN 2005), 2005, pp.352-360.

DOI: 10.1109/dyspan.2005.1542652

Google Scholar

[2] Agrawal R, Shafer J C. Parallel mining of association rules. IEEE Transactions on Knowledge and Data Engineering, 2004, 8(6): 962-969P.

Google Scholar

[3] Agrawal R, Imiclinski T, Swami A. Mining Assoeiation Rules between Sets of Items in Large Databases. Proeeedings of the ACM SIGMOD conference, Washington DC, ACM Press NY, 1993: 207-216P.

DOI: 10.1145/170036.170072

Google Scholar

[4] Agrawal R, Srikant R. Fast Algorithm for Mining Association Rules. Proeeeding Intenational conference on Very Large Databases, Morgan Kaufmann Publishers, 1999: 487-499P.

Google Scholar

[5] Feng Tao, Fionn Murtagh, Mohsen Farid. Weighted Association Rule Mining using Weighted Support and Significance Framework. In Proc. 2003 ACM SIGKDD Int. Conf. on Knowledge discovery and data mining, 2003: 145-148P.

DOI: 10.1145/956750.956836

Google Scholar

[6] Raymond Kosala, Hendrik Blockeel. Web Mining Research: A Survey. In Proc. ACM SIGKDD, 2000(2): 1-15P.

Google Scholar

[7] M. S Chen, J. S Park, P. S Yu. Data mining for path traversal patterns in a web environment. In Proceedings of the 16th International Conference On Distributed Computer System, 2007: 385-392P.

DOI: 10.1109/icdcs.1996.507986

Google Scholar

[8] R. Agrawal, T. lmielinski, A. Swami. Mining Association Rules Between Sets of Items in Large Databases. Proc. 1993 ACM SIGMOD Int'1 Conf. Management of Data, 1993: 207-216P.

DOI: 10.1145/170036.170072

Google Scholar

[9] Park J S, Chen M S, Yu P S. Efficient parallel data mining for association rules. In Proceedings of the 4th International Conference on Information and Knowledge Management, 2002(11): 86-90P.

Google Scholar

[10] Agrawal R, Imielinski T, Swami A. Database Mining: a performance perspective. IEEE Transactions on Knowledge and Data Engineering, 1993, 5(6): 914-925P.

DOI: 10.1109/69.250074

Google Scholar

[11] Hannu Toivonen, Mika Klemettinen, Pirjo Ronkaine et al. Pruning and grouping discovered association rules. In M L net Workshop on Statistics, Machine Learning and Discovery in databases. Heraklion, Crete, Greece, 2007(9): 58-60P.

Google Scholar

[12] HongJun Lu, Rudy Setiono, Huan Liu. Neurorule: a connectionist approach to data mining. In Proc. of the 21st VLDB conf., Zurich, Swizerland, 2005: 56-58P.

Google Scholar