A Kind of Hierarchical K-Means Web Log Clustering Algorithm

Article Preview

Abstract:

Clustering techniques are often used in Web log mining to analyze user’s interest on the web pages. Based on the analysis of advantages and disadvantages of the application of classic clustering algorithm in Web log data mining, the paper brought out a kind of hierarchical K-means Web log clustering algorithm, which integrated K-means clustering algorithm and cohesion-based hierarchical clustering algorithm and overcame shortcoming of high time complexity of hierarchical clustering algorithm. The clustering effect of the algorithm is better than K-means clustering and fit for clustering process of large amount data. The result analysis of practical Web log data clustering also proves the validity of the algorithm.

You might also be interested in these eBooks

Info:

Periodical:

Key Engineering Materials (Volumes 439-440)

Pages:

481-485

Citation:

Online since:

June 2010

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2010 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Jaideep S., Robert C. and Mukund D.: Web Usage Mining- Discovery and Applications of Usage Patterns from Web Data, SIGKDD Explorations, vol. 1, 2000, pp.1-12.

DOI: 10.1145/846183.846188

Google Scholar

[2] Zhang E, Feng Qin-hong, Xuan Hui-yu and Tian Zeng-rui: Web Access Pattern Data-ming, Computer Applications Research, vol. 3, 2001, pp.80-83.

Google Scholar

[3] Daniel A and Keim steve Eick: L Workshop On Visual Data Mining, USAKDD, (2001).

Google Scholar

[4] R. Agrawal, J. Gehrke, D. Gunopulos and P. Raghavan: Automatic Subspace Clustering, Proc. of High Dimensional Data Mining Applications, (1998).

DOI: 10.1145/276305.276314

Google Scholar

[5] Zhang.T., Ramakrishnan R. and Livny M: BIRCH-An efficient Data Clustering Method for Very Large Databases, In Proc. SIGMOD96, (1996).

DOI: 10.1145/235968.233324

Google Scholar

[6] Guha S., Rastogi R. and Shim K.: ROCK- A Robust Clustering Algorithm for Categorical Attributes, In Proc. ICDE99, (1999).

DOI: 10.1109/icde.1999.754967

Google Scholar

[7] A. K. Jain, M.N. Murty and P.J. Flynn: Data Clustering: A Survey, ACM Comput. Surv. (1999).

Google Scholar