Paper Title:
Research on Improved Clustering Algorithm on Web Usage Mining Based on Scientific Analysis of Web Materials
  Abstract

Clustering analysis is an important method to research the Web user’s browsing behavior and identify the potential customers on Web usage mining. The traditional user clustering algorithms are not quite accurate. In this paper, we give two improved user clustering algorithms, which are based on the associated matrix of the user’s hits in the process of browsing website. To this matrix, an improved Hamming distance matrix is generated by defining the minimum norm or the generalized relative Hamming distance between any two vectors. Then, similar user clustering are obtained by setting the threshold value. At the last step of our algorithm, the clustering results are confirmed by defining the clustering’s Similar Index and setting sub-algorithm. Finally, the testing examples show that the new algorithms are more accurate than the old one, and the real log data presents that the improved algorithms are practical.

  Info
Periodical
Edited by
Helen Zhang and David Jin
Pages
863-867
DOI
10.4028/www.scientific.net/AMM.63-64.863
Citation
B. Li, J. Yang, C. M. Liu, J. D. Zhang, Y. Zhang, "Research on Improved Clustering Algorithm on Web Usage Mining Based on Scientific Analysis of Web Materials", Applied Mechanics and Materials, Vols. 63-64, pp. 863-867, 2011
Online since
June 2011
Export
Price
$32.00
Share

In order to see related information, you need to Login.

In order to see related information, you need to Login.

Authors: Tian Pei Zhou, Wen Fang Huang
Abstract:In the process of recycling chemical product in coking object, ammonia and tar were indispensable both metallurgy and agriculture, so the...
1945
Authors: Ravinder Kumar, Pravin Chandra, M. Hanmandlu
Chapter 7: Machining
Abstract:This paper presents a fast and reliable algorithm for fingerprint verification. Our proposed fingerprint verification algorithm is based on...
888
Authors: Gang Zhu Qiao, Jian Chao Zeng
Chapter 12: Computer-Aided Design, Manufacturing and Engineering
Abstract:The path loss exponent shows the effect of space environment on the RF signals in wireless communication model. In most RSSI based location...
4530
Authors: Xue Feng Wu, Yu Fan
Chapter 6: Mechatronics
Abstract:A new algorithms for parameters of an image irregular boundary circle parameters is presented, which is based on “Curve-Approximate Method”...
639
Authors: Zhan Kun Zhao
Chapter 6: Information Technologies, WEB and Networks Engineering, Information Security, Software Application and Development
Abstract:Efficient data mining model design for a large database in the cloud computing environment is studied. For large databases efficiently mining...
2447