Improved K-MEANS Algorithm Based on Samples

Article Preview

Abstract:

Clustering analysis plays an important role in scientific research and commercial application. K-means algorithm is a widely used partition method in clustering. in this method.The number of clusters is predefined and the technique is highly dependent off the initial identification of elements that represent the clusters well. As the dataset’s scale increases rapidly, it is difficult to use K-means and deal with massive data. partitions.To prevent this problem,refining initial points algorithm provided.it can reduce execution time and improve solutions for large data by setting the refinement of initial conditions.The experiments demonstrate that sample-based K-means is more stable and more accurate.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

472-475

Citation:

Online since:

February 2015

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] J. Han and M. Kamber. Data Mining: Concepts and Techniques, Morgan Kaufmann Pubhshers, San Francisco, CA. (2001).

Google Scholar

[2] M. Ester. H.P. KriegeL j. Sander, and X. xu. A density-based algorithm for discovering clusters i large spatial databases". Proc. 1996. Int. Conf Knowledge, Discovery and Data Mining(KDD, 96), Portland. OR, August l 996.

Google Scholar

[3] P. S. Bradley, and U. M. Fayvad , Refining initial points for K-means clustering,. Proceeding of the Fifteenth International Conference on Machine Learning(ICML98). 1998. pp. 9l-99.

Google Scholar

[4] The Analysis of a Simple K-Means Algorithm.T. Kanungo.D. M Mount. N S. Netanyahu, C Piatko ,R. Silverman and A. Y Wu. (2000).

DOI: 10.21236/ada458738

Google Scholar

[5] R. Kannan. S Vempala. and Adrian Vetta. On Clusterings. "Good, Bad and spectral". Proc,. of the 41st foundations of Computer science, Redondo Beach. (2000).

DOI: 10.1109/sfcs.2000.892125

Google Scholar

[6] Usama Fayyad, Cory Reina, P. S. Bradley: Initialization of Iterative Refinement Clustering Algorithms. Microsoft Research Technical Report MSR-TR-98-38, June (1998).

Google Scholar