Analysis and Improvement for K-Means Algorithm

Article Preview

Abstract:

K-Means algorithm is one of the mostly used foundation algorithm in data mining, it base on a greedy clustering algorithm. This paper will introduce this algorithm and analysis. Then prove the correctness of the algorithm. And then show the productivity of this algorithm. And at last, this paper will show some improvement to K-Means algorithm, including how to choose initial center points, and how to calculate the means. This will improve the algorithm at a certain extent.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1976-1980

Citation:

Online since:

March 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] P.K. Agarwal and C.M. Procopiuc, ªExact and Approximation Algorithms for Clustering, º Proc. Ninth Ann. ACM-SIAM Symp. Discrete Algorithms, pp.658-667, Jan. (1998).

Google Scholar

[2] K. Alsabti, S. Ranka, and V. Singh, ªAn Efficient k-means Clustering Algorithm, º Proc. First Workshop High Performance DataMining, Mar. (1998).

Google Scholar

[3] S. Arora, P. Raghavan, and S. Rao, ªApproximation Schemes for Euclidean k-median and Related Problems, º Proc. 30th Ann. ACM Symp. Theory of Computing, pp.106-113, May (1998).

Google Scholar

[4] S. Arya and D. M. Mount, ªApproximate Range Searching, Computational Geometry: Theory and Applications, vol. 17, pp.135-163, (2000).

Google Scholar

[5] S. Arya, D.M. Mount, N.S. Netanyahu, R. Silverman, and A.Y. Wu, An Optimal Algorithm for Approximate Nearest NeighborSearching, ºJ. ACM, vol. 45, pp.891-923, (1998).

DOI: 10.1145/293347.293348

Google Scholar

[6] G.H. Ball and D.J. Hall, ªSome Fundamental Concepts and Synthesis Procedures for Pattern Recognition Preprocessors, Proc. Int'l Conf. Microwaves, Circuit Theory, and Information Theory, Sept. (1964).

Google Scholar

[7] J.L. Bentley, ªMultidimensional Binary Search Trees Used for Associative Searching, º Comm. ACM, vol. 18, pp.509-517, (1975).

DOI: 10.1145/361002.361007

Google Scholar