Hybrid Genetic Clustering by Using FCM and Geodesic Distance for Complex Distributed Data

Article Preview

Abstract:

To efficiently find hidden clusters in datasets with complex distributed data,inspired by complementary strategies, a hybrid genetic clustering algorithm was developed, which is on the basis of the geodesic distance metric, and combined with the Fuzzy C-Means clustering (FCM) algorithm. First, instead of using Euclidean distance,the new approach employs geodesic distance based dissimilarity metric during all fitness evaluation. And then, with the help of FCM clustering, some sub-clusters with spherical distribution are partitioned effectively. Next, a genetic algorithm based clustering using geodesic distance metric, named GCGD, is adopted to cluster the clustering centers obtained from FCM clustering. Finally, the final results are acquired based on above two clustering results. Experimental results on eight benchmark datasets clustering questions show the effectiveness of the algorithm as a clustering technique. Compared with conventional GCGD, the hybrid clustering can decrease the computational time obviously, while retaining high clustering correct ratio.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2597-2601

Citation:

Online since:

December 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] J. Han, and M. Kamber, Data mining: Concepts and Techniques, 2nd ed., San Francisco, Morgan Kaufmann, 2006.

Google Scholar

[2] E.R. Hruschka, R.J.G.B. Campello, A.A. Freitas, and A.C.P.L.F. de Carvalho, "A Survey of Evolutionary Algorithms for Clustering," IEEE Transactions on Systems, Man and Cybernetics, Part C: Applications and Reviews, vol.39, no.2, March 2009, p.133–155.

DOI: 10.1109/tsmcc.2008.2007252

Google Scholar

[3] G. Li, J. zhuang, H. N. Hou and D. H. Yu, "A genetic algorithm based clustering using geodesic distance measure," In Proceedings of IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS2009, Shanghai, China, Nov. 20-22, 2009, p.274–278.

DOI: 10.1109/icicisys.2009.5357846

Google Scholar

[4] G. Li, H. X. Wang and J. Zhuang, "Machinery fault detection using geodesic distance based genetic clustering algorithm," Advanced Materials Research, vol. 411, 2012, p.572–575.

DOI: 10.4028/www.scientific.net/amr.411.572

Google Scholar

[5] J. Handl. Multiobjective approaches to the data-driven analysis of biological systems. University of Manchester. 2006.

Google Scholar