Combination Clustering Evaluation Research on Different Data Distribution Patterns

Article Preview

Abstract:

Clustering analysis, as a practical data mining method, has wide-ranging applications in many fields. But because of different original data resources, clustering results of different data distribution patterns and applicable clustering evaluation methods are different from each other. Aiming at different data distribution patterns, only reasonable clustering evaluation methods can achieve a better recognition of different clustering results for realizing the application value of clustering technology. In this paper, the combination clustering evaluation model is constructed form three angles, through clustering experiment of different artificial simulated data distribution patterns, comparative analysis draw a conclusion that the combination clustering evaluation model constructed is reasonable, and according to applicable clustering and clustering evaluation methods based on different data distribution patterns, the optimization clustering process is constructed for improving the effectiveness and interpretability of different clustering results.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 694-697)

Pages:

2794-2800

Citation:

Online since:

May 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Pang-Ning Tan, Michael Steinbach, Vipin Kumar: Introduction to Data Mining (Beijing: China Machine Press, China 2010).

Google Scholar

[2] J. MacQueen, Proc. of the fifth Berkeley symposium on Mathematical Statistics and Probability 1(1967) 281–297, in press.

Google Scholar

[3] Yuan Chang-an: Data Mining Theory and SPSS Clementine Application (Beijing: Publishing House of Electronics Industry, China 2009).

Google Scholar

[4] T. Kohonen, Proc. of the IEEE 78(1990) 1464–1480, in press.

Google Scholar

[5] Peter J.Rousseeuw, Computational and Applied Mathematics 20(1987)53-65, in press.

Google Scholar

[6] Zhang Hong-bing: SPSS Application (Beijing: Publishing House of Electronics Industry, China 2007).

Google Scholar

[7] Yu-Chen Song, Xiao-Liang Jia, Hai-Dong Meng, Influence analysis of linear data distribution on different clustering results, Advanced Materials Research (2012)3144-3152, in press.

DOI: 10.4028/www.scientific.net/amr.472-475.3144

Google Scholar

[8] Hai-Dong Meng, Yu-Chen Song, Fei-Yan Song, Research and appli-cation of cluster and association analysis in geochemical data processing, Computational Geosciences 15(2011)87–98,in press.

DOI: 10.1007/s10596-010-9199-x

Google Scholar

[9] J. G. Wang, Two-dimensional supervised discriminant projection method for feature extraction, Applied Mathematics & Information Sciences 6 (2012)81–85, in press.

Google Scholar

[10] B. Mobasher, R. Cooley, Data preparation for mining World Wide browsing patterns, Knowledge and Information Systems 1(1999)32–35, in press.

DOI: 10.1007/bf03325089

Google Scholar

[11] Yu-Chen Song, Yi-Feng Fang, Application research of association analysis with Clementine, Proc. of 2nd International Conference on Software Engineering and Data Mining 2(2010)445–449, in press.

Google Scholar

[12] J. T. Kent, Information gain and a general measure of correlation, Biometrika Trust 70 (1983)163–173, in press.

DOI: 10.1093/biomet/70.1.163

Google Scholar

[13] Yu-Chen Song, Hai-Dong Meng, Clustering Algorithms for Arbitray Data Sets, edtied by Encyclopedia of Artificial Intelligence Idea Group Inc, Information Science Reference, (2010) in press.

Google Scholar

[14] Jiawei Han, Micheline Kamber: Data Mining: Concept and Techniques (Beijing: China Machine Press, China 2008).

Google Scholar

[15] Richard A. Johnson, Dean W. Wichern: Applied Multivariate Statistical Analysis (Pearson Education International, China 2003).

Google Scholar

[16] Andriy Kovalenko, Fumio Hirata, "Three-dimension density profiles of water in contact with a solute of arbitrary shape: a RISM approach," Chemical Physics Letters 290(1998) 237-244, in press.

DOI: 10.1016/s0009-2614(98)00471-0

Google Scholar

[17] Yu-Chen Song, Xiao-Liang Jia and Hai-Dong Meng, Comparative study of clustering methods based on linear data distribution, IEEE Technology Management Council, (2012)377-384, in press.

Google Scholar

[18] Mohamed M. Mostafa, "A Psycho-cognitive segmentation of organ donors in Egypt using Kohonen's self-organizing maps," Expert Systems with Applications 38(2011) 6906-6915, in press.

DOI: 10.1016/j.eswa.2010.12.033

Google Scholar