Multi-Radius Density Clustering Algorithm Based on Outlier Factor

Article Preview

Abstract:

This paper proposes a novel multi-radius density clustering algorithm based on outlier factor. The algorithm first calculates the density-similar-neighbor-based outlier factor (DSNOF) for each point in the dataset according to the relationship of the density of the point and its neighbors, and then treats the point whose DSNOF is smaller than 1 as a core point. Second, the core points are used for clustering by the similar process of the density based spatial clustering application with noise (DBSCAN) to get some sub-clusters. Third, the proposed algorithm merges the obtained sub-clusters into some clusters. Finally, the points whose DSNOF are larger than 1 are assigned into these clusters. Experiments are performed on some real datasets of the UCI Machine Learning Repository and the experiments results verify that the effectiveness of the proposed model is higher than the DBSCAN algorithm and k-means algorithm and would not be affected by the parameter greatly.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

427-431

Citation:

Online since:

January 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Mete ÇELİK, Filiz DADAŞER-ÇELİK and Ahmet Şakir DOKUZ, Anomaly Detection in Temperature Data Using DBSCAN Algorithm, in proc. of 2011 International Symposium on Innovations in Intelligent Systems and Applications, 2011, 91-95.

DOI: 10.1109/inista.2011.5946052

Google Scholar

[2] Pihui Huang, Wenchieh Chou, Wentzu Lin, Using SOM and DBSCAN-based models for landslide hazard and spatial correlations analysis: A case study in central Taiwan, in proc. of 20th International Conference on Geoinformatics: 2012, 1-5.

DOI: 10.1109/geoinformatics.2012.6270257

Google Scholar

[3] Pangning Tan and Micheal Steinbacn, Vipin Kumar, Introduction to Data Mining, Beijing, Post & Telecom Press, 2011, Chapter 8.

Google Scholar

[4] Cao H, Si G, Zhang Y, et al. Enhancing effectiveness of density-based outlier mining scheme with density-similarity-neighbor-based outlier factor[J]. Expert Systems with Applications, 2010, 37(12): 8090-8101.

DOI: 10.1016/j.eswa.2010.05.079

Google Scholar

[5] Frank, A., Asuncion, A. UCI Machine Learning Repository [http: /archive. ics. uci. edu/ml]. Irvine, CA: University of California, School of Information and Computer Science, (2010).

Google Scholar

[6] Julia Handl, Joshua Knowles and Douglas B. Kell. Computation cluster validation in post-genomic data analysis. Bioinformatics, 2005, 21(15): 3201-3212.

DOI: 10.1093/bioinformatics/bti517

Google Scholar