Outlier Detection Algorithm Basing on Similarity Measurement Relation

Article Preview

Abstract:

Outlier detection is an important field of data mining, which is widely used in credit card fraud detection, network intrusion detection ,etc. A kind of high dimensional data similarity metric function and the concept of class density are given in the paper, basing on the combination of hierarchical clustering and similarity, as well as outlier detection algorithm about similarity measurement is presented after the redefinition of high dimension density outliers is put. The algorithm has some value for outliers detection of high dimensional data set in view of experimental result.

You have full access to the following eBook

Info:

Periodical:

Pages:

621-624

Citation:

Online since:

September 2012

Authors:

Export:

Share:

Citation:

[1] D Hawkins, Identifications of Outliers[M]. London: Chapman and Hall, (1980).

Google Scholar

[2] EKnorr, RNg. Algorithms for mining distance-based outliers in large datasets[A]. In Proc of the24th VLDB Conf[C]. NewYork: Morgan Kaufmann, 1998. 392-403.

Google Scholar

[3] J W Han, M Damber. Data Mining: Concepts and Technologies [M]. San Francisco: Morgan Kaufmann, (2001).

Google Scholar

[4] P J Rousseeuw, A M Leroy. Robust Regression and Outlier Detection[M]. New York: John Wiley& Sons, (1987).

DOI: 10.1002/0471725382

Google Scholar

[5] Rakes h Agra w a, l JohannesGehrke , D m i it rios Gunopulos , et al. Au to m at ic Subspace Clustering of H i gh D i m ens i ona lDat a for Data Mining Applicati on [ C ] / /Proceed i ngs of the 1998 ACMSIGMOD Internation a Conference on Management of Data, Seattle, Washington , (1998).

Google Scholar

[6] A ggarwal C C, P rocopiuc C, Wolf JL, etal Fast al gorithmsf or projected clustering [C]/Proc. of the ACM SIGMOD Conference Philadel Phia , P A, 1999: 61-72.

Google Scholar

[7] A gra w alR, Geh rke J . Gun opol os D, et a. l Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications . In ACM SIGMOD Con f eren ce, (1998).

DOI: 10.1145/276305.276314

Google Scholar

[8] Zenshui Xu, Meimei Xia. Distance and similarity measures for hesitant fuzzy sets[J]. Information Sciences, 2011. 2128-2138.

DOI: 10.1016/j.ins.2011.01.028

Google Scholar