Outlier Detection Algorithm Basing on Similarity Measurement Relation

Hong Bin Fang

doi:10.4028/www.scientific.net/AEF.6-7.621

Paper Titles

Filter-Based Information Selection Mechanism in Publish/Subscribe Middleware
p.595

A Research of High-Performance Geometric Precision Correction Based on Reference Image Database
p.601

Enterprise Information Management Performance Evaluation Based on Grey Fuzzy Theory
p.607

An Improvement to Advanced Centroid Location Algorithm
p.615

Outlier Detection Algorithm Basing on Similarity Measurement Relation
p.621

Construction Search Engine Based on Formal Concept Analysis and Association Rule Mining
p.625

The Application of Fuzzy Association Rule Mining in E-Commerce Information System Mining
p.631

The Development of E-Commerce Recommendation System Based on Collaborative Filtering
p.636

Using Rough Set to Construct the Enterprise Information Management System
p.641

HomeAdvanced Engineering ForumAdvanced Engineering Forum Vols. 6-7Outlier Detection Algorithm Basing on Similarity...

Outlier Detection Algorithm Basing on Similarity Measurement Relation

Abstract:

Outlier detection is an important field of data mining, which is widely used in credit card fraud detection, network intrusion detection ,etc. A kind of high dimensional data similarity metric function and the concept of class density are given in the paper, basing on the combination of hierarchical clustering and similarity, as well as outlier detection algorithm about similarity measurement is presented after the redefinition of high dimension density outliers is put. The algorithm has some value for outliers detection of high dimensional data set in view of experimental result.

By email View Pdf

You have full access to the following eBook

Read eBook

Info:

Periodical:

Advanced Engineering Forum (Volumes 6-7)

Pages:

621-624

DOI:

https://doi.org/10.4028/www.scientific.net/AEF.6-7.621

Citation:

Cite this paper

Online since:

September 2012

Authors:

Hong Bin Fang

Keywords:

Data Mining (DM), Outlier, Similarity Relation

Export:

RIS, BibTeX

Permissions:

Creative Commons CC BY 4.0

Citation:

References

[1] D Hawkins, Identifications of Outliers[M]. London: Chapman and Hall, (1980).

Google Scholar

[2] EKnorr, RNg. Algorithms for mining distance-based outliers in large datasets[A]. In Proc of the24th VLDB Conf[C]. NewYork: Morgan Kaufmann, 1998. 392-403.

Google Scholar

[3] J W Han, M Damber. Data Mining: Concepts and Technologies [M]. San Francisco: Morgan Kaufmann, (2001).

Google Scholar

[4] P J Rousseeuw, A M Leroy. Robust Regression and Outlier Detection[M]. New York: John Wiley& Sons, (1987).

DOI: 10.1002/0471725382

Google Scholar

[5] Rakes h Agra w a, l JohannesGehrke , D m i it rios Gunopulos , et al. Au to m at ic Subspace Clustering of H i gh D i m ens i ona lDat a for Data Mining Applicati on [ C ] / /Proceed i ngs of the 1998 ACMSIGMOD Internation a Conference on Management of Data, Seattle, Washington , (1998).

Google Scholar

[6] A ggarwal C C, P rocopiuc C, Wolf JL, etal Fast al gorithmsf or projected clustering [C]/Proc. of the ACM SIGMOD Conference Philadel Phia , P A, 1999: 61-72.

Google Scholar

[7] A gra w alR, Geh rke J . Gun opol os D, et a. l Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications . In ACM SIGMOD Con f eren ce, (1998).

DOI: 10.1145/276305.276314

Google Scholar

[8] Zenshui Xu, Meimei Xia. Distance and similarity measures for hesitant fuzzy sets[J]. Information Sciences, 2011. 2128-2138.

DOI: 10.1016/j.ins.2011.01.028

Google Scholar