A New Outlier Detection Algorithms Based on Markov Chain

Article Preview

Abstract:

Detecting outliers in a large set of data objects is a major data mining task aiming at finding different mechanisms responsible for different groups of objects in a data set. In high-dimensional data, these approaches are bound to deteriorate due to the notorious “curse of dimensionality”. In this paper, we propose a novel approach named ODMC (Outlier Detection Based On Markov Chain),the effects of the “curse of dimensionality” are alleviated compared to purely distance-based approaches. A main advantage of our new approach is that our method is to use a major feature of an undirected weighted graph to calculate the outlier degree of each node, In a thorough experimental evaluation, we compare ODMC to the ABOD and FindFPOF for various artificial and real data set and show ODMC to perform especially well on high-dimensional data.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

456-459

Citation:

Online since:

October 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Tian Jiang, Gu Hong. Outlier One Class Support Vector Machines[J]. Journal of Electronics and Information Technology, 2010, vol32(6), 1284- 1288.

Google Scholar

[2] Han J and Kamber M. Data Mining: Concepts and Techniques[M]. San Francisco: Morgan Kaufmann Publishers, 2006: 451-458.

Google Scholar

[3] Hinneburg, A., Aggarwal, C.C., Keim, D.A. What is the nearest neighbor in high dimensional spaces? In: Abbadi, A.E., Brodie, M.L., Chakravarthy, S., et al. eds. Proceedings of the 26th International Conference on Very Large Data Bases. Cairo: Morgan Kaufmann, 2000: 506~515.

Google Scholar

[4] Wei L, Gong XQ, Qian WN, Zhou AY. Finding outliers in high-dimensional space. Journal of Software, 2002, 13(2): 280−290 (in Chinese with English abstract). http: /www. jos. org. cn/ 1000-9825/13/280. pdf.

Google Scholar

[5] He ZY, Xu XF, Huang JZ, Deng SC. FP-Outlier: Frequent pattern based outlier detection. Computer Science and Information System, 2005, 2(1): 103−118.

DOI: 10.2298/csis0501103h

Google Scholar

[6] Xu Xue Song, Zhang Xu, Song Dong Ming. Outlier detection algorithm based on nonlinear data transformation [J], Chinese Journal of engineering science, 2008, 10(9): 74-78.

Google Scholar