Using Self-Similarity to Incorporate Dimensionality Reduction and Cluster Evolution Tracking

Article Preview

Abstract:

Cluster evolution tracking and dimensionality reduction have been studied intensively but separately in the time decayed and high dimensional stream data environment during the past decades. However, the interaction between the cluster evolution and the dimensionality reduction is the most common scenario in the time decayed stream data. Therefore, the dimensionality reduction should interact with cluster operation in the endless life cycle of stream data. In this paper, we first investigate the interaction between dimensionality reduction and cluster evolution in the high dimensional time decayed stream data. Then, we integrate the on-line sequential forward fractal dimensionality reduction technique with self-adaptive technique for cluster evolution tracking based on multi-fractal. Our performance experiments over a number of real and synthetic data sets illustrate the effectiveness and efficiency provided by our approach.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2242-2247

Citation:

Online since:

July 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Guha S, Meyerson A and Mishra N: IEEE Transactions on Knowledge and Data Engineering, Vol. 15, No. 3 (2003) pp.515-528.

Google Scholar

[2] Aoning Zhou, Feng Cao and Weining Qian: Knowledge and Information Systems, Vol. 15, No. 2 (2007) pp.1-34.

Google Scholar

[3] Aggarwal C, Han J and Wang J: A framework for clustering evolving data streams, In Proceedings of the 29th international conference on Very large databases, P. 81-92 (2003).

Google Scholar

[4] Jianlong Chang, Feng CAO and Aoning Zhou: Journal of Software, Vol. 18, No. 4(2007) pp.905-918. (In Chinese).

Google Scholar

[5] Huan Liu, Motoda Hiroshi and Lei Yu: Artificial Intelligence, Vol. 159, No. 1 (2004) pp.49-74.

Google Scholar

[6] Qinbao Song, Jinjie Ni and Guangtao Wang: IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, Vol. 25, No. 1 (2013) pp.1-14.

DOI: 10.1109/tkde.2011.181

Google Scholar

[7] Guanghui Yan and Zhanhuai Li: Journal of Computer Research and Development, Vol. 45, No. 11 (2008) pp.1955-1964. (In Chinese).

Google Scholar

[8] Daniel Barbarâ and Ping Chen: Data Mining and Knowledge Discovery, Vol. 7, No. 2 (2003) pp.123-152.

Google Scholar

[9] Guanghui Yan, Zhanhuai Li and Jianwu Dang: Journal of Software, Vol. 19, No. 6 (2008) pp.1283-1300. (In Chinese).

Google Scholar

[10] Guanghui Yan, Zhicheng Ma, Shaoling He, Yun Liu and Xiaohui Dong: An autonomous method on tracking stream data cluster evolution. In: Proceedings. Of ICIS'10. pp.461-465 (2010).

DOI: 10.1109/icicisys.2010.5658483

Google Scholar

[11] Guanghui Yan, Yun Liu and Xin Shu: Integrating Fractal Dimensionality Reduction with Cluster Evolution Tracking. In: Proceedings of FSKD'11. pp.1668-1672 (2011).

Google Scholar

[12] Catherine Blake and Christopher J. Merz: UCI repository of machine learning databases on http: /www. ics. uci. edu/-mlearn/MLRePository. html.

Google Scholar