Paper Title:
Research on Dynamic Data Streams Clustering Algorithm –Pdstream Based on PCA and Density
  Abstract

The research on data streams clustering has become a focus in the field of data streams mining. Because the number of data streams is too large, and CPU of the computer has limited memory and time, it’s difficult to carry out clustering quickly and effectively. For that problem, we design an improved clustering algorithm for dynamic data streams based on principal component analysis and density. The PDStream algorithm effectively overcomes the shortcomings of the STREAM algorithm controlled by historical data and the CluStream algorithm is difficult to describe non-spherical and out "old data", resulting in huge amount of data. In the course of the experiment, we compare with the STREAM algorithm, the PDStream algorithm shows the superiority of handling mass data and the characteristics of high-quality clustering.

  Info
Periodical
Edited by
Zhenyu Du and Bin Liu
Pages
108-112
DOI
10.4028/www.scientific.net/AMM.26-28.108
Citation
M. Zheng, C. H. Ju, Z. Rui, "Research on Dynamic Data Streams Clustering Algorithm –Pdstream Based on PCA and Density", Applied Mechanics and Materials, Vols. 26-28, pp. 108-112, 2010
Online since
June 2010
Export
Price
$32.00
Share

In order to see related information, you need to Login.

In order to see related information, you need to Login.

Authors: Zhong Ping Zhang, Yong Xin Liang
Abstract:This paper proposes a new data stream outlier detection algorithm SODRNN based on reverse nearest neighbors. We deal with the sliding window...
1032
Authors: Zeng Ying He
Chapter 5: Numerical Methods, Computation Methods and Algorithms for Modeling, Simulation and Optimization, Data Mining and Data Processing
Abstract:Aiming at some deficiencies of existing network intrusion detection system, the paper proposes a network intrusion detection system model...
2081
Authors: Ming Ji, Fei Wang, Jia Ning Wan, Yuan Liu
Chapter 5: Numerical Methods, Computation Methods and Algorithms for Modeling, Simulation and Optimization, Data Mining and Data Processing
Abstract:The purpose of this report is to investigate current existing algorithm to cluster sequential data based on hidden Markov model (HMM)....
1750
Authors: Zhi Hai Sun, Bin Hu, Ying Meng, Wen Hui Zhou
Chapter 4: Practice of Data Processing for Intelligent Systems
Abstract:Visual object detection and tracking have become an important step between computer vision and video analysis. Recent methods almost use mean...
600