Queries for Uncertain Data on Dataspace Based on Effective Clustering Algorithm

Article Preview

Abstract:

This paper presents a probabilistic data stream clustering method P-Stream. An effective clustering algorithm called P-Stream for probabilistic data stream is developed in this paper for the first time. For the uncertain tuples in the data stream, the concepts of strong cluster, transitional clusters and weak cluster are proposed in the P-Stream. With these concepts, an effective strategy of choosing candidate cluster is designed, which can find the sound cluster for every continuously arriving data point. In this paper, we systematically defined the dataspace, the uncertain data, and proposed a updated algorithm of queries on uncertain data based on Effective Clustering Algorithm.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1529-1532

Citation:

Online since:

August 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Alon Halevy, Michael Franklin, Principle of Dataspace Systems, 2006, dl. acm. org.

Google Scholar

[2] Zhou Fan, Li Shuquan, Top-k query processing on probabilistic data, Journal Of Electronic Measurement And Instrument, 2007, Vol. 24 No. 7 605~656.

Google Scholar

[3] Michael Franklin, Alon Halevy, From Database to Dataspaces: A New Abstraction For Information Management, 2005. 12, ACM SIGMOD.

DOI: 10.1145/1107499.1107502

Google Scholar

[4] X. Dong, A, Halevy and C. Yu. Probabilistic schema mapping. Technical Report 2006-12-01, Univ. of Washington, (2006).

Google Scholar

[5] X. Dong, A, Halevy. A platform for personal information management and integration. In CIDR, (2005).

Google Scholar

[6] Ao-Ying Zhou, Che-Qing Jin, A Survey on the Management of Uncertain Data, Chinese Journal of Computers, 2009, Vol. 32 No. 1.

Google Scholar

[7] Li Jian-Zhong, Li Jin-Bao, Concepts, issues and advance of sensor networks and data management of sensor networks. Journal of software, 2003, 14(10): 1717-1727.

Google Scholar