Effective Data Localization Using Consistent Hashing in Cloud Time-Series Databases

Article Preview

Abstract:

The commercial time-series database is suitable for processing the time-series data. However, a single commercial time-series database can only accommodate the time-series data acquired by limited amount of sensors. In this paper, in order to cope with the challenge of massive time-series data processing, we first propose a cloud time-series database framework based on commercial time-series databases, and then propose an effective consistent hashing based algorithm for solving the key problem, i.e., the data localization problem, in cloud time-series databases. A performance study shows the superiority of the framework and the algorithm for processing massive time-series data acquired by large amount of sensors.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2246-2251

Citation:

Online since:

August 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] G. Chen and L. Li, An Optimized Algorithm for Lossy Compression of Real-Time Data, Proceedings of the 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 10), IEEE Press, Dec. 2010, p.187–191.

DOI: 10.1109/icicisys.2010.5658770

Google Scholar

[2] A. Singhal and D. E. Seborg, Effect of Data Compression on Pattern Matching in Historical Data, Industrial & Engineering Chemistry Research, vol. 44, Mar. 2005, p.3203–3212.

DOI: 10.1021/ie049707a

Google Scholar

[3] David R. Karger, Eric Lehman, Frank Thomson Leighton, Rina Panigrahy, Matthew S. Levine, and Daniel Lewin, Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web, Proceedings of the 29th ACM Symposium on the Theory of Computing (STOC 97), ACM Press, May 1997, pp.654-663.

DOI: 10.1145/258533.258660

Google Scholar

[4] Ion Stoica, Robert Morris, David R. Karger, M. Frans Kaashoek, and Hari Balakrishnan, Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications, Proceedings of the ACM SIGCOMM 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM 01), ACM Press, Aug. 2001, pp.149-160.

DOI: 10.1145/964723.383071

Google Scholar

[5] Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google File System, Proceedings of the 19th ACM Symposium on Operating Systems Principles (SOSP 03), ACM Press, Oct. 2003, pp.29-43.

DOI: 10.1145/945445.945450

Google Scholar

[6] Jeffrey Dean and Sanjay Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, Proceedings of the 6th USENIX Symposium on Operating System Design and Implementation (OSDI 04), USENIX Association, Dec. 2004, pp.137-150.

Google Scholar

[7] Michael Burrows, The Chubby Lock Service for Loosely-Coupled Distributed Systems, Proceedings of the 7th USENIX Symposium on Operating System Design and Implementation (OSDI 06), USENIX Association, Nov. 2006, pp.335-350.

Google Scholar

[8] Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber, Bigtable: A Distributed Storage System for Structured Data, Proceedings of the 7th USENIX Symposium on Operating System Design and Implementation (OSDI 06), USENIX Association, Nov. 2006, pp.205-218.

DOI: 10.1145/1365815.1365816

Google Scholar

[9] Tengjiao Wang, Bishan Yang, Allen Huang, Qi Zhang, Jun Gao, Dongqing Yang, Shiwei Tang, and Jinzhong Niu, Dynamic Data Migration Policies for Query-Intensive Distributed Data Environments, Proceedings of the Joint Conference of the 11th Asia-Pacific Web Conference and the 10th International Conference on Web-Age Information Management (APWeb/WAIM 09), Apr. 2009, pp.63-75.

DOI: 10.1007/978-3-642-00672-2_8

Google Scholar

[10] Edward P. Holden, Jai W. Kang, Dianne P. Bills, and Mukhtar Ilyassov, Databases in the Cloud: A Work in Progress, Proceedings of the 10th ACM SIGITE International Conference on Information Technology Education (SIGITE 09), ACM Press, Oct. 2009, pp.138-143.

DOI: 10.1145/1631728.1631765

Google Scholar

[11] Ashraf Aboulnaga, Kenneth Salem, Ahmed A. Soror, Umar Farooq Minhas, Peter Kokosielis, and Sunil Kamath, Deploying Database Appliances in the Cloud, IEEE Data Engineering Bulletin, vol. 32, Mar. 2009, pp.13-20.

DOI: 10.1145/1376616.1376711

Google Scholar

[12] Daniel Abadi, Michael J. Carey, Surajit Chaudhuri, Hector Garcia-Molina, Jignesh M. Patel, and Raghu Ramakrishnan, Cloud Databases: What's New?, Proceedings of Very Large Data Base, vol. 3, Sep. 2010, p.1657.

DOI: 10.14778/1920841.1921069

Google Scholar

[13] Chun Chen, Gang Chen, Dawei Jiang, Beng Chin Ooi, Hoang Tam Vo, Sai Wu, and Quanqing Xu, Providing Scalable Database Services on the Cloud, Proceedings of the 11th International Conference on Web Information Systems Engineering (WISE 10), Springer, Dec. 2010. pp.1-19.

DOI: 10.1007/978-3-642-17616-6_1

Google Scholar

[14] Edward P. Holden, Jai W. Kang, Geoffrey R. Anderson, and Dianne P. Bills, Databases in the Cloud: A Status Report, Proceedings of the 12th ACM SIGITE International Conference on Information Technology Education (SIGITE 11), ACM Press, Oct. 2011, pp.171-176.

DOI: 10.1145/2047594.2047642

Google Scholar

[15] Magdalena Balazinska, Bill Howe, and Dan Suciu, Data Markets in the Cloud: An Opportunity for the Database Community, Proceedings of Very Large Data Bases, vol. 4, Oct. 2011, pp.1482-1485.

DOI: 10.14778/3402755.3402801

Google Scholar

[16] Maximilian Ahrens and Gustavo Alonso, Relational Databases, Virtualization, and the Cloud, Proceedings of the 27th International Conference on Data Engineering (ICDE 11), IEEE Press, Apr. 2011, p.1254.

DOI: 10.1109/icde.2011.5767966

Google Scholar

[17] PengCheng Xiong, Yun Chi, Shenghuo Zhu, Hyun Jin Moon, Calton Pu, and Hakan Hacigümüs, Intelligent Management of Virtualized Resources for Database Systems in Cloud Environment, Proceedings of the 27th International Conference on Data Engineering (ICDE 11), IEEE Press, Apr. 2011, pp.87-98.

DOI: 10.1109/icde.2011.5767928

Google Scholar

[18] Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, and Beng Chin Ooi, LogBase: A Scalable Log-structured Database System in the Cloud, Proceedings of Very Large Data Bases, vol. 5, Jun. 2012, pp.1004-1015.

DOI: 10.14778/2336664.2336673

Google Scholar

[19] Chao-Rui Chang, Meng-Ju Hsieh, Jan-Jan Wu, Po-Yen Wu, and Pangfeng Liu, HSQL: A Highly Scalable Cloud Database for Multi-user Query Processing, Proceedings of the 5th IEEE International Conference on Cloud Computing (CLOUD 12), IEEE Press, Jun. 2012, pp.943-944.

DOI: 10.1109/cloud.2012.76

Google Scholar

[20] Carlo Curino, Evan P. C. Jones, Raluca A. Popa, Nirmesh Malviya, Eugene Wu, Samuel Madden, Hari Balakrishnan, and Nickolai Zeldovich, Relational Cloud: A Database Service for the Cloud, Proceedings of the 5th Biennial Conference on Innovative Data Systems Research (CIDR 11), Jan. 2011, pp.235-240.

Google Scholar