Building Schema of Assorted Index Based on Huffman Encoding in HBase

Abstract:

To store and retrieve massive amounts of data, many companies prefer to use a distributed database. HBase, an open-source, distributed, column-oriented database, has been widely adopted for its low cost and high scalability. Because the rowkey is the basis for retrieving stored data, its design directly affects the efficiency and success rate of data retrieval in HBase. By classifying information and encoding the data type in the rowkey with Huffman coding, the data type can be effectively bound to the data content, providing an efficient solution for class-based information retrieval. Experimental results show that this design supports retrieving information by category and improves overall retrieval efficiency.
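The idea of binding a Huffman-coded data type to the rowkey can be illustrated with a minimal sketch. The paper's actual schema is not shown in this preview, so everything below is an assumption made for illustration: the category names, their frequencies, the `|` separator, and the helper names `huffman_codes`, `make_rowkey`, and `scan_category` are all hypothetical, and a sorted in-memory list stands in for an HBase table. The sketch relies only on a standard property of Huffman codes: they are prefix-free, so a prefix match on one category's code cannot match a rowkey from another category.

```python
import heapq
from itertools import count


def huffman_codes(freqs):
    """Build a Huffman code table {symbol: bitstring} from symbol frequencies."""
    if len(freqs) == 1:
        # A single symbol still needs a non-empty code.
        return {next(iter(freqs)): "0"}
    tiebreak = count()  # unique counter so heap entries never compare dicts
    heap = [(f, next(tiebreak), {sym: ""}) for sym, f in sorted(freqs.items())]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)   # lowest frequency subtree
        f2, _, right = heapq.heappop(heap)  # second lowest
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (f1 + f2, next(tiebreak), merged))
    return heap[0][2]


def make_rowkey(codes, category, record_id):
    """Hypothetical rowkey layout: <category code>|<record id>."""
    return f"{codes[category]}|{record_id}"


def scan_category(rowkeys, codes, category):
    """Stand-in for a prefix scan: keep keys whose prefix is the category code."""
    prefix = codes[category] + "|"
    return [k for k in sorted(rowkeys) if k.startswith(prefix)]


# Hypothetical categories with assumed relative frequencies.
FREQS = {"text": 50, "image": 30, "audio": 15, "video": 5}
CODES = huffman_codes(FREQS)

ROWKEYS = [
    make_rowkey(CODES, "text", "doc-001"),
    make_rowkey(CODES, "image", "img-001"),
    make_rowkey(CODES, "text", "doc-002"),
    make_rowkey(CODES, "video", "vid-001"),
]

if __name__ == "__main__":
    print(CODES)
    print(scan_category(ROWKEYS, CODES, "text"))
```

Under these assumptions, more frequent categories receive shorter codes, so the category prefix adds the least overhead to the rowkeys that occur most often, while rows of one category sort together and can be read with a single prefix match.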

Info:

Pages: 6270-6274

Online since: May 2014

Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved
