Research on Storage Strategy of Unstructured Small Files in HDFS

Article Preview

Abstract:

With the wide use of HDFS and increasing scale of small files, problems of HDFS in small files storage gradually exposed. Thus the article put forward a storage strategy of unstructured small file based on the type of file, and optimized the architecture of cluster to save memory and improve the efficiency of file access. Through experiment, the strategy is proved to be effective and reliable.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

3053-3056

Citation:

Online since:

September 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Konstantin Shvachko, Hairong Kuang, Sanjay Radia, Robert Chansler. The hadoop distributed file system. Sunnyvale, California USA, IEEE 2010: 1-10.

DOI: 10.1109/msst.2010.5496972

Google Scholar

[2] GAO Ji-chao. Study and optimization of Hadoop storage strategy. Traffic university of Beijing, (2012).

Google Scholar

[3] WANG Hong-yu. The application of Hadoop in clouding computing. Software, 2011, 32(4): 36-39.

Google Scholar

[4] HONG Xu-sheng, LIN Shi-ping. MapFile-based HDFS small file storage efficiency, Computer system application, 2012, 21(11): 179-182.

Google Scholar

[5] LIU Xu-hui, Han Ji-zhong and ZHONG Yun-qin. Implementing Web GIS on Hadoop: A Case Study of Improving Small File I/O Performance on HDFS. Proceedings of 2009 IEEE Conference on Cluster Computing.

DOI: 10.1109/clustr.2009.5289196

Google Scholar

[6] MACKEY G, SEHRI S, WANG Jun. Improving metadata management for small files in HDFS. Proceedings of 2009 IEEE International Conference on Cluster Computing and Workshops.

DOI: 10.1109/clustr.2009.5289133

Google Scholar

[7] Information on http: /Hadoop. apache. org.

Google Scholar

[8] Information on http: /wiki. apache. org/Hadoop/SequenceFile.

Google Scholar

[9] LIU Xiao-jun, XU Zheng-quan. A small file storage strategy combined RDBMS and Hadoop. Journal of wuhan university, 2013, 38(1).

Google Scholar

[10] ZHANG Chun-ming, HE Ting-ting. A small file storage and reading method in Hadoop. Computer application and software, 2012, 29(11): 95-100.

Google Scholar