Distributed File Information Management System Based on Hadoop

Abstract:

Storing system data on a single machine has two main problems: limited storage space and low reliability. Distribution addresses both fundamentally: many independent machines are integrated into a whole, so their separate resources are pooled. This paper focuses on developing a system, based on SSH, XFire, and Hadoop, that helps users store and manage distributed files. All files stored in HDFS should be encrypted to protect users' privacy. To save resources, the system is designed to avoid uploading duplicate files by checking each file's MD5 string.
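
The duplicate check described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how the MD5 index is stored or queried, so the in-memory `DedupIndex` class, the `md5_hex` helper, and the example paths are hypothetical names chosen for illustration only.

```python
import hashlib
import io


def md5_hex(stream, chunk_size=1 << 20):
    """Return the MD5 hex string of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


class DedupIndex:
    """Hypothetical in-memory index mapping an MD5 string to a stored path.

    A real system would presumably keep this mapping in a database;
    a dict is used here only to keep the sketch self-contained.
    """

    def __init__(self):
        self._paths_by_md5 = {}

    def register(self, stream, target_path):
        """Return (path, is_new).

        If a file with the same MD5 string is already known, return the
        existing path and False so the caller can skip the upload.
        Otherwise record target_path and return it with True.
        """
        key = md5_hex(stream)
        existing = self._paths_by_md5.get(key)
        if existing is not None:
            return existing, False
        self._paths_by_md5[key] = target_path
        return target_path, True


if __name__ == "__main__":
    index = DedupIndex()
    # Two uploads with identical content: only the first needs storing.
    print(index.register(io.BytesIO(b"report body"), "/user/a/report.txt"))
    print(index.register(io.BytesIO(b"report body"), "/user/b/copy.txt"))
```

In this sketch the second call returns the path of the first upload together with `False`, which is the signal to skip the transfer and reference the existing file instead. Hashing in chunks keeps memory use flat for large files.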

Info:

Periodical:

Advanced Materials Research (Volumes 756-759)

Pages:

820-823

Online since:

September 2013

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Citation:

[1] Ding Bo, Chao Ainong, Research on AJAX development based on Struts2 framework, Computer Engineering and Design, 30(16), pp. 3910-3913, 2009.

[2] Wang Meiqin, Study on Method of Integration of the J2EE Framework Based on Struts, Spring and Hibernate, Computer Knowledge and Technology, Vol. 7, No. 24, pp. 5911-5913, August (2011).

[3] Jiang Xiaoping, Li Chenghua, Parallel implementing k-means clustering algorithm using MapReduce programming mode, Huazhong Univ. of Sci. & Tech. (Natural Science Edition), Vol. 39 Sup. I Jun. (2011).

[4] Weiwei Li, Hang Zhao, Research on massive data mining based on MapReduce, Computer Engineering and Applications, 2012-06-01.

[5] Hongbo Shi, Zhenxin Wu, Research on a distributed long term preservation system solution based on HDFS, Researches On Library Science.

[6] Liu Tong, Zhang Yanan, Application of distributed Java Web Services based on XFire, Journal of Changchun University of Technology (Natural Science Edition), Vol. 29, No. 2, Apr. (2008).

[7] Zhang Yizhi, Zhao Yi, Tang Xiaobin, MD5 Algorithm, Computer Science, 2008 Vol 135, NO. 7.
