Design of the Web Log Analysis System Based on Hadoop

Article Preview

Abstract:

As a result of rapid development, the Internet has become an indispensable tool in people's daily life, and web logs have been growing rapidly as well. How to deal with massive logs timely and extract information people need from the logs has become a problem; handling web log by single computer can not meet people's needs any more. Combining cloud computing and Hadoop technology, this paper established a new processing system for the collection of logs and remote parallelization analysis, which not only solved the issues of traditional systems that data handing and collection could not proceed simultaneously, but reduced the performance bottleneck of computing power and storage capacity, thereby saving much time and improving efficiency significantly.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 926-930)

Pages:

2474-2477

Citation:

Online since:

May 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] T.D. HU. G, ZHOU L, KE L. Research on Hadoop-based Net work Log Analysis System . Computer Knowledge and Technology, 2010, 6(22): 6163-6164.

Google Scholar

[2] T.D. Chao Bai. Implementing of Masssive Log Analysis System Based on Parallel Computing . Computer Technology and Development, Vol. 23 (2013) No. 7, p.80. (In Chinese).

Google Scholar

[3] Cloud Computing Security: making Virtual Machines Cloud-Ready [R]. www. cloudready-security. com, (2008).

Google Scholar

[4] Shvachko K, Kuang H, Radia S, et al. The Hadoop distributed file system[C]/Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on. IEEE, 2010: 1-10.

DOI: 10.1109/msst.2010.5496972

Google Scholar

[5] Kai Gao, A net work packet analysis and design of software development . Net land, 2013, (In Chinese).

Google Scholar

[6] T.D. Z. Y Zhang. Research of Grid-Based TCP Log Two-Step Clustering Algorithm . Journal of university of jinan, Vol. 25 (2011) No. 2, p.196. (In Chinese).

Google Scholar

[7] H. Y Zhang. The Cloud Computing Based on Hadoop Platform and Log Analysis, (Harbin University of Science and Technology China 2012) (In Chinese).

Google Scholar