Research of Distributed Search Engine Based on Hadoop

Abstract:

By combining the Map/Reduce programming model, the Hadoop Distributed File System (HDFS), Lucene inverted-index technology, and ICTCLAS Chinese word segmentation, we designed and implemented a distributed search engine system based on Hadoop. Tests of the system on a four-node Hadoop cluster show that the Hadoop platform can be used in search engines to improve system performance, reliability, and scalability.
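
The abstract does not describe the implementation, so the following is only a minimal sketch of the general idea of building an inverted index with a map step and a reduce step. It is plain Python that simulates the two phases in memory and does not use Hadoop, Lucene, or ICTCLAS. The function names (`tokenize`, `map_phase`, `shuffle`, `reduce_phase`, `build_inverted_index`, `search`) are hypothetical, and the whitespace tokenizer is an assumed stand-in for a real Chinese word segmenter.

```python
from collections import defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace splitting stands in for a real
    # word segmenter such as ICTCLAS.
    return text.lower().split()


def map_phase(doc_id, text):
    # Map step: emit one (term, doc_id) pair per token occurrence.
    for term in tokenize(text):
        yield term, doc_id


def shuffle(pairs):
    # Group values by key, as a Map/Reduce framework does between phases.
    grouped = defaultdict(list)
    for term, doc_id in pairs:
        grouped[term].append(doc_id)
    return grouped


def reduce_phase(term, doc_ids):
    # Reduce step: build a postings list of sorted, de-duplicated doc ids.
    return term, sorted(set(doc_ids))


def build_inverted_index(docs):
    # docs maps doc_id -> text; the result maps term -> postings list.
    pairs = [pair for doc_id, text in docs.items()
             for pair in map_phase(doc_id, text)]
    return dict(reduce_phase(term, ids) for term, ids in shuffle(pairs).items())


def search(index, query):
    # AND query: return ids of documents containing every query term.
    postings = [set(index.get(term, [])) for term in tokenize(query)]
    return sorted(set.intersection(*postings)) if postings else []
```

In a real deployment the map and reduce steps would run as distributed tasks over files stored in HDFS, and the postings would be written by an indexing library rather than kept in a dictionary; this sketch only shows the data flow.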

Info:

Pages:

171-174

Online since:

September 2014

Copyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved
