Full-Text Search Engine Technology Research Based on Lucene

Article Preview

Abstract:

As an important branch of modern information retrieval technology, full-text search is not only an important tool for dealing with unstructured data, but also one of the mainstream technology of search engines .This paper starts from studying the working principles and process of search engine model in deep and discuss Lucene's architecture with previously knowledge. The main emphasis is placed on the problem of some basic algorithms of Chinese word segmentation and Relevance Ranking. Finally, we set up a system based on Lucene of full-text retrieval by applying these technologies.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

355-358

Citation:

Online since:

May 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Michael McCandless, Erik Hatcher, Otis Gospodnetic. Lucene in Action [M]. Post&Telecoms Press (2011).

Google Scholar

[2] Michael McCandless, Erik Hatcher, Otis Gospodnetic. Lucene in Action 2nd Edition[M](2010).

Google Scholar

[3] W. Bruce Croft Donald Metzler Trevor Strohman.Search Engine Information retrieval in practice.

Google Scholar

[4] Beijing:China Machine Press (2010).

Google Scholar

[5] Wu zhong-xin, Shen jia-li. Lucene Analysis and Application [M]. Beijing: China Machine Press(2008).

Google Scholar

[6] Wu dai-wen, Guo jun-jun; Research and Design of Full-text Retrieval Web System Based on Lucene[J] . Modern electronic technology(2011).

Google Scholar

[7] Wang li-yun, Wang hua, Chen gang, Yao nai-ming; Research and Design of Full-text Retrieval System Based on Lucene [J]. Computer Engineering and Design(2007).

Google Scholar

[8] http : /jakarta. apache. org/lucene.

Google Scholar

[9] http: /blog. csdn. net/kangsheng/archive/2005/03 /19/323627. aspx.

Google Scholar

[10] Search Ranking Factors. http: /www. seomoz. org/article/search-ranking-factors.

Google Scholar