Research and Realization of a Search Engine System for Professional Field

Article Preview

Abstract:

In the light of the deficiency of general search engine technology in professional retrieval,This paper researched and designed a search engine system for professional field (SESPF for short).This system automatically crawls web pages by the spider program.It introduced professional dictionary and filtered the webpages information according to certain rules.At the same time,the system improved the PageRank algorithm and Lucene webpage ranking algorithm.The experimental results show that this system has a higher precision in professional field retrieval compared with the general search engine.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 850-851)

Pages:

745-750

Citation:

Online since:

December 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Zhao Ke, Lu Peng, Li Yongqiang: Design and Implementation of Search Engine Based on Lucene. Computer Engineering, 2011, 37(16): 39-41(In Chinese).

Google Scholar

[2] Information on http: /lucene. apache. org.

Google Scholar

[3] Information on http: /nutch. apache. org.

Google Scholar

[4] Liu Qinchuang: The Design and Realization of the Key of Technology Specialized Search Engine for Finance and Economics Profession. Journal of Hanshan Normal University, 2008, 29(3): 22-25(In Chinese).

Google Scholar

[5] Wang Shuo, You Feng, Shan Lan, Zhao Hengyong: Research of Chinese word segmentation system applies in professional search engine. Computer Engineering and Applications, 2008, 44(19): 142-145(In Chinese).

Google Scholar

[6] Liu Weidong, Lu Ling: Research and Application of PageRank Algorithm Combines with VSM Technique. Computer and Modernization, 2011, 191(7): 96-98(In Chinese).

Google Scholar

[7] Zhang Xian, Zhou Ya: Improvement of an Algorithm for Ranking Pages Based on Lucene. Computer Systems and Applications, 2009, 18(2): 155-158(In Chinese).

Google Scholar

[8] Gu Wenli, Chen Wei, Chen Jiao, Lu Xiaoye: Improved PageRank Algorithm. Computer Systems and Applications, 2012, 21(2): 214-217(In Chinese).

Google Scholar

[9] Wang Dong, Lei Jingsheng: An Improved Ranking Algorithm Based on PageRank. Microelectronics and Computer, 2009, 26(4): 210-213(In Chinese).

Google Scholar

[10] Liang Zhengyou, Pan Tao: Parallel Realization of PageRank Algorithm on Nutch. Computer Engineering and Design, 2010, 31(20): 4354-4356(In Chinese).

Google Scholar