A Design of a Sci-Tech Information Retrieval Platform Based on Apache Solr and Web Mining

Abstract:

To serve the needs of high-tech companies and let them obtain sci-tech information more quickly and efficiently, a sci-tech information retrieval platform is proposed. The platform has four parts: the web spider, the Solr engine, the SQL Server 2008 database, and the client. Each part handles one core concern, and this separation makes the whole system more flexible, scalable, and fault tolerant. The web spider collects sci-tech information from the Internet; the Solr engine indexes the documents gathered by the web spider; the SQL Server database stores all user information and the configuration of the whole system; and the client provides several REST-like APIs for modifying the configuration and retrieving the latest information in the platform.
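As an illustration of the hand-off between the web spider, the Solr engine, and the client, the following minimal sketch builds the two HTTP requests such a pipeline would plausibly issue: a JSON update that indexes crawled pages, and a select query that retrieves them. It is not the authors' implementation. The host, the core name `scitech`, the field names `title` and `content`, and both helper functions are assumptions made for illustration; only Solr's standard `/update` and `/select` handlers are taken as given. The sketch constructs the requests without sending them, so it runs without a Solr server.

```python
import json
from urllib.parse import urlencode

# Assumed values: the abstract does not name a host, a core, or a schema.
SOLR_BASE = "http://localhost:8983/solr/scitech"


def build_update_request(pages):
    """Turn crawled pages into a (url, body) pair for Solr's JSON update handler."""
    docs = [
        # Hypothetical schema: the page URL doubles as the unique document id.
        {"id": page["url"], "title": page["title"], "content": page["text"]}
        for page in pages
    ]
    url = SOLR_BASE + "/update?" + urlencode({"commit": "true"})
    return url, json.dumps(docs)


def build_search_url(keywords, rows=10):
    """Build a select URL that a client could call to fetch matching documents."""
    params = {"q": keywords, "rows": rows, "wt": "json"}
    return SOLR_BASE + "/select?" + urlencode(params)


if __name__ == "__main__":
    crawled = [{"url": "http://example.com/a", "title": "Sample", "text": "Body text"}]
    update_url, body = build_update_request(crawled)
    print(update_url)
    print(body)
    print(build_search_url("content:solr"))
```

In a deployment along these lines, the spider would POST the update body with a `Content-Type: application/json` header, while user accounts and crawl configuration would stay in the relational database rather than in the index.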

Pages: 883-886

Online since: February 2014

Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved
