Study on Focused Crawler's Application in Searching Petroleum News

Abstract:

This paper applies a focused crawler to the crawling of petroleum news topics. It studies the technologies related to focused crawlers and puts forward a crawling engine strategy and a review strategy for petroleum news topics. Web pages are classified by type, and a different extraction method is adopted for each type; a corresponding link-topic correlation calculation method is designed for the crawling engine strategy. The crawling engine strategy is tested and verified through experiments, and the experimental results show that it balances the accuracy and breadth of the focused crawler well.
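
The abstract does not give the paper's link-topic correlation formula, so the following is only a generic sketch of how such a score might be computed: cosine similarity between a link's anchor text and a weighted topic vocabulary. The names `TOPIC_WEIGHTS`, `link_relevance`, `rank_links`, the keyword weights, and the threshold are all illustrative assumptions, not the authors' method.

```python
import math
import re
from collections import Counter

# Hypothetical petroleum-topic vocabulary with hand-picked weights
# (an assumption for illustration; not taken from the paper).
TOPIC_WEIGHTS = {
    "oil": 1.0, "petroleum": 1.0, "crude": 0.8,
    "refinery": 0.8, "opec": 0.6, "gas": 0.5,
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def link_relevance(anchor_text, topic_weights=TOPIC_WEIGHTS):
    """Cosine similarity between anchor-text term counts and the topic vector."""
    counts = Counter(tokenize(anchor_text))
    if not counts:
        return 0.0
    dot = sum(counts[term] * weight for term, weight in topic_weights.items())
    norm_text = math.sqrt(sum(c * c for c in counts.values()))
    norm_topic = math.sqrt(sum(w * w for w in topic_weights.values()))
    return dot / (norm_text * norm_topic)


def rank_links(links, threshold=0.1):
    """Score (url, anchor_text) pairs; keep those at or above the threshold,
    most relevant first, as a crawl frontier would order them."""
    scored = [(link_relevance(anchor), url) for url, anchor in links]
    return sorted((pair for pair in scored if pair[0] >= threshold), reverse=True)
```

Raising the threshold would make such a crawler more precise but narrower in coverage; lowering it does the opposite. That is the accuracy-versus-breadth trade-off the abstract refers to.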

Info:

Periodical: Advanced Materials Research (Volumes 986-987)

Pages: 2131-2134

Online since: July 2014

Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved
