Topic Crawling Strategy Based on Wikipedia and Analysis of Pages' Similarity
Abstract:
To address the weaknesses of existing topic crawling strategies, this paper proposes a new method based on Wikipedia and the analysis of page similarity. First, the topic is described using Wikipedia. Next, the downloaded web pages are processed. Finally, the priorities of the links are calculated from text relevance and an analysis of the pages' links. The results indicate that the new method outperforms traditional ones in both search results and topic relevance, and that it merits wider adoption.
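The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of ranking links by topic relevance. It makes several assumptions that are not from the paper: the topic description is a plain term-frequency vector (for example, one built from the text of a Wikipedia article), text relevance is measured with cosine similarity, the anchor text of a link stands in for the link analysis, and the two scores are mixed with a hypothetical weight `alpha`. The names `term_vector`, `cosine_similarity`, `link_priority`, and `Frontier` are illustrative only.

```python
import heapq
import math
import re
from collections import Counter


def term_vector(text):
    """Build a lowercased bag-of-words term-frequency vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def link_priority(topic_vec, page_text, anchor_text, alpha=0.5):
    """Mix page-level and anchor-level relevance (alpha is an assumed weight)."""
    page_score = cosine_similarity(topic_vec, term_vector(page_text))
    anchor_score = cosine_similarity(topic_vec, term_vector(anchor_text))
    return alpha * page_score + (1 - alpha) * anchor_score


class Frontier:
    """Crawl frontier that returns the highest-priority unseen URL first."""

    def __init__(self):
        self._heap = []
        self._seen = set()

    def push(self, url, priority):
        if url not in self._seen:
            self._seen.add(url)
            # heapq is a min-heap, so store the negated priority.
            heapq.heappush(self._heap, (-priority, url))

    def pop(self):
        neg_priority, url = heapq.heappop(self._heap)
        return url, -neg_priority
```

In use, a crawler would build `topic_vec` once from the topic description, call `link_priority` for each link extracted from a downloaded page, push the result into a `Frontier`, and fetch whatever `pop` returns next. A real system would likely add stop-word removal, TF-IDF weighting, and a proper link-structure score in place of the anchor-text proxy.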
Info:
Pages: 2407-2412
Online since: November 2012
Copyright: © 2012 Trans Tech Publications Ltd. All Rights Reserved