Paper Title:
A Improved Topics Search Algorithm Based on PSO Strategy for Web Mining
  Abstract

The HITS algorithm assigns the same weight to every link between Web pages, which results in topic drift. In this paper, a new focused crawling approach based on the PSO algorithm (PSOHITS) is proposed. The method uses PSO to selectively seek out pages that are relevant to a pre-defined set of topics, increases the chance of crawling pages that are linked from pages with low content relevance, and broadens the relevance search scope of the crawler. In addition, hyperlink metadata is used to predict the topic relevance of the page a link points to, which speeds up information crawling. Experiments show that the proposed algorithm improves the relevance ratio by 15% to 36%. Furthermore, it avoids topic drift well and improves the accuracy of information collection. It has theoretical and practical value for search engine research.
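The link-weighting issue mentioned in the abstract can be illustrated with a small sketch. This is not the PSOHITS method, whose details the abstract does not give; it is only standard HITS power iteration with an optional per-link weight. The function name `hits`, the toy graph, and the weight values are all hypothetical, and the weights stand in for whatever topic-relevance estimate a focused crawler might supply.

```python
import math


def hits(links, weights=None, iterations=50):
    """Power-iteration HITS on a graph given as {source: [targets]}.

    `weights` maps (source, target) -> link weight. Missing entries
    default to 1.0, which reproduces classic HITS with equal weights.
    """
    weights = weights or {}
    nodes = set(links) | {t for targets in links.values() for t in targets}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: weighted sum of hub scores of in-linking pages.
        auth = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                auth[dst] += weights.get((src, dst), 1.0) * hub[src]
        # Hub score: weighted sum of authority scores of linked-to pages.
        hub = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                hub[src] += weights.get((src, dst), 1.0) * auth[dst]
        # L2-normalise both score vectors so they stay bounded.
        for scores in (auth, hub):
            norm = math.sqrt(sum(v * v for v in scores.values())) or 1.0
            for n in scores:
                scores[n] /= norm
    return hub, auth


# Hypothetical toy graph: the off-topic page has more in-links.
links = {"seed": ["on_topic", "off_topic"], "other": ["off_topic"]}

# Equal link weights (classic HITS): link count alone decides the ranking.
_, auth_uniform = hits(links)

# Assumed relevance weights: links to the off-topic page are down-weighted.
relevance = {("seed", "off_topic"): 0.1, ("other", "off_topic"): 0.1}
_, auth_weighted = hits(links, relevance)

print(sorted(auth_uniform, key=auth_uniform.get, reverse=True)[0])
print(sorted(auth_weighted, key=auth_weighted.get, reverse=True)[0])
```

With equal weights the off-topic page ranks highest in authority simply because it has more in-links, which is the kind of drift the abstract describes. Once the assumed relevance weights are applied, the on-topic page ranks first.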

  Info
Periodical
Key Engineering Materials (Volumes 439-440)
Edited by
Yanwen Wu
Pages
1481-1486
DOI
10.4028/www.scientific.net/KEM.439-440.1481
Citation
H. Q. Zhan, "A Improved Topics Search Algorithm Based on PSO Strategy for Web Mining", Key Engineering Materials, Vols. 439-440, pp. 1481-1486, 2010
Online since
June 2010

Authors: Jun Zhang, Kan Yu Zhang
Chapter 19: Modeling, Analysis, and Simulation of Manufacturing Processes II
Abstract: Good dynamic performance of a system has great significance in the traditional sense; furthermore, it is more important at the point of...
4768
Authors: Da Wang, Hong Yu Bian
Chapter 1: Mechatronics
Abstract: In order to further improve the accuracy of the sonar image registration, a novel hybrid algorithm was proposed. It proposed the normalized...
1811
Authors: Wei Hua Fang
Chapter 6: Applied Mechanics
Abstract: In order to obtain geotechnical engineering material mechanical parameters correctly by using back analysis and overcome shortcoming of...
1647
Authors: Bei Zhan Wang, Xiang Deng, Wei Chuan Ye, Hai Fang Wei
Chapter 13: Mechanical Control and Information Processing Technology
Abstract: The particle swarm optimization (PSO) algorithm is a new type of global search method, which mostly focuses on the continuous variables and...
1787
Authors: Sun Xin Wang, Yan Li, Yan Rong Zhang
Chapter 15: Economics, Marketing and Engineering Management
Abstract: In this paper, a hybrid algorithm named IPSO-VND is proposed and applied to solving the vehicle routing problem with simultaneous pickup and...
2326