Research on PageRank Algorithm to Index Pages

Article Preview

Abstract:

PageRank algorithm is a vital method to determine the importance of pages. Useful as it is, the algorithm has many disadvantages. Therefore, we arrive at the conclusion that it’s not rational to calculate the importance degree of pages simply by links between them. Considering the timeliness problem of PageRank algorithm, we provide the time penalty factor W(n) to weigh the effects of update time on page ranking. After adding the time penalty factor to the original PageRank algorithm, we come up with the refined PageRank algorithm. Our algorithm is superior compared with the original one and many other existing methods that weigh the effects of update time. We judge update time by the times a page is crawled by Web crawlers. Consequently, drawbacks of the methods that use the real time to measure update time can be overcome and the order of pages can meet users’ need better.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1469-1474

Citation:

Online since:

September 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Li Zhiying, Yang Wu, Xie Zhijun. Research on PageRank Algorithm. Chongqing University of Technology, 2011, 10.

Google Scholar

[2] Yang Fan, Wang Xiuwei, Bai Zhenxing. Optimization Technology for Website Based On Google. Air Force Engineering University, 2006, 05.

Google Scholar

[3] ST. ANNE MARY, Asion Journal of Computering Updates ang Trends. ST. ANNE MARY Education Society, (2011).

Google Scholar

[4] Bing Liu, Web data mining exploring hyperlinks, contents, and usage data. Berlin; Heidelberg; Springer, (2007).

Google Scholar

[5] L Page, S Brin, R Motwani. The PageRank Citation Ranking: Bringing Order to the Web. Stanford University, 1998, 09.

Google Scholar

[6] http: /en. wikipedia. org/wiki/PageRank#Algorithm.

Google Scholar