A Novel Improved Web Structure Mining Algorithm

Article Preview

Abstract:

In this paper, a compact and effective web structure mining algorithm is developed according to the hypertext induced topic search (HITS). Firstly, the environments of traditional web structure mining are explored. Secondly, by analyzing the existing techniques, the existing problem and the lack of previous methods are explored. Finally, amid to overcome the deficiency of the traditional method, the paper introduced the time parameter and proposed a novel web structure mining algorithm. It turns out that the proposed algorithm has better effective and accuracy compared to the traditional methods.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

3741-3744

Citation:

Online since:

October 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Matthew Bennett, Julie Stone, Chaoyan Zhang: a scalab le parallel HITS algorithm for page ranking, 2006 First International Multi-Symposiums on Computer and Computational Sciences.

DOI: 10.1109/imsccs.2006.22

Google Scholar

[2] Bui Quang Hung, Masanori Otsubo, Yoshinori Hijikata, Shogo Nishida: HITS algorithm improvement using semantic text portionVol. 8, no. 2, pp.149-164(2010).

DOI: 10.3233/wia-2010-0184

Google Scholar

[3] Kleinberg, J: authoritative sources in a hyperlinked environment, journal of the ACM, vol. 46, no. 5, pp.604-632(1999).

DOI: 10.1145/324133.324140

Google Scholar

[4] B.Q. Hung. M. Qtsubo, Y. Hijikata and S. Nishida: extraction of semantic text portion related to anchor link, IEICE Trans. informaion and system, vol. E89, no. 6 (2006).

DOI: 10.1093/ietisy/e89-d.6.1834

Google Scholar

[5] E.J. Glover, K. Tsioutsiouliklis, S. Lawrence, D.M. Pennock, and G.W. Flake: Using web structure for classifying and describing web pages, Proc. 11th International World Wide Web Conference, pp.562-569, (2002).

DOI: 10.1145/511446.511520

Google Scholar

[6] L. Li. Y. Shang and W. Zhang: improvement of HITS-based algorithm on web documents, proc. WWW 2002, PP. 527-535(2002).

Google Scholar