Research on Intelligent Information Search Based on Web

Article Preview

Abstract:

This paper analyzes the deficiencies of the existing Web information extraction methods and reasons, and put forward the page text information extraction method based on multi-feature fusion. Compared with previous methods with a small selection of features, the method in this paper determine the choice of a variety of information via text features, better able to adapt to a variety of styles page. By comparing the experiment, this method has higher accuracy to meet practical application needs in the Web Content Extraction.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

434-437

Citation:

Online since:

July 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Zheng Changsong, Fu Yan, she Li. Automatic extraction method of Web information based on template. Application Research of computers, 2009 (2): 570-572.

Google Scholar

[2] Ren Zhong, Xue Yongsheng, Web information extraction of tree structure, computer science2009, Vol. 25, No. 3.

Google Scholar

[3] Wang J, LOCHOVSKY F H. Data-rich section extraction from HTML pages. Proc of the 3rd International Conference on Web Information Systems Engineering. Washington DC: IEEE Computer Society, 2002, 2313-2322.

DOI: 10.1109/wise.2002.1181667

Google Scholar

[4] Liu Hui, Chen Jingyu, Xu Xuezhou. Web information extraction based on template flow configuration. Computer Engineering,. 2008 (20): 55-57.

Google Scholar

[5] Yi gaofeng, Tang Yong, Tao Wei, Wu Guibin, Huang Fan, Wang Peng. Automatic Web information extraction based on XML. computer science, 2008(03): 87-90.

Google Scholar