Analysis of Web Pages Based the Changed Information and its’ Application in the Search Engine for one Web Site

Article Preview

Abstract:

The structures and contents of researching search engines are presented and the core technology is the analysis technology of web pages. The characteristic of analyzing web pages in one website is studied, relations between the web pages web crawler gained at two times are able to be obtained and the changed information among them are found easily. A new method of analyzing web pages in one website is introduced and the method analyzes web pages with the changed information of web pages. The result of applying the method shows that the new method is effective in the analysis of web pages.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2311-2316

Citation:

Online since:

February 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] WANG Ji Cheng; XIAO Rong; SUN Zheng Xing; and ZHANG Fu Yan. STATE OF THE ART OF INFORMATION RETRIEVAL ON THE WEB[J]. Journal of Computer Research and Development, 2001, 38(2):187-193.

Google Scholar

[2] OUYANG Liubo1; LI Xueyong2; LI Guohui3; WANG Xin2. A Survey of Web Spiders Searching Strategies of Topic-specific Search Engine[J]. Computer Engineering, ,2004,30(13):32-33.

Google Scholar

[3] YE Yun-ming; YU Shui; MA Fan-yuan; SONG Hui; ZHANG Ling. On Distributed Web Crawler: Architecture, Algorithms and Strategy[J]. Acta Electronica Sinica, 2002, 30(12A): 2008-(2011).

Google Scholar

[4] XU Xiao+; ZHANG Wei-Zhe; ZHANG Hong-Li; FANG Bin-Xing. WAN-Based Distributed Web Crawling[J]. Journal of Software, 2010, 21(5): 1065-1082.

DOI: 10.3724/sp.j.1001.2010.03725

Google Scholar

[5] XU Zhao-cai; CHENG Xian-yi. Focused Crawling Algorithm Based on Multi-agent System[J]. Computer Engineering, 2008 , 34(16): 204-206.

Google Scholar

[6] QI Xin. Design on Focused Web Crawler Based on Ontology[J]. Journal of Wuhan University of Technology, 2009 , 31( 2): 138-141.

Google Scholar

[7] WANG Tao~(1; 2); FAN Xiao-zhong~1; GU Yi-jun~1; LIU Lin~1. Design of Theme Crawler Based on Concept Analysis[J]. Journal of Beijing Institute of Technology, 2004, 24(10):890-893.

Google Scholar

[8] JIANG Zong-li; TIAN Xiao-yan; ZHAO Xu. A Topic Crawler Algorithm Based on Semantic Analysis[J]. Computer Engineering & Science, 2010,32(9):145-151.

Google Scholar

[9] WANG Da-Ling+; YU Ge; BAO Yu-Bin; ZHANG Mo; SHEN Zhou. Dynamically Generalizing Web Pages Based on Users' Search Intentions[J]. Journal of Software, 2010,21(5):1083-1097.

DOI: 10.3724/sp.j.1001.2010.03477

Google Scholar

[10] YUAN Yu-yu1; LUO Xue-chao2. A Measurement Method of Search Engine Retrieve Performance Based User Path Model[J]. Acta Electronica Sinica, 2008, 36(5): 969-973.

Google Scholar

[11] XUE Yewei; SHEN Junyi; ZHANG Yun. Modified Edit Distance Algorithm and Its Application in Web Search [J]. Journal of Xi an Jiaotong University, 2008, 42(12): 1450-1454.

Google Scholar

[12] XUE Yewei; SHEN Junyi; ZHANG Yun. Method of acquiring web features and its application in web search[J]. Journal of Southeast University(English Edition), 2008, 24(3): 330-334.

Google Scholar

[13] LIU Wei1) MENG Xiao-Feng1) MENG Wei-Yi2). A Survey of Deep Web Data Integration[J]. Chinese Journal of Computers, 2007, 30(9): 1475-1489.

Google Scholar

[14] LIN Chao; ZHAO Peng-peng; CUI Zhi-ming. Deep Web Sources Focused Crawler[J]. Computer Engineering, 2008, 34(7): 56-58.

Google Scholar

[15] TIAN Ye; DING Yue-wei. Crawlers Crawling Strategy of Deep Web Based on Keywords Relevant Weight[J]. Computer Engineering, 2008 , 34(15): 220-222.

Google Scholar

[16] LIU Fan-ping1; GAO Yan-hua1; YU Jiong1; ZHANG Wei2. Research and Realization of Searching System on Website Base on Search Key Decision[J]. Microelectronics & Computer, 2010 , 27 ( 8): 214-217.

Google Scholar

[17] SONG Rui-hua 1; 2; MA Shao-ping 1; CHEN Gang 2; LI Jing-yang. A HTML Parser to Improve Chinese Search Engines[J]. Journal of Chinese Information Processing, 2003,17(4):19-26.

Google Scholar

[18] ZHONG Chu Ling; ZHU Dan; CAO Er Tang. A Web page parser to improve search engine retrieval quality[J].

Google Scholar

[19] Chen Zhiping The Theory and Application Research on Intelligent Search Engine[D]. changsha: doctoral dissertation of Hunan University, (2003).

Google Scholar