Paper Title:
Web Data Extraction Research Based on Wrapper and XPath Technology
  Abstract

For satisfy people’s various need, some websites consist of pages that are dynamically generated using a common template populated with data from www, such as product description pages on e-commerce sites. In this paper, it merges wrapper technology with XPath to form a dependable, robust process for web data extraction. Through validating such a method in some experiments; we get results that it has high efficiency in extracting list page.

  Info
Periodical
Advanced Materials Research (Volumes 271-273)
Edited by
Junqiao Xiong
Pages
706-712
DOI
10.4028/www.scientific.net/AMR.271-273.706
Citation
H. Liu, Y. X. Ma, "Web Data Extraction Research Based on Wrapper and XPath Technology", Advanced Materials Research, Vols. 271-273, pp. 706-712, 2011
Online since
July 2011
Export
Price
$32.00
Share

In order to see related information, you need to Login.

In order to see related information, you need to Login.

Authors: Jing Ping Liao, Chun Jie Wang
Abstract:According to the structure and environment of the spacecrafts, its necessary to have mechanical experiments on them. Besides, analysis models...
269
Authors: Cui Fang Zheng, Long Jiang, Li Qing Jiang, Zhi Jie Wu
Chapter 5: Information Processing and Computational Science
Abstract:Data mining techniques give us a feasible method to deal with great amount of data, which is generated during the software developing. Many...
738
Authors: Ling Li Zhao, Shuai Liu, Jun Sheng Li
Chapter 2: Computer Science and Computational Science, Information Processing
Abstract:The paper is focused on these thematic data index construction, and puts forward a kind of category data indexes for rapid queries for...
596
Authors: Shu Peng Wang, Yi Fei Xu, Su Xia Ma, Lin Hai Qi
Chapter 20: Computer Applications in Industry and Engineering
Abstract:With the putting forward and continuous expansion of the IEC 61970, the data interaction among heterogeneous system becomes possible, and...
3040
Authors: Jia Jia Miao, Guo You Chen, Kai Du, Xue Lin Fang
Chapter 13: Internet and E-Commerce Technology
Abstract:The increasing number of applications for large data, such as Web search engines, we need to have high availability 7*24 tracking, storage...
2792