Web Data Extraction and Integration in Domain

Abstract:

The purpose of web data extraction and integration is to provide domain-oriented value-added services. Based on the requirements of the domain and the features of web page data, this paper proposes a web data schema and a domain data model. It also puts forward methods for web table positioning and web table record extraction based on the web data schema, and an integration algorithm based on the domain data model. Experimental results are given to show the effectiveness of the proposed algorithm and model.
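
The abstract names three steps (positioning a table, extracting its records, integrating records into a domain model) without giving details, so the following is only a minimal sketch of that general pipeline, not the paper's algorithm. It assumes a table is "positioned" by matching its header row against a schema of expected labels, and that records are integrated by merging on a shared `name` field. The names `TableRecordParser`, `extract_records`, `integrate`, and the schema format are all hypothetical.

```python
from html.parser import HTMLParser


class TableRecordParser(HTMLParser):
    """Collect the rows of every <table> in a page as lists of cell strings.

    Simplification: nested tables are not handled specially.
    """

    def __init__(self):
        super().__init__()
        self.tables = []   # each table is a list of rows
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None
            self._cell = None


def extract_records(html, schema):
    """Find the first table whose header row contains every schema label,
    then map each data row to a dict of domain field names.

    schema: {header label on the page: domain field name} (assumed format).
    """
    parser = TableRecordParser()
    parser.feed(html)
    for table in parser.tables:
        if not table:
            continue
        header = table[0]
        if all(label in header for label in schema):
            index = {field: header.index(label) for label, field in schema.items()}
            return [
                {field: row[i] for field, i in index.items()}
                for row in table[1:]
                if len(row) == len(header)  # skip malformed rows
            ]
    return []


def integrate(sources):
    """Merge record lists from several pages, keyed on the assumed 'name' field.

    When two sources disagree on a field, the earlier source wins.
    """
    merged = {}
    for records in sources:
        for record in records:
            target = merged.setdefault(record["name"], {})
            for field, value in record.items():
                target.setdefault(field, value)
    return list(merged.values())
```

Each source page would get its own schema mapping its local column labels to the shared domain fields, so that pages using different headers for the same attribute end up in one record set.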

Info:

Periodical:

Advanced Materials Research (Volumes 756-759)

Pages:

1585-1589

Online since:

September 2013

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved
