WDB's Query Interface Extraction Method Based on Watir & Ruglar Expression

Article Preview

Abstract:

With the wide application of the Web databases (WDB), it has become a hot topic of the current research to make full use of data. WDB query interface is an important way to get the WDB data, it is a significant prerequisite to obtain the data efficiently that we can realize the full representation and extraction for WDB query interface. This paper presents a new representation based on owl for WDB query interface; at the same time this paper gives the extraction methods based on regular expression and watir for the context of each query interface, form information and the relationship information between the form fields. This work provides an important foundation for the further classification and integration of query interface.

You might also be interested in these eBooks

Info:

Periodical:

Key Engineering Materials (Volumes 467-469)

Pages:

1764-1769

Citation:

Online since:

February 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Ghanem T M, Aref W G. Databases Deepen the Web [ J ] . IEEE Computer, 2004, 73 (1): 116.

Google Scholar

[2] Fetterly D, Manasse M., Najork M., Wiener J. L. A large-scale study of the evolution of web pages. In: Proceedings of the 12th International World Wide Web Conference, Budapest, 2003, 669-678.

DOI: 10.1145/775152.775246

Google Scholar

[3] Chang K Chen-Chuan, He Bin, Li Chengkai, et al. Structured database on the web: Observations and Implications[J]. SIGMOD Record, 2004, 33(3): 61-70.

Google Scholar

[4] Z. Zhang, B. He,K.C. -C. Chang. Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax. In Proceedings of the 2004 ACM SIGMOD Conference (SIGMOD 2004), Paris, France, June (2004).

DOI: 10.1145/1007568.1007583

Google Scholar

[5] B. Chidlovskii, A. Bergholz. Crawling for domain-specific hidden web resources. In Proceedings of 4thInternational Conference on Web Information Systems Engineering, (2003).

Google Scholar

[6] He H., Meng W., Yu C. T., Wu Z.: WISE-Integrator: an automatic integrator of Web search interfaces for e-commerce. In: Proceedings of the 29th International Conference on Very Large Data Bases, Berlin, 2003, 357-368.

DOI: 10.1016/b978-012722442-8/50039-2

Google Scholar

[7] SUN chong. Autofill the Entry Form of Deep Web [D]. JiLin University, Master's Thesis, (2007).

Google Scholar

[8] Yuan L, Li ZH, Chen SL. Ontology-Based annotation for deep Web data. Journal of Software, 2008, 19(2): 237−245.

DOI: 10.3724/sp.j.1001.2008.00237

Google Scholar

[9] Z. Zhang, B. He,K. Chang. Understanding Web Query interfaces: Best-effort parsing with Hidden Syntax[C]. In: Proceedings of the 23rd ACM SIGMOD International Conference on Management of Data. Paris, 2004: 107-118.

DOI: 10.1145/1007568.1007583

Google Scholar

[10] H. He, W. Meng, C.T. Yu, et al. Constructing Interface Schemas for Search Interfaces of Web Databases[C]. In: Proceedings of the 6th International Conference on Web Information Systems Engineering. New York, 2005: 29-42.

DOI: 10.1007/11581062_3

Google Scholar

[11] H. He, W. Meng, C.T. Yu, et al. Automatic extraction of web search interfaces for interface schema integration[C]. In: Proceedings of the 13th international World Wide Web conference on Alternate track papers&posters, New York, NY, USA, 2004: 414-415.

DOI: 10.1145/1013367.1013502

Google Scholar