Dynamic Non-Cooperative Structured Deep Web Selection

Article Preview

Abstract:

Most of structured deep web data sources are non-cooperation, therefore establish an accurate data source content summary by sampling is the core technology of data source selection. The content of deep web data source is updated from time to time, however existing efficient methods of non-cooperation structured data source selection does not consider summary update problem. Unrenewed summary of data source can not accurately characterize the content of the data source that produce a greater impact on data source selection. Base on this, we propose a dynamic data source selection method for non-cooperative structured deep web by combining subject headings sampling technology and subject headings extension technology. The experiment results show that our dynamic structured data source selection method has good recall ratio and precision besides being efficient.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2911-2914

Citation:

Online since:

September 2014

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] S. Milad. Central-rank-based collection selection in uncooperative distributed information retrieval [C] /Proc of the 29th European Conference on IR Research. Berlin: Springer, 2007: 160-172.

DOI: 10.1007/978-3-540-71496-5_17

Google Scholar

[2] P. Thomas, M. Shokouhi. SUSHI: scoring scaled samples for server selection [C] /Proc of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '09). New York: ACM, 2009: 419-426.

DOI: 10.1145/1571941.1572014

Google Scholar

[3] P.G. Ipeirotis, L. Gravano, M. Sahami. Probe, count and classify: categorizing hidden Web databases [C] /Proc of the 2001 ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD'01). New York: ACM, 2001: 21-24.

DOI: 10.1145/375663.375671

Google Scholar

[4] D. Hong, L. Si, P. Bracke, et al. A joint probabilistic classification model for resource selection [C] /Proc of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR'10). New York: ACM, 2010: 98-105.

DOI: 10.1145/1835449.1835468

Google Scholar

[5] V.Z. Liu, R.C. Luo, W. W Chu. Dpro: A probabilistic approach for hidden Web database selection using dynamic probing [C] /Proc of the 20th Int'l Conf. on Data Engineering (ICDE'04). Los Alamitos, CA: IEEE Computer Society, 2004: 1-12.

Google Scholar

[6] C. Yu, G. Philip, W.Y. Meng. Distributed top-N query processing with possibly uncooperative local systems [C] /Proc of the 29th Int'l Conf on Very Large Data Bases. Trondheim, Norway: VLDB Endowment, 2003: 117-128.

DOI: 10.1016/b978-012722442-8/50019-7

Google Scholar

[7] C.X. Wan, S. Deng, D.X. Liu, et al. Non-Cooperative Structured Deep Web Selection Based on Hybrid Type Keyword Retrieval[J]. Journal of Computer Research and Development, 2014, 51(4): 1905-1917(in Chinese).

Google Scholar