Query Optimization Based on Data Provenance

Article Preview

Abstract:

Data Provenance is a key of evaluating authority and uncertainty in data query. Query process technology based on data provenance overcomes the shortcomings of traditional data integration on query quality and efficiency. This paper constructs a data model of heterogeneous data sources provenance, i.e. Semiring Provenance, based on tracing provenance of data origination and evolution. It’s proved to be effective in creating mapping between heterogeneous schemas and optimizing query quality and authority evaluation. Experiments using real data set show that our approach provides an effective and scalable solution for query optimization technology.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

586-590

Citation:

Online since:

January 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] O. Benjelloun, A. Das Sarma, A. Halevy, J. Widom. ULDBs: Databases with uncertainty and lineage. In VLDB 2006, Seoul, Korea, September, 953-964(2006).

DOI: 10.1007/s00778-007-0080-z

Google Scholar

[2] T. J. Green, G. Karvounarakis, Z. G. Ives, and V. Tannen. Update exchange with mappings and provenance. In VLDB 2007. Amended version available as Univ. of Pennsylvania report MS-CIS-07-26(2007).

Google Scholar

[3] Boris Glavic, Klaus Dittrich. Data provenance: A categorization of existing approaches. In Proceeding of the 6th MMC Workshop of BTW 2007, Aachen, Germany, March, 227-241(2007).

Google Scholar

[4] P. Buneman, S. Khanna, W. C. Tan. Why and where: a characterization of data provenance. In VLDB 2001, London, UK, April, 316-330(2001).

DOI: 10.1007/3-540-44503-x_20

Google Scholar

[5] B. Alexe, L. Chiticariu, W. C. Tan. SPIDER: a Schema mapPIng DebuggeR. In VLDB 2006, Seoul, Korea , September 12-15, 1179–1182(2006).

Google Scholar