A Data Warehouse Cleansing Approach Based on Mathematical Association Rules

Article Preview

Abstract:

The alliance rules stated above based on the principle of data mining association rules provide a solution for detecting errors in the data sets. The errors are detected automatically. The manual intervention in the proposed algorithm is highly negligible resulting in high degree of automation and accuracy. The duplicity in the names field of the data warehouse has been remarkably cleansed and worked out. Domain independency has been achieved using the concept of integer domain which even adds on to the memory saving capability of the algorithm.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 490-495)

Pages:

1878-1882

Citation:

Online since:

March 2012

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] J. Karlsteen, Automation of metadata updates in a time critical environment, (2006).

Google Scholar

[2] P. Poonniah, Data Warehousing Fundamentals- A comprehensive guide for IT professionals, Ist ed., 81-265-0919-8, Glorious Printers: New Delhi , India, (2006).

Google Scholar

[3] B. Palace, Data mining, http: /www. anderson. ucla. edu. [Online], 1996, http: /www. anderson. ucla. edu/faculty/jason. frand/teacher/technologies/palace/datamining. htm.

Google Scholar

[4] C. Kelley, Best Uses of data warehoue, http: /www. itworld. com. [Online], 2003, http: /www. itworld. com/nl/db_mgr/, (2003).

Google Scholar

[5] T. Redman, The Impact of Poor Data Quality on the Typical Enterprise, Communications of the ACM, Vol. 41. 8. 02, (1998).

Google Scholar

[6] E. Rahm, H. H Do, Data Cleaning: Problems and Current Approaches, University of Leipzig, Germany.

Google Scholar

[7] A. Marcus J.I. Maletic, "Automated Identification of Errors in Data Sets, TR-CS-00-02, University of Memphis, (2002).

Google Scholar