Research and Design of Data Processing Based on ETL Framework

Article Preview

Abstract:

ETL is a key link in the construction of data warehouse. On the base of analyzing the mainstream ETL tool Datastage, the data extraction, transformation and loading, proposes a ETL framework based on data processing, and the realization method and steps are discussed in detail. The framework uses HIVE as a data processing station, improve the operating efficiency of the file; data task according to the E, T and L three parts and hierarchical partitioning, conversion of data users to better grasp the process; development data using the configuration file of the task, the development personnel free out from the heavy code, will to shift the focus of the work to the data logical task, which has greatly improved the efficiency of development personnel data processing.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 1049-1050)

Pages:

1966-1971

Citation:

Online since:

October 2014

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] PVassiliadis, ASimitsis, PGeorgantas, MTerrovitis, SSkiadopoulos. Ageneric and customizable framework for the design of ETL scenarios[J]. Information Systems Journa, l2005, 30(7): 492-525.

Google Scholar

[2] PVassiliadis, eta. l ARKTOS: to wards the modeling, design, control and execution of ETL processes[J]. Information Systems, 2001, 26: 537-561.

DOI: 10.1016/s0306-4379(01)00039-4

Google Scholar

[3] Chen Xian, Chen Song Qiao. According to the design and realization of the in general use ETL tool of data warehouse[J]. Calculator application study, 2004, (8) : 214-216.

Google Scholar

[4] You Yu Lin, Zhang Xian Min. The ETL strategy and the structure design in a kind of dependable data warehouse[J]. Calculator engineering and application, 2006, 3: 172-175.

Google Scholar

[5] week luxuriant Wei, Deng Su, Huang Hong Bin. According to the ETL tool design and realization of dollar data[J]. Science technique and engineering, 2006, 6(21): 3503-3507.

Google Scholar

[6] Hong source, the week is good. According to the design and realization of CWM standard ETL[J]. University college journal(information science version) in Jilin, 2006, 1(24): 50-55.

Google Scholar

[7] JoseZubcof, fJuanTrujillo. AUML2. 0 profile to design Association Rulemining models in the multidimensional conceptual modeling of data warehouses[J]. Data&Knowledge Engineering? October2007, 63(1): 44-62.

DOI: 10.1016/j.datak.2006.10.007

Google Scholar