Tuesday, September 6, 2011

ETL Basic Concepts

ETL stands for Extract Transform & Load; this is the foundation of Data Integration Systems. The concept which we are talking here is very simple and self explanatory. Here you go…

E – Extract:

Extracting data from different sources, where each source contains different types of data and in different formats.

T – Transform:

Transform the data into a required unified format.

L – Load:

Load the data into the required system. Normally it will be a Data Warehouse or any other applications.

What is done during processing the data in the ETL?

Below is the processing we normally do in ETL.

1. Aggregating Data

2. Cleansing Data

3. Deriving Data

4. Filtering Data

5. Integrating Data

6. Validating Data

1 comment: