The very first question which arises in people’s mind is what is it? Here is the definition by wikipedia.com, “IBM InfoSphere DataStage is an ETL tool and part of the IBM Information Platforms Solutions suite and IBM InfoSphere. It uses a graphical notation to construct data integration solutions and is available in various versions such as the Server Edition and the Enterprise Edition.”
IBM is the company which has developed this product; this product comes under a product line called InfoSphere. InfoSphere contains any many products (you can check out the products using the link http://www-01.ibm.com/software/data/infosphere/ )
ETL stands for Extract, Transform & Load, if you don’t know ETL concept I feel its better you get some brief knowledge about it and then continue reading.
ETL concept is used with the Data Warehouse, because we are going to handle billions or trillions or even more number of records in Data Warehouse and it will be dealt differently not like the way we deal with few thousands of records.
I suggest you to read about Data Warehouse, OLAP, OLTP and difference between Data Warehouse and Data Base before continuing reading on DataStage.
No comments:
Post a Comment