|
Data staging is the process of transferring the data from the data sources (operational systems) into the target database of the data warehouse. The ETL process can be divided into the following stages:
- Extraction of data from various data sources
- Cleansing of data, which includes checking that important data is not missing, renaming fields for better readability, etc.
- Transformation of the source data into a form suitable for the data warehouse (see below for various transformations)
- Loading of data into the target database
- Quality assurance of data, which is done at every stage of ETL to ensure that the data going into the data warehouse is accurate, consistent and reliable. This is also known as data scrubbing.
|
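The stages listed above can be sketched roughly as a small Python pipeline. This is only an illustration under stated assumptions: the source rows, the field names (`cust_nm`, `amt`, `customer_name`, `amount`), the `sales` table, and all function names are hypothetical, and an in-memory SQLite database stands in for the data warehouse target.

```python
import sqlite3

# Hypothetical rows standing in for an operational source system.
SOURCE_ROWS = [
    {"cust_nm": "Alice", "amt": "120.50"},
    {"cust_nm": "Bob", "amt": None},        # important value is missing
    {"cust_nm": "  carol ", "amt": "75"},
]

RAW_FIELDS = ("cust_nm", "amt")
CLEAN_FIELDS = ("customer_name", "amount")


def check_quality(rows, fields, allow_null=False):
    """Quality assurance: run after each stage to catch malformed rows early."""
    for row in rows:
        for field in fields:
            if field not in row:
                raise ValueError(f"missing field {field!r} in {row!r}")
            if not allow_null and row[field] is None:
                raise ValueError(f"null value for {field!r} in {row!r}")
    return rows


def extract(rows):
    """Extraction: copy records out of the source without modifying it."""
    return [dict(row) for row in rows]


def cleanse(rows):
    """Cleansing: drop rows missing important data, rename fields for readability."""
    cleaned = []
    for row in rows:
        if row["cust_nm"] is None or row["amt"] is None:
            continue
        cleaned.append({"customer_name": row["cust_nm"], "amount": row["amt"]})
    return cleaned


def transform(rows):
    """Transformation: normalise names and convert amounts to numbers."""
    return [
        {
            "customer_name": row["customer_name"].strip().title(),
            "amount": float(row["amount"]),
        }
        for row in rows
    ]


def load(rows, conn):
    """Loading: write the prepared rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer_name TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO sales VALUES (:customer_name, :amount)", rows
    )
    conn.commit()


def run_etl(source_rows, conn):
    """Run all stages in order, with a quality check after each one."""
    raw = check_quality(extract(source_rows), RAW_FIELDS, allow_null=True)
    clean = check_quality(cleanse(raw), CLEAN_FIELDS)
    shaped = check_quality(transform(clean), CLEAN_FIELDS)
    load(shaped, conn)
    return len(shaped)


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    loaded = run_etl(SOURCE_ROWS, connection)
    print(loaded, "rows loaded")
    for record in connection.execute("SELECT customer_name, amount FROM sales"):
        print(record)
```

Keeping each stage as a separate function makes it possible to run the quality check between stages, which mirrors the point that quality assurance applies throughout ETL rather than only at the end.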