The overall data acquisition process, called ETL (extraction, transformation and loading), is generally grouped into three main components:

- Extraction: Involves obtaining the required data from the various sources.
- Transformation: Source data undergoes a number of operations that prepare it for import into the data warehouse (target database). This task is performed by integration and transformation programs, which can reformat, recalculate, modify structure and data elements, and add time elements. They can also perform calculations, summarization, denormalization, etc.
- Loading: Involves physically placing the extracted and transformed data in the target database. The initial load is a massive data import into the data warehouse. Subsequently, an extraction procedure periodically loads fresh data according to business rules and a predetermined frequency.
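
The three components can be illustrated with a minimal Python sketch. It is not a production pipeline: the CSV source, the `customer_sales` table, and the function names `extract`, `transform`, `load` and `run_etl` are all hypothetical, chosen only for illustration, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3
from datetime import date

# Hypothetical source data; a real source could be a file, an API or another database.
SOURCE_CSV = """order_id,customer,amount
1,alice,10.50
2,bob,20.00
3,alice,5.25
"""


def extract(raw_csv):
    """Extraction: obtain the required rows from the source."""
    return list(csv.DictReader(io.StringIO(raw_csv)))


def transform(rows, load_date):
    """Transformation: reformat names, summarize per customer, add a time element."""
    totals = {}
    for row in rows:
        customer = row["customer"].strip().title()  # reformat
        totals[customer] = totals.get(customer, 0.0) + float(row["amount"])  # summarize
    # Each output record carries the load date as its time element.
    return [
        (customer, round(total, 2), load_date.isoformat())
        for customer, total in sorted(totals.items())
    ]


def load(conn, records):
    """Loading: place the transformed records in the target database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customer_sales "
        "(customer TEXT, total_amount REAL, load_date TEXT)"
    )
    conn.executemany("INSERT INTO customer_sales VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl(conn, raw_csv, load_date):
    """Run one extraction, transformation and loading cycle."""
    load(conn, transform(extract(raw_csv), load_date))


# In-memory SQLite database used here as a stand-in for the data warehouse.
conn = sqlite3.connect(":memory:")
run_etl(conn, SOURCE_CSV, date(2024, 1, 1))
rows = conn.execute(
    "SELECT customer, total_amount, load_date FROM customer_sales ORDER BY customer"
).fetchall()
print(rows)
```

In this sketch the first call to `run_etl` plays the role of the initial load. Periodic loads would call it again with fresh source data and a new load date, on whatever schedule the business rules dictate.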