The typical real-life ETL cycle consists of the following execution steps:
1. Cycle initiation
2.Build reference data
3.Extract (from sources)
4.Validate
5.Transform (clean, apply business rules, check for data integrity, create aggregates)
6.Stage (load into staging tables, if used)
7.Audit reports (for example, on compliance with business rules. Also, in case of failure, helps to diagnose/repair)
8.Publish (to target tables)
9.Archive
10.Clean up
No comments:
Post a Comment