In the previous pipeline, raw batch and CDC data was processed into clean Silver lakehouse tables.
But Silver tables are still closer to the source system. For analytics and reporting, data usually needs to be shaped into more business-friendly tables that teams can easily use.
In this pipeline, you will build the Gold Core layer.
You will take trusted Silver healthcare data and create Gold tables such as dimensions, facts, audit logs, and data quality metrics. These Gold tables become the curated layer that downstream analytics and reporting pipelines can use.
After completing this pipeline, you will be able to: