Raw data is useful, but it is rarely ready for analytics.
Customer details may need standardization, product data may need category and brand enrichment, orders may need their latest status, and order items may need to be joined back to their products and orders.
In this pipeline, you will build the refined layer on GCP.
You will read raw retail tables from BigQuery, clean and enrich the data with BigQuery stored procedures, create refined business tables, and track watermarks so that each run processes only new or changed records.
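To make the watermark idea concrete before the build starts, here is a minimal in-memory sketch in Python. It is not the BigQuery stored procedure used in this pipeline. The `updated_at` column, the sample rows, and the `select_incremental` helper are all assumptions made for illustration.

```python
from datetime import datetime


def select_incremental(rows, watermark):
    """Return the rows changed after `watermark`, plus the advanced watermark.

    Hypothetical helper: assumes every row carries an `updated_at` timestamp.
    """
    changed = [row for row in rows if row["updated_at"] > watermark]
    # If nothing changed, keep the old watermark instead of moving it.
    new_watermark = max((row["updated_at"] for row in changed), default=watermark)
    return changed, new_watermark


# Illustrative raw rows; a real run would read these from a raw table.
raw_orders = [
    {"order_id": 1, "status": "PLACED", "updated_at": datetime(2024, 1, 1, 9, 0)},
    {"order_id": 2, "status": "SHIPPED", "updated_at": datetime(2024, 1, 2, 9, 0)},
    {"order_id": 3, "status": "PLACED", "updated_at": datetime(2024, 1, 3, 9, 0)},
]

# Watermark stored by the previous run (assumed value).
last_watermark = datetime(2024, 1, 1, 12, 0)

changed, new_watermark = select_incremental(raw_orders, last_watermark)
print([row["order_id"] for row in changed])
print(new_watermark)
```

Running the helper again with `new_watermark` and the same rows selects nothing, which is the property that lets a scheduled pipeline skip records it has already processed.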
After completing this pipeline, you will be able to: