AWS Cloud Track

This track covers the core data pipeline flows that are part of a typical AWS Data Engineer role.


Pipelines in this track

  • External RDBMS → Glue → S3 → Redshift (Batch Ingestion + ELT)
    External RDBMS → Glue → S3 → Redshift (Batch Ingestion + ELT)
  • External SFTP → Lambda → S3 → Redshift COPY
    External SFTP → Lambda → S3 → Redshift COPY
  • S3 Landing → Lambda → Athena (Serverless Analytics)
    S3 Landing → Lambda → Athena (Serverless Analytics)
  • Kinesis → Glue Streaming → Redshift
    Kinesis → Glue Streaming → Redshift
  • Glue → Data Lakehouse on S3 (Parquet)
    Glue → Data Lakehouse on S3 (Parquet)
  • Redshift → UNLOAD → S3 → Partner SFTP (Reverse ETL)
    Redshift → UNLOAD → S3 → Partner SFTP (Reverse ETL)