Data Preparation and Pipeline Orchestration: Step Functions, Airflow, and Glue Workflows

9 minute read


Book: AWS Certified Data Engineer Associate Study Guide. Authors: Sakti Mishra, Dylan Qu, Anusha Challa. Publisher: O’Reilly Media. ISBN: 978-1-098-17007-3.

Final part of Chapter 4. We move from raw transformation into two topics: data preparation for people who don’t write code, and orchestrating the whole pipeline end to end. Both matter for the exam. Both matter in real life.

Data Preparation for Nontechnical Personas

AWS Glue DataBrew is a low-code, visual tool for data cleaning and preparation. It targets data analysts, data scientists, and business users who need to work with data but don’t want to write PySpark or SQL.
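
Although DataBrew users normally build their transformations by clicking through the visual editor, the result is stored as a recipe: an ordered list of steps. As a rough illustration of what such a recipe could look like behind the scenes, here is a minimal sketch in Python. The helper `build_recipe_steps` and the column names are hypothetical, and the operation names (`RENAME`, `UPPER_CASE`) and their parameters reflect my reading of the DataBrew recipe actions reference, so check them against the current documentation before relying on them.

```python
import json


def build_recipe_steps(rename_map, uppercase_columns):
    """Build a list of DataBrew-style recipe steps as plain dictionaries.

    Hypothetical helper for illustration only. It makes no AWS calls; it
    just assembles the step structure a recipe is made of.
    """
    steps = []

    # One RENAME step per column that should get a friendlier name.
    for source, target in rename_map.items():
        steps.append({
            "Action": {
                "Operation": "RENAME",
                "Parameters": {"sourceColumn": source, "targetColumn": target},
            }
        })

    # One UPPER_CASE step per column whose values should be normalized.
    for column in uppercase_columns:
        steps.append({
            "Action": {
                "Operation": "UPPER_CASE",
                "Parameters": {"sourceColumn": column},
            }
        })

    return steps


# Example column names are made up for this sketch.
steps = build_recipe_steps({"cust_nm": "customer_name"}, ["country_code"])
print(json.dumps(steps, indent=2))
```

A list shaped like this could, under those assumptions, be passed as the `Steps` argument to the boto3 `databrew` client's `create_recipe` call. The point for the exam is less the syntax and more the idea: the visual steps an analyst clicks together end up as a reusable, versionable recipe.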

denis256 at denis256.dev