➔ Hands-On: Building Batch and Streaming Data Pipelines on AWS
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
This is the chapter I was waiting for. After seven chapters of theory, services, security, and governance, Chapter 8 finally says: “OK, build something.” Two complete pipelines, end to end, with real code and real AWS services.
If you learn by doing, this chapter alone is worth the price of the book.
➔ Data Ingestion Patterns: Streaming, Zero-ETL, CDC, and Best Practices on AWS
Chapter 4 covers data ingestion and transformation. This is Part 1, focused on ingestion. Getting data into AWS is the first step of any analytics pipeline. It sounds simple, but there are a lot of services and patterns you need to know.
Data Ingestion Overview
Data ingestion is the process of importing data from various sources into AWS storage and processing systems. The book breaks it into three patterns:
➔ AWS Analytics Services: Kinesis, Glue, Athena, Redshift, and More
Chapter 3 is where the real AWS content starts. This is the overview of analytics services you need to know for the DEA-C01 exam. Even if you’re not taking the exam, it’s a solid map of what AWS offers for data work.
There are a lot of services here. Some overlap. Some feel redundant. That’s just how AWS works.