A comprehensive study guide for the AWS Certified Data Engineer Associate (DEA-C01) exam covering data ingestion, transformation, storage, security, and governance on AWS.
The AWS Certified Data Engineer Associate Study Guide by Sakti Mishra, Dylan Qu, and Anusha Challa is a structured preparation resource published by O’Reilly Media in 2025. Written by three AWS Data Analytics Specialist Architects, the book covers all four domains of the DEA-C01 certification: data ingestion and transformation, data store management, data operations and support, and data security and governance. It walks readers through foundational concepts like databases, Spark, Flink, data lakes, and cloud computing basics before going deep into AWS-specific services.
The book stands out for its practical approach. Beyond exam prep, it includes hands-on chapters for building both batch pipelines (S3, Glue, Redshift, QuickSight) and real-time streaming pipelines (Kinesis, EMR Serverless, Apache Iceberg). It covers over 40 AWS services in detail, with comparison tables for choosing between similar services, best practices for production workloads, and reference architectures for common data engineering patterns.
The guide also includes 40+ practice exam questions with detailed explanations and a forward-looking chapter on newer AWS services like SageMaker Lakehouse, S3 Tables, and GenAI-powered development tools. It serves both certification candidates and working data engineers who want a structured overview of the AWS data ecosystem.
I grabbed the AWS Certified Data Engineer Associate Study Guide and decided to go through it chapter by chapter. Not a quick skim-and-forget kind of thing. I actually wanted to write down what each chapter covers, what’s useful, and what you can probably skip.
This is post one. I’ll explain what the book is, who made it, and why you might want to follow along.
Book Details
Title
AWS Certified Data Engineer Associate Study Guide
Authors
Sakti Mishra, Dylan Qu, Anusha Challa
Publisher
O’Reilly Media
ISBN
978-1-098-17007-3
Edition
First Edition, September 2025
What Is This Book
It’s a study guide for the AWS Certified Data Engineer Associate exam (DEA-C01). Covers everything you need to pass the certification. It also goes beyond just exam prep though – the authors actually teach data engineering concepts on AWS from scratch.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 1 is the warm-up. No heavy AWS services yet. It sets the stage: what the exam looks like, what topics it covers, and how to approach the questions. If you already know the exam structure, you can skim this. There are a few useful bits here worth your time though.
Book:AWS Certified Data Engineer Associate Study GuideAuthors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 2 is called “Prerequisite Knowledge for Aspiring Data Engineers.” It covers the foundation you need before touching any AWS service. Databases, big data concepts, processing frameworks, data lakes, warehouses, ETL patterns, CI/CD, and AWS basics.
If you already work in tech, a lot of this will feel like a refresher. It’s nice to have it all in one place though. Some of these topics honestly deserve more attention than people give them during cert prep.
Book:AWS Certified Data Engineer Associate Study GuideAuthors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 3 is where the real AWS content starts. This is the overview of analytics services you need to know for the DEA-C01 exam. Even if you’re not taking the exam, it’s a solid map of what AWS offers for data work.
There are a lot of services here. Some overlap. Some feel redundant. That’s just how AWS works.
Book:AWS Certified Data Engineer Associate Study GuideAuthors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 3 Part 1 covered the core analytics services: Kinesis, Glue, Redshift, Athena, and friends. Those services don’t exist in a vacuum though. They need compute to run on, databases to pull data from, storage to land results, networking to keep things secure, and monitoring to know when something breaks.
Book: AWS Certified Data Engineer Associate Study Guide Authors: Sakti Mishra, Dylan Qu, Anusha Challa Publisher: O’Reilly Media ISBN: 978-1-098-17007-3
Chapter 4 covers data ingestion and transformation. This is Part 1, focused on ingestion. Getting data into AWS is the first step of any analytics pipeline. Sounds simple, but the number of services and patterns you need to know is big.
Data Ingestion Overview
Data ingestion is the process of importing data from various sources into AWS storage and processing systems. The book breaks it into three patterns:
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Second part of Chapter 4, covering data transformation. The first part was about ingestion. Now we look at what happens after the data lands. You need to clean it, reshape it, enrich it, and get it into a format that analysts and applications can actually use.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Final part of Chapter 4. We move from raw transformation into two topics: data preparation for people who don’t write code, and orchestrating the whole pipeline end to end. Both matter for the exam. Both matter in real life.
Data Preparation for Nontechnical Personas
AWS Glue DataBrew is a low-code, visual tool for data cleaning and preparation. It targets data analysts, data scientists, and business users who need to work with data but don’t want to write PySpark or SQL.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 5 is where the book gets into data store management. Domain 2 territory on the exam, and a big one. How do you pick the right storage? What file format should you use? How do you keep your S3 bill from growing out of control?
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Data modeling is one of those topics that sounds academic until you get it wrong in production. Then it becomes very real, very fast. This section of Chapter 5 covers data modeling strategies for three different AWS services: Amazon Redshift, Amazon DynamoDB, and S3 data lakes. Each has its own rules, trade-offs, and gotchas.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 6 is titled “Data Operations and Support” and it’s a big one. This post covers the first part: analytics operations. QuickSight for BI dashboards, Athena for SQL on S3, Redshift for heavy SQL analytics, and notebooks for interactive data exploration. These are the tools you use after you’ve built your pipelines and stored your data. Now you actually look at it.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Second half of Chapter 6. It covers the stuff that separates a working pipeline from a production-grade pipeline: monitoring, alerting, data quality checks, disaster recovery, Infrastructure as Code, CI/CD, and cost optimization. If Part 1 was about running analytics, Part 2 is about keeping them running and not going broke doing it.
Chapter 7 covers data security and governance. You can build the most elegant data pipeline in the world, but if security is an afterthought, you’re one misconfigured S3 bucket away from a headline nobody wants.
Splitting this chapter into two posts. This first part covers network security, authentication, encryption, and access control. Second part covers data governance.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Data governance is that thing every team says they care about but nobody wants to own. Security gets attention because breaches make headlines. Governance? It just slowly rots your data platform from the inside when you ignore it. This part of Chapter 7 covers the governance pillars beyond security and privacy.
Book:AWS Certified Data Engineer Associate Study GuideAuthors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
This is the chapter I was waiting for. After seven chapters of theory, services, security, and governance, Chapter 8 finally says: “OK, build something.” Two complete pipelines, end to end, with real code and real AWS services.
If you learn by doing, this chapter alone is worth the price of the book.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 9 is the practice exam. 42 questions covering all four domains of the DEA-C01 certification. Stop reading, start testing yourself.
What the Practice Exam Looks Like
42 questions. Most are single-answer multiple choice. Some ask you to select two or three correct answers. Multi-select questions always tell you how many to pick.
Book:AWS Certified Data Engineer Associate Study GuideAuthors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Chapter 10 is the forward-looking chapter. Everything in Chapters 1 through 9 was about established AWS services. This one covers what AWS announced at re:Invent 2024 and what’s coming next.
Some services are still in preview. Some just reached general availability. They may or may not show up on your DEA-C01 exam today. If you’re building data pipelines on AWS in 2025 and beyond though, you need to know where the platform is heading.
Book: AWS Certified Data Engineer Associate Study Guide
Authors: Sakti Mishra, Dylan Qu, Anusha Challa
Publisher: O’Reilly Media
ISBN: 978-1-098-17007-3
Seventeen posts later, done with this book. Every chapter, every service, every exam domain. Time to step back and give an honest summary.
Overall Impressions
Solid study guide. Not perfect, but solid. The authors clearly know their stuff. They’re AWS Solution Architects who’ve built real data platforms, and it shows. Well structured, clear explanations, broad coverage that gives you a real foundation in AWS data engineering.