AWS DEA-C01 Practice Exam: Question Patterns, Tips, and Study Strategy
Previous: Batch and Streaming Pipelines
Book: AWS Certified Data Engineer Associate Study Guide Authors: Sakti Mishra, Dylan Qu, Anusha Challa Publisher: O’Reilly Media ISBN: 978-1-098-17007-3
Chapter 9 is the practice exam. 42 questions covering all four domains of the DEA-C01 certification. Stop reading, start testing yourself.
What the Practice Exam Looks Like
42 questions. Most are single-answer multiple choice. Some ask you to select two or three correct answers. Multi-select questions always tell you how many to pick.
Every question follows a scenario format. Company description, a problem, some constraints, four or five answer choices. The scenarios feel realistic. Healthcare companies with patient data, ecommerce platforms tracking clicks, fintech doing fraud detection, media companies doing streaming analytics. Standard stuff you’d see in real AWS projects.
Answers with detailed explanations are in the Appendix. Don’t skip that part. The explanations teach you more than the questions.
Patterns Across the Questions
After going through all 42 questions, clear patterns emerge. Knowing these patterns is half the battle.
Service Selection is King
Most questions test whether you know which AWS service fits a specific use case. Not just “what does this service do” but “which service has the least operational overhead for this exact scenario.”
You’ll see questions about:
- Kinesis vs MSK vs Firehose
- Glue vs EMR vs Lambda for processing
- When Athena is enough vs when you need Redshift
- Step Functions vs MWAA vs Glue Workflows for orchestration
- DMS vs zero-ETL vs custom pipelines
The key phrase that appears constantly: “least operational overhead.” AWS loves this. It usually means “pick the managed service.” Self-managed on EC2 vs fully managed AWS service? Managed wins almost every time.
Architecture Decision Questions
Some questions test end-to-end solution design. Pipeline that needs ingestion, processing, storage, and analytics. Pick the right combination of services.
Harder because you need to know how services connect. Redshift streaming ingestion from MSK. Athena federated queries to DynamoDB. Aurora zero-ETL to Redshift. You need these integration patterns.
Security and Access Control
Solid chunk of questions on security. IAM policies with least-privilege access. Lake Formation for fine-grained permissions. KMS encryption. Secrets Manager for credentials. VPC configurations.
One pattern: if a question mentions hardcoded credentials, the answer is almost always Secrets Manager or Parameter Store. Fine-grained table/column access across multiple compute engines? Lake Formation.
Cost Optimization
Several questions with cost as primary concern. Spot instances for fault-tolerant workloads. Glue Flex execution for non-urgent jobs. S3 lifecycle policies. DynamoDB capacity modes. Athena query result reuse.
The pattern: workload can tolerate interruptions and isn’t time-sensitive? Look for the cheapest compute option.
Question Types and How to Approach Them
Single Answer Questions
Read the scenario carefully. Find constraint keywords: “least operational overhead,” “most cost-effective,” “minimal code changes,” “near real-time.” These narrow your choices fast.
Usually two answers are obviously wrong. The challenge is the remaining two. That’s where constraints help. Both might work technically, but one fits the constraints better.
Multi-Select Questions
Trickier. Pick two or three from five or six options. Wrong answers are usually clearly wrong if you know the services.
Eliminate obviously wrong ones first. The correct answers usually form a coherent solution together. If your selections don’t make sense as a combination, reconsider.
The “Select Three” Pattern
When asked to select three steps, think of it as a mini-architecture. The three correct answers should work together as a sequence or complementary configurations. One enables a feature, another configures it, third sets up access control.
Common Traps
The “Works But is Not Best” Trap
Multiple answers can technically solve the problem. The exam asks “which is best given the constraints,” not “does this work.” Self-managed Airflow on EC2 works for orchestration. MWAA exists with less operational overhead. Read the constraints.
The Overly Broad Permission Trap
Security questions: watch for excessive permissions. AmazonS3FullAccess when you only need read access to one bucket. Resource: "*" when you should scope to specific ARNs. Least-privilege answer is almost always correct.
The “Sounds Fancy But Doesn’t Exist” Trap
Some answer choices reference features that sound plausible but aren’t real. Like using Kinesis to “natively read binary logs” from Aurora. Sounds reasonable, not how Kinesis works.
The Real-Time vs Near-Real-Time Trap
Some questions describe “real-time” but the best answer is near-real-time. Does the scenario truly need sub-second latency or is seconds-to-minutes acceptable? Changes which services apply.
Topics That Get Heavy Coverage
Based on the 42 questions:
Data ingestion and streaming gets a lot of attention. CDC pipelines, Kinesis configurations, MSK setups, streaming ingestion into Redshift. Weak on streaming? Study hard.
AWS Glue appears everywhere. Crawlers, Data Quality, PII detection, Flex execution, autoscaling, workflows, connections. Probably the single most tested service.
Security and governance questions are spread throughout. IAM policies, Lake Formation, encryption, VPC endpoints, Secrets Manager. Woven into every topic.
Query optimization for Athena and Redshift. Partitioning, bucketing, columnar formats, materialized views, provisioned capacity.
Orchestration comes up multiple times. Step Functions vs MWAA vs Glue Workflows. Know when to use each.
How Representative Are These Questions?
Pretty representative. Format matches the actual exam. Scenario-based, constraint-driven, multiple plausible answers. Difficulty feels about right. Not trivially easy, not unreasonably hard.
What I like: covers the breadth well. All four domains. Nothing feels like filler.
What could be better: 42 questions is less than the real exam’s 65. You get a sample, not a full simulation. The time pressure can’t really be simulated by reading a book chapter either.
How to Use This Chapter Effectively
First Pass: Simulate Exam Conditions
Set a timer. About 90 minutes for 42 questions. Roughly the same per-question pace as the real exam. Don’t look at answers. Write down choices. Tests your actual readiness.
Second Pass: Review Every Question
After scoring, read every explanation in the Appendix. Even for questions you got right. Sometimes you got the right answer for the wrong reason. Explanations for wrong answers often include details about why other options don’t work. Teaches you more than just knowing the right answer.
Third Pass: Group by Weakness
Sort missed questions by topic. Three Glue questions wrong? Study Glue. Security tripped you up? Review IAM and Lake Formation. Targeted study plan from your mistakes.
Track Your Patterns
Notice which traps catch you. Always picking the self-managed option? Missing the “least operational overhead” constraint? Struggling with multi-select? Knowing your weak patterns is valuable.
General Certification Tips
Read the last sentence first. The question is always at the end. Read it first, then read the scenario knowing what you’re looking for. Saves time, improves focus.
Watch for absolute words. “Always,” “never,” “only,” “all.” These make an option more likely wrong. AWS rarely deals in absolutes.
Understand “most” and “least.” “Most cost-effective” and “least operational overhead” are different optimization targets. Different answers for cheapest vs simplest to manage.
When stuck, eliminate. Self-managed on EC2 is almost never “least operational overhead.” Hardcoded credentials are never secure. Overly broad IAM permissions are never the correct security answer.
Flag and move on. In the real exam, don’t spend five minutes on one question. Flag it, come back later. Your subconscious might work it out while you answer other questions.
The Appendix Matters
Answers aren’t just letter choices. Each includes detailed explanation of why the correct answer is correct and why each wrong answer is wrong. Gold for learning.
Read the Appendix like a textbook, not an answer key. Explanations often reference specific AWS features and best practices that didn’t come up in the chapters. Bonus study material.
Final Thoughts
Chapter 9 is where the rubber meets the road. All the theory gets tested here. Score above 80% on first attempt without looking at answers? Good shape for the real exam. Below 65%? Go back and study weak areas before booking.
Practice exams aren’t about memorizing answers. They train your brain to recognize patterns, eliminate wrong options, and apply constraints. Do this chapter right and it’s worth more than re-reading the entire book.
Good luck with the DEA-C01.