Chapter DEA-C01 associate tier
Data Engineer
Editor's note — A study companion for the Data Engineer exam — every domain rebuilt from scratch, with worked practice questions and an exam-grade timed simulation.
65 questions 130 minutes threshold 720/1000 4 domains official guide
Table of Contents
I. Data Ingestion And Transformation 34% weight
Batch vs Streaming Ingestion Patterns Kinesis Data Streams, Firehose, and Amazon MSK Amazon Managed Service for Apache Flink — Stream Processing AWS Glue ETL — Job Bookmarks, DynamicFrames, and PySpark Lambda and EventBridge for Event-Driven Ingestion AppFlow, DataSync, DMS, and Snowball for Data Migration II. Data Store Management 26% weight
S3 Storage Classes, Lifecycle Policies, and Data Lake Foundations S3 Partitioning Strategies and File Formats — Parquet, ORC, Avro Glue Data Catalog, Lake Formation, and Schema Management Amazon Redshift — RA3, Materialized Views, Spectrum, and Serverless DynamoDB, Aurora, and Vector Stores for ML Serving III. Data Operations And Support 22% weight
Amazon Athena — Queries, Workgroups, and Cost Optimization Redshift Query Tuning, COPY/UNLOAD, and Operational Commands Amazon EMR with Spark/Hive and AWS Step Functions Orchestration CloudWatch, CloudTrail, and Data Pipeline Monitoring Data Quality with Glue DataBrew and Glue Data Quality