Medium1 markMultiple Choice
AWS SAA-C03 · Question 51 · Domain 3.5: Data Ingestion
A data engineering team needs to run daily ETL (Extract, Transform, Load) jobs to process logs stored in Amazon S3 and load the transformed data into Amazon Redshift. They want a fully managed, serverless Apache Spark environment. Which service should they use?
A data engineering team needs to run daily ETL (Extract, Transform, Load) jobs to process logs stored in Amazon S3 and load the transformed data into Amazon Redshift. They want a fully managed, serverless Apache Spark environment. Which service should they use?
Answer options:
A.
Amazon EMR
B.
AWS Glue
C.
AWS Data Pipeline
D.
Amazon Kinesis Data Analytics
How to approach this question
Keywords: 'serverless', 'ETL', 'Apache Spark'. AWS Glue is the exact match.
Full Answer
B.AWS Glue✓ Correct
AWS Glue is a fully managed, serverless extract, transform, and load (ETL) service. It automatically provisions the Apache Spark environment needed to execute your ETL jobs, meaning you don't have to manage any infrastructure.
Common mistakes
Choosing EMR, which traditionally requires cluster management.
Practice the full AWS SAA-C03 Practice Exam 3
65 questions · hints · full answers · grading
More questions from this exam
Q01A company stores sensitive documents in an Amazon S3 bucket. The security team requires that only...EasyQ02A large enterprise uses AWS Organizations to manage multiple accounts. The security team wants to...MediumQ03A company hosts a web application on Amazon EC2 instances behind an Application Load Balancer (AL...EasyQ04A company wants to continuously monitor its AWS accounts for malicious activity and unauthorized ...MediumQ05A company needs to encrypt data at rest in Amazon RDS and manage database credentials securely. T...Medium
Expert