Hard1 markMultiple Choice

GCP PCA · Question 16 · Domain 1: Designing and Planning a Cloud Solution Architecture

CASE STUDY: AutoMakers Inc

Company Overview:
AutoMakers Inc is a global vehicle manufacturer. They have recently launched a line of connected cars.

Current Technical Environment:

  • 1 million connected cars currently on the road
  • Cars send telemetry data (speed, engine temp, location) every 5 seconds
  • Current on-premises MQTT brokers are crashing under the load

Business Requirements:

  • Enable predictive maintenance to alert drivers before parts fail
  • Provide real-time fleet tracking for commercial customers
  • Support over-the-air (OTA) software updates

Executive Statements:

  • CEO: "Data is our new revenue stream. We need to monetize this telemetry data."
  • CTO: "We expect to have 10 million connected cars in 3 years. The architecture must scale infinitely without manual intervention."
  • CFO: "The cost of ingesting and storing this data must be strictly controlled. We cannot pay for idle capacity."

Technical Requirements:

  • Ingest up to 100,000 messages per second
  • Low-latency processing for real-time alerts
  • Time-series data storage for historical analysis
  • Handle variable network connectivity (cars driving through tunnels)

Constraints:

  • Strict budget for data ingestion
  • Small data engineering team

QUESTION:
To meet the CTO's requirement for infinite scaling and the technical requirement to ingest 100,000 messages per second, which ingestion and processing pipeline should you design?

Answer options:

A.

Ingest messages into Cloud SQL, process them with Compute Engine cron jobs, and store the results in BigQuery.

B.

Ingest messages into Cloud Pub/Sub, process them with Cloud Dataflow, and store the results in Cloud Bigtable.

C.

Ingest messages using an HTTP Load Balancer to Cloud Run, and store directly in Cloud Storage.

D.

Use Apache Kafka on Compute Engine for ingestion, Apache Spark on Dataproc for processing, and Cassandra on Compute Engine for storage.

How to approach this question

Recognize the standard GCP streaming data pipeline pattern: Pub/Sub (Ingest) -> Dataflow (Process) -> Bigtable/BigQuery (Store). Bigtable is preferred for high-throughput, low-latency time-series data.

Full Answer

B.Ingest messages into Cloud Pub/Sub, process them with Cloud Dataflow, and store the results in Cloud Bigtable.✓ Correct
The combination of Cloud Pub/Sub, Cloud Dataflow, and Cloud Bigtable is the standard architectural pattern for high-throughput IoT telemetry ingestion on GCP. Pub/Sub decouples the ingestion, Dataflow handles the windowing and late data (crucial for cars losing signal), and Bigtable provides single-digit millisecond latency for time-series reads/writes.

Common mistakes

Choosing the self-managed open-source stack (D). While Kafka/Spark/Cassandra works, managing it requires a large team and manual scaling, violating the case study constraints.

Practice the full GCP Professional Cloud Architect Practice Exam 3

50 questions · hints · full answers · grading

More questions from this exam