InterviewStack.io LogoInterviewStack.io

Stream Processing and Event Streaming Questions

Designing and operating systems that ingest, process, and serve continuous event streams with low latency and high throughput. Core areas include architecture patterns for stream native and event driven systems, trade offs between batch and streaming models, and event sourcing concepts. Candidates should demonstrate knowledge of messaging and ingestion layers, message brokers and commit log systems, partitioning and consumer group patterns, partition key selection, ordering guarantees, retention and compaction strategies, and deduplication techniques. Processing concerns include stream processing engines, state stores, stateful processing, checkpointing and fault recovery, processing guarantees such as at least once and exactly once semantics, idempotence, and time semantics including event time versus processing time, watermarks, windowing strategies, late and out of order event handling, and stream to stream and stream to table joins and aggregations over windows. Performance and operational topics cover partitioning and scaling strategies, backpressure and flow control, latency versus throughput trade offs, resource isolation, monitoring and alerting, testing strategies for streaming pipelines, schema evolution and compatibility, idempotent sinks, persistent storage choices for state and checkpoints, and operational metrics such as stream lag. Familiarity with concrete technologies and frameworks is expected when discussing designs and trade offs, for example Apache Kafka, Kafka Streams, Apache Flink, Spark Structured Streaming, Amazon Kinesis, and common serialization formats such as Avro, Protocol Buffers, and JSON.

MediumTechnical
37 practiced
Design a monitoring and alerting plan for a production streaming platform. Include key metrics (consumer lag per partition, processing latency percentiles, checkpoint durations, state size, GC pause duration), threshold-based alerts, dashboards, and ideas for automated remediation or runbooks. How would you correlate infrastructure metrics to business impact?
HardSystem Design
67 practiced
Define SLIs and SLOs for a mission-critical streaming pipeline used for fraud detection. Suggest concrete thresholds for consumer lag, processing lateness percentiles (p95/p99), state restore time, error rates, and outline escalation and remediation playbooks triggered by SLO breaches.
MediumSystem Design
39 practiced
Design a streaming ETL pipeline for clickstream analytics with the following constraints: ingest 1,000,000 events/sec, support per-user sessionization, provide dashboards with <2s freshness for top metrics, enable reprocessing/backfills, and be cost-conscious on AWS. Provide a high-level architecture (ingestion, messaging, processing, state storage, long-term storage, serving), recommended technologies, and key trade-offs.
EasyTechnical
35 practiced
Compare Apache Flink, Kafka Streams, and Spark Structured Streaming from the perspective of a Solutions Architect advising a client with low-latency, stateful processing requirements. Cover strengths and weaknesses, state management approaches, event-time handling, latency vs throughput trade-offs, deployment and operational complexity, and typical cost implications.
MediumTechnical
48 practiced
Design a deduplication mechanism for a stream of events where each event includes a unique event_id but producers may retry. Constraints: sustained 100k events/sec, dedupe window up to 24 hours, limited per-node memory, and the sink must not receive duplicates. Describe an architecture using stream processors (e.g., Flink), state store design (TTL, compaction), and scaling and failure-handling considerations.

Unlock Full Question Bank

Get access to hundreds of Stream Processing and Event Streaming interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.