InterviewStack.io LogoInterviewStack.io

Stream Processing and Event Streaming Questions

Designing and operating systems that ingest, process, and serve continuous event streams with low latency and high throughput. Core areas include architecture patterns for stream native and event driven systems, trade offs between batch and streaming models, and event sourcing concepts. Candidates should demonstrate knowledge of messaging and ingestion layers, message brokers and commit log systems, partitioning and consumer group patterns, partition key selection, ordering guarantees, retention and compaction strategies, and deduplication techniques. Processing concerns include stream processing engines, state stores, stateful processing, checkpointing and fault recovery, processing guarantees such as at least once and exactly once semantics, idempotence, and time semantics including event time versus processing time, watermarks, windowing strategies, late and out of order event handling, and stream to stream and stream to table joins and aggregations over windows. Performance and operational topics cover partitioning and scaling strategies, backpressure and flow control, latency versus throughput trade offs, resource isolation, monitoring and alerting, testing strategies for streaming pipelines, schema evolution and compatibility, idempotent sinks, persistent storage choices for state and checkpoints, and operational metrics such as stream lag. Familiarity with concrete technologies and frameworks is expected when discussing designs and trade offs, for example Apache Kafka, Kafka Streams, Apache Flink, Spark Structured Streaming, Amazon Kinesis, and common serialization formats such as Avro, Protocol Buffers, and JSON.

HardTechnical
49 practiced
Walk through techniques to obtain end-to-end exactly-once semantics when consuming from Kafka, processing in Flink, and writing results to an external relational database that does not support distributed transactions. Discuss Flink's two-phase commit sink, idempotent upserts, deduplication patterns, and the performance trade-offs of each approach.
MediumTechnical
37 practiced
Design a low-latency enrichment approach for a clickstream that must attach user segments stored in a 10M-entry external key-value store. Discuss caching (local vs distributed), TTLs, cache warming strategies, fallback behavior for misses, and how to maintain freshness without breaching latency SLAs.
MediumTechnical
40 practiced
Explain best practices for schema evolution in streaming systems using Avro or Protobuf with a schema registry. How would you handle an incompatible change that must be deployed (for example renaming a required field), and what migration patterns can prevent consumer breakage in production?
HardTechnical
47 practiced
As a senior data scientist, you are leading remediation after a production streaming ML pipeline introduced inconsistent feature values due to a schema change. Describe immediate containment actions (stop ingestion, freeze feature writes), steps for root cause analysis, medium/long-term fixes (testing, schema governance), and how you would communicate impact and timeline to stakeholders.
HardSystem Design
39 practiced
Design a multi-tenant streaming platform that allows many teams to deploy Flink jobs securely and efficiently. Address resource isolation (CPU/memory quotas), tenant namespace isolation for state, monitoring and alerting per tenant, safe upgrade and migration strategies, and mechanisms to prevent noisy-tenant effects.

Unlock Full Question Bank

Get access to hundreds of Stream Processing and Event Streaming interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.