InterviewStack.io LogoInterviewStack.io

Stream Processing and Event Streaming Questions

Designing and operating systems that ingest, process, and serve continuous event streams with low latency and high throughput. Core areas include architecture patterns for stream native and event driven systems, trade offs between batch and streaming models, and event sourcing concepts. Candidates should demonstrate knowledge of messaging and ingestion layers, message brokers and commit log systems, partitioning and consumer group patterns, partition key selection, ordering guarantees, retention and compaction strategies, and deduplication techniques. Processing concerns include stream processing engines, state stores, stateful processing, checkpointing and fault recovery, processing guarantees such as at least once and exactly once semantics, idempotence, and time semantics including event time versus processing time, watermarks, windowing strategies, late and out of order event handling, and stream to stream and stream to table joins and aggregations over windows. Performance and operational topics cover partitioning and scaling strategies, backpressure and flow control, latency versus throughput trade offs, resource isolation, monitoring and alerting, testing strategies for streaming pipelines, schema evolution and compatibility, idempotent sinks, persistent storage choices for state and checkpoints, and operational metrics such as stream lag. Familiarity with concrete technologies and frameworks is expected when discussing designs and trade offs, for example Apache Kafka, Kafka Streams, Apache Flink, Spark Structured Streaming, Amazon Kinesis, and common serialization formats such as Avro, Protocol Buffers, and JSON.

EasyTechnical
0 practiced
Define 'at least once' and 'exactly once' processing guarantees in stream processing systems. Explain why these semantics matter for downstream ML models and analytics. Provide concrete examples where at-least-once leads to duplicate model predictions or inflated metrics and how idempotence or deduplication can mitigate issues.
HardTechnical
0 practiced
Explain how watermarks are implemented conceptually in Apache Flink and in Spark Structured Streaming. Discuss periodic watermark advancement, punctuated watermarks, and how watermark strategies and delay thresholds affect correctness and completeness in the presence of out-of-order events.
MediumSystem Design
0 practiced
Design an online feature store that supports real-time computed features from streams and offline recompute for training. Describe ingestion (stream vs batch), the online store choices (Redis, DynamoDB, etc.), freshness guarantees, APIs for serving features to models, handling backfills and replays, and how to validate online vs offline consistency.
HardTechnical
0 practiced
Your streaming job's checkpoint duration has doubled recently and restore time now causes extended outages. Describe your investigation steps to identify the root cause and optimizations you would apply (for example enabling incremental checkpoints, tuning RocksDB settings, parallelizing checkpoint, state compaction) to reduce checkpoint and restore durations without sacrificing durability.
HardSystem Design
0 practiced
Design a multi-tenant streaming platform that allows many teams to deploy Flink jobs securely and efficiently. Address resource isolation (CPU/memory quotas), tenant namespace isolation for state, monitoring and alerting per tenant, safe upgrade and migration strategies, and mechanisms to prevent noisy-tenant effects.

Unlock Full Question Bank

Get access to hundreds of Stream Processing and Event Streaming interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.