Multi Region Disaster Recovery Questions

Designing systems for resilience and availability across geographic regions, including strategies for cross region replication, failover, and operational recovery. Candidates should understand deployment models such as active active and active passive and the trade offs they imply for availability, consistency, cost, and operational complexity. Discuss replication topologies and the differences between synchronous and asynchronous replication and how those choices affect consistency and the recovery point objective. Cover leader election and failover coordination mechanisms, conflict resolution approaches including last write wins, version vectors, and convergent data types, and implications for transactional guarantees and global transactions. Include global traffic routing and failover techniques such as DNS based routing, global load balancing, health checks, and the impact of routing and time to live on failover behavior. Address data partitioning and cross region latency trade offs, strategies for orchestrating data recovery and region seeding, backup and restore practices, and testing approaches such as planned failovers, rehearsal drills, and chaos testing. Explain how to derive and meet recovery time objective and recovery point objective from business requirements, and consider monitoring, observability, automation, runbooks, cost considerations, and compliance and data residency requirements.

EasyTechnical

0 practiced

Explain synchronous vs asynchronous replication between regions and how each impacts RPO, write latency, and system throughput. Given a 100ms WAN RTT between regions, discuss practical limitations of synchronous cross-region replication for a high-throughput OLTP service.

MediumTechnical

0 practiced

You must evaluate cost vs availability for a global service and recommend whether to run active-active or active-passive. Create a decision framework (cost model and risk analysis) that includes probabilities of regional failure, cost per-hour for standby, and business impact of downtime.

MediumSystem Design

0 practiced

For an application relying on a regional cache for sessions, design a cross-region session failover approach that minimizes user disruption. Consider trade-offs between synchronous replication of session data, global session stores, and session replay, and describe eventual user experience after failover.

MediumTechnical

0 practiced

Design an automation workflow to orchestrate a region failover: include fencing of old primaries, database promotion, load balancer updates, DNS changes, smoke tests, and rollback triggers. Which orchestration tools would you pick and why? Provide idempotency and safety considerations.

HardTechnical

0 practiced

Provide pseudocode for merging two version vectors for concurrent updates and detecting whether one update causally dominates another or if they are concurrent. Explain how you'd use this in a reconciliation service that automatically merges non-concurrent updates and surfaces concurrent ones.

Unlock Full Question Bank

Get access to hundreds of Multi Region Disaster Recovery interview questions and detailed answers.

Join thousands of developers preparing for their dream job.