InterviewStack.io LogoInterviewStack.io
☁️

Cloud & Infrastructure Topics

Cloud platform services, infrastructure architecture, Infrastructure as Code, environment provisioning, and infrastructure operations. Covers cloud service selection, infrastructure provisioning patterns, container orchestration (Kubernetes), multi-cloud and hybrid architectures, infrastructure cost optimization, and cloud platform operations. For CI/CD pipeline and deployment automation, see DevOps & Release Engineering. For cloud security implementation, see Security Engineering & Operations. For data infrastructure design, see Data Engineering & Analytics Infrastructure.

Technical Vision and Infrastructure Roadmap

This topic assesses a candidate's ability to define a multi year technical vision for infrastructure, platform, and systems and to translate that vision into a practical execution roadmap. Core skills include evaluating technology choices and architecture evolution, planning migration and modernization paths, anticipating scalability and capacity needs, and balancing cost performance with resilience and operational reliability. Candidates should demonstrate approaches to managing technical debt, sequencing investments across quarters and releases, estimating resources and timelines, establishing measurable infrastructure goals and key performance indicators, and implementing governance and standards. Discussion may also cover reliability and observability, security and compliance considerations, trade offs between short term stability and long term rearchitecture, prioritization to enable business outcomes, and communicating technical trade offs to both technical and non technical stakeholders.

40 questions

Analyzing Requirements and Service Selection

Given a business requirement (e.g., 'store real-time game data with sub-millisecond latency'), systematically identify appropriate cloud services and justify your choice based on performance, cost, and operational considerations. Articulate trade-offs explicitly.

48 questions

Capacity Planning and Resource Optimization

Covers forecasting, provisioning, and operating compute, memory, storage, and network resources efficiently to meet demand and service level objectives. Key skills include monitoring resource utilization metrics such as central processing unit usage, memory consumption, storage input and output and network throughput; analyzing historical trends and workload patterns to predict future demand; and planning capacity additions, safety margins, and buffer sizing. Candidates should understand vertical versus horizontal scaling, autoscaling policy design and cooldowns, right sizing instances or containers, workload placement and isolation, load balancing algorithms, and use of spot or preemptible capacity for interruptible workloads. Practical topics include storage planning and archival strategies, database memory tuning and buffer sizing, batching and off peak processing, model compression and inference optimization for machine learning workloads, alerts and dashboards, stress and validation testing of planned changes, and methods to measure that capacity decisions meet both performance and cost objectives.

40 questions

Cost Optimization at Scale

Addresses cost conscious design and operational practices for systems operating at large scale and high volume. Candidates should discuss measuring and improving unit economics such as cost per request or cost per customer, multi tier storage strategies and lifecycle management, caching, batching and request consolidation to reduce resource use, data and model compression, optimizing network and input output patterns, and minimizing egress and transfer charges. Senior discussions include product level trade offs, prioritization of cost reductions versus feature velocity, instrumentation and observability for ongoing cost measurement, automation and runbook approaches to enforce cost controls, and organizational practices to continuously identify, quantify, and implement savings without compromising critical service level objectives. The topic emphasizes measurement, benchmarking, risk assessment, and communicating expected savings and operational impacts to stakeholders.

40 questions

Cloud Integration and Hybrid Network Architecture

Designing connectivity and integration between on premise infrastructure and public cloud environments, including hybrid and multi cloud topologies. Topics include dedicated interconnect options and internet based connectivity, routing and network topology trade offs, virtual private networks and software defined wide area networks, network virtualization, security boundary and identity considerations across environments, latency and throughput implications, cost trade offs for interconnects, and operational practices for managing hybrid infrastructure and multi cloud complexity.

40 questions

Platform Architecture for Organizational Scale

Designing internal platforms and infrastructure to support large engineering organizations and evolving teams. Topics include developer experience and self service platform design, deployment platforms that enable safe frequent releases for hundreds of engineers, platform automation and observability patterns that provide cross service visibility, governance and operational policies, service onboarding and lifecycle, and how to evolve platform capabilities as headcount and service count grows. Candidates should discuss trade offs between centralized platform services and team autonomy, metrics for platform health, and approaches to encourage adoption while minimizing operational friction.

42 questions

Understanding the Company's Infrastructure Context

Research the company's public infrastructure information (engineering blog, tech talks, published case studies, job description). Understand what systems they operate at scale, what problems they likely face, and what your role would contribute to.

40 questions

Infrastructure Automation and Provisioning

Covers designing, implementing, and operating automated infrastructure provisioning and configuration using Infrastructure as Code practices and complementary automation patterns. Candidates should be able to select and author declarative infrastructure definitions with tools such as Terraform, CloudFormation, and Azure Resource Manager templates, and discuss configuration management tools such as Ansible, Puppet, or Chef. Core skills include modular and reusable code organization for multiple environments, variable and output management, remote state management and locking, idempotency and atomicity of operations, and version control integration for infrastructure artifacts. Candidates should understand testing and validation practices including linting, plan or dry run validation, unit and integration testing of infrastructure changes, and drift detection and remediation. The topic includes strategies for safe changes and rollbacks, change coordination, error handling and recovery, and deployment patterns such as canary and blue green where applicable. It also encompasses automation and orchestration patterns, immutable infrastructure and self healing practices, autoscaling and scaling policies, automated patching and updates, secrets handling patterns using secret managers, and integrating observability and monitoring into automated workflows. Finally, candidates should be able to reason about trade offs between imperative and declarative approaches, scaling Infrastructure as Code across large projects and teams, and security and compliance considerations for automated provisioning.

40 questions

Cost Aware Architecture and Design

Focuses on how architectural decisions and design patterns affect operating cost and total cost of ownership. Interviewees should be able to reason about trade offs such as managed services versus self managed components, always on virtual machines versus event driven or serverless approaches, reserved versus on demand capacity, use of spot or preemptible instances, and multi region or edge placement. Candidates should demonstrate techniques for reducing cost through storage class selection and lifecycle policies, caching and batching, query and workload optimization, data transfer minimization, and workload isolation. The topic also covers modeling and communicating cost trade offs, estimating ongoing operating expense for alternative designs, and choosing architecture that balances budget constraints with reliability, performance, and engineering effort.

41 questions
Page 1/6