InterviewStack.io LogoInterviewStack.io

A and B Test Design Questions

Designing and running A and B tests and split tests to evaluate product and feature changes. Candidates should be able to form clear null and alternative hypotheses, select appropriate primary metrics and guardrail metrics that reflect both product goals and user safety, choose randomization and assignment strategies, and calculate sample size and test duration using power analysis and minimum detectable effect reasoning. They should understand applied statistical analysis concepts including p values confidence intervals one tailed and two tailed tests sequential monitoring and stopping rules and corrections for multiple comparisons. Practical abilities include diagnosing inconclusive or noisy experiments detecting and mitigating common biases such as peeking selection bias novelty effects seasonality instrumentation errors and network interference and deciding when experiments are appropriate versus alternative evaluation methods. Senior candidates should reason about trade offs between speed and statistical rigor plan safe rollouts and ramping define rollback plans and communicate uncertainty and business implications to technical and non technical stakeholders. For developer facing products candidates should also consider constraints such as small populations cross team effects ethical concerns and special instrumentation needs.

EasyTechnical
0 practiced
Explain the difference between user-level and session-level randomization. For a mobile shopping app where purchases are concentrated in sessions, recommend which unit to randomize and justify your decision considering contamination, intra-user correlation, and statistical power.
HardTechnical
0 practiced
Baseline conversion has been drifting upward by about 0.5% per week due to seasonality during your planned experiment. How do you adjust sample size and duration calculations to maintain power and avoid bias in estimated treatment effect? Describe both design and analysis strategies.
HardTechnical
0 practiced
You need to detect whether an observed uplift is driven by a novelty effect that fades over time. Design an experiment and analysis strategy to distinguish novelty from persistent treatment effects. Include time windows, decay modeling, and decision rules.
EasyTechnical
0 practiced
What is an A/A test and why might you run one before launching an A/B experiment? Describe the expected outcomes and the steps you would take if an A/A test shows a statistically significant difference between groups.
MediumTechnical
0 practiced
When is an A/B experiment inappropriate and what alternative evaluation methods would you propose? Consider scenarios like very small user populations, high deployment risk, or when user consent limits randomization.

Unlock Full Question Bank

Get access to hundreds of A and B Test Design interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.