Probability and Statistical Inference Questions

Covers fundamental probability theory and statistical inference from first principles to practical applications. Core probability concepts include sample spaces and events, independence, conditional probability, Bayes theorem, expected value, variance, and standard deviation. Reviews common probability distributions such as normal, binomial, Poisson, uniform, and exponential, their parameters, typical use cases, computation of probabilities, and approximation methods. Explains sampling distributions and the Central Limit Theorem and their implications for estimation and confidence intervals. Presents descriptive statistics and data summary measures including mean, median, variance, and standard deviation. Details the hypothesis testing workflow including null and alternative hypotheses, p values, statistical significance, type one and type two errors, power, effect size, and interpretation of results. Reviews commonly used tests and methods and guidance for selection and assumptions checking, including z tests, t tests, chi square tests, analysis of variance, and basic nonparametric alternatives. Emphasizes practical issues such as correlation versus causation, impact of sample size and data quality, assumptions validation, reasoning about rare events and tail risks, and communicating uncertainty. At more advanced levels expect experimental design and interpretation at scale including A B tests, sample size and power calculations, multiple testing and false discovery rate adjustment, and design choices for robust inference in real world systems.

EasyTechnical

0 practiced

You are given a dataset of user click events with a binary label "converted" (1 if user purchased, 0 otherwise). Define the terms: sample space, event, outcome, and probability measure in the context of this dataset. Give one concrete example of an event and compute its empirical probability from a small sample of 200 events where 30 conversions occurred.

EasyTechnical

0 practiced

Define expected value, variance, and standard deviation. For a discrete random variable X taking values {0,1,2} with probabilities {0.2, 0.5, 0.3}, compute E[X], Var(X), and SD(X). Explain how variance differs from mean absolute deviation and why variance uses squared deviations.

HardTechnical

0 practiced

You suspect data collection bias where certain user segments are underrepresented. Explain how to test for sampling bias statistically and one method to adjust analyses or model training to account for unequal representation.

MediumTechnical

0 practiced

Explain the bootstrap method for estimating the sampling distribution of a statistic (e.g., median) using resampling. In Python, outline (pseudocode acceptable) how you'd implement a bootstrap to estimate a 95% CI for the median of a skewed metric collected from 1,000 users.

HardTechnical

0 practiced

A/B test has multiple sequential peeks by product managers checking metrics daily. Explain the statistical problem with peeking and demonstrate how to control Type I error when testing sequentially (describe alpha-spending or use of sequential tests).

Unlock Full Question Bank

Get access to hundreds of Probability and Statistical Inference interview questions and detailed answers.

Join thousands of developers preparing for their dream job.