InterviewStack.io LogoInterviewStack.io

Classification and Regression Fundamentals Questions

Covers the core concepts and distinctions between classification and regression in supervised learning. Classification predicts discrete categories, either binary or multi class, while regression predicts continuous numerical values. Candidates should understand how to format and encode target variables for each task, common algorithms for each family, and the theoretical foundations of representative models such as linear regression and logistic regression. For regression, know least squares estimation, coefficients interpretation, residual analysis, assumptions of the linear model, R squared, and common loss and error measures including mean squared error, root mean squared error, and mean absolute error. For classification, know logistic regression with its sigmoid transformation and probability interpretation, decision trees, k nearest neighbors, and other basic classifiers; understand loss functions such as cross entropy and evaluation metrics including accuracy, precision, recall, F one score, and area under the receiver operating characteristic curve. Also be prepared to discuss model selection, regularization techniques such as L one and L two regularization, handling class imbalance, calibration and probability outputs, feature preprocessing and encoding for targets and inputs, and trade offs when choosing approaches based on problem constraints and data characteristics.

MediumTechnical
27 practiced
You observe that your regression model residuals have a pattern when plotted against fitted values, suggesting non-linearity. As a BI analyst, how would you modify features or the modeling approach to address this, and what visualizations would you add to your report to show improvement?
HardTechnical
27 practiced
You built multiple regression models with different feature sets and need to pick one to embed into an automated BI report. Explain how you would use nested cross-validation to estimate generalization performance and avoid overfitting during model and feature selection.
MediumTechnical
23 practiced
Implement a short pseudocode algorithm for gradient descent to fit a simple linear regression (one feature). Explain learning rate selection and one method to detect if the algorithm is diverging during training. Keep the code language-agnostic.
HardTechnical
22 practiced
A BI dashboard needs to display an ensemble model's predictions but stakeholders want a simple explanation of how the prediction was made. Describe two techniques to explain ensemble outputs and how feasible they are for real-time dashboard tooltips.
EasyTechnical
22 practiced
You are asked to create a binary churn-prediction model for customers where churn events are only 3% of the data. List at least five practical steps you would take during data preparation and modeling to handle this class imbalance before handing the model to product managers.

Unlock Full Question Bank

Get access to hundreds of Classification and Regression Fundamentals interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.