InterviewStack.io LogoInterviewStack.io

Computer Vision Fundamentals Questions

Core concepts and methods in computer vision with an emphasis on both traditional image processing and modern deep learning approaches. Candidates should understand how images are represented as matrices or tensors, common preprocessing steps and augmentation techniques to improve generalization, and fundamentals of convolutional neural networks including convolution operations, receptive fields, pooling, and normalization. Familiarity with common vision tasks such as image classification, object detection, semantic and instance segmentation, and key model design patterns is expected. Candidates should know common vision architectures and families such as residual networks and Visual Geometry Group style networks, the role of pretrained models and transfer learning, how to fine tune models for new tasks, and practical tooling including image processing libraries and deep learning frameworks for training and inference. Evaluation may include trade offs between accuracy, latency, and resource usage for deployment.

EasyTechnical
91 practiced
Describe the architectural differences between a VGG-style network and a ResNet block. Explain why residual connections help training deeper networks, and give an example scenario where a bottleneck residual block would be beneficial.
MediumTechnical
48 practiced
In a face-detection system where false negatives are critical, propose strategies to reduce false negatives at training and inference time without causing unacceptable spike in false positives. Discuss threshold tuning, loss weighting, cascaded detectors, and post-processing techniques.
HardTechnical
44 practiced
Propose a rigorous experimental protocol to fairly compare two object-detection algorithms on an internal dataset. Include dataset splitting, cross-validation or holdout strategy, hyperparameter tuning, seed control, compute reporting, metrics to prioritize, and statistical tests to assert significance.
EasyTechnical
62 practiced
List and explain the typical preprocessing steps applied to images before training a convolutional neural network. Cover resizing, cropping, normalization, color-space conversions, and handling aspect ratio. For inference, discuss deterministic preprocessing and when to use center crop versus resize with aspect-ratio preservation.
EasyTechnical
50 practiced
Explain the purpose and mechanics of Batch Normalization, Layer Normalization, Group Normalization, and Instance Normalization. For each, describe where it is applied in vision models, how it depends on batch size, and scenarios where one is preferred over the others.

Unlock Full Question Bank

Get access to hundreds of Computer Vision Fundamentals interview questions and detailed answers.

Sign in to Continue

Join thousands of developers preparing for their dream job.