Atlas | VitaVision

Atlas 91

Practical computer vision atlas — algorithms, models, and concepts.

Algorithms47

Daniilidis Dual-Quaternion Hand-Eye Calibration
Algorithm·Hand–eye calibration·1999

Solve the hand-eye equation AX=XB jointly for rotation and translation by parametrising rigid motions as unit dual quaternions and extractin…

compared withTsai-Lenz Hand-Eye
Tsai-Lenz Hand-Eye Calibration
Algorithm·Hand–eye calibration·1989

Recover the constant rigid transform from a robot gripper to a rigidly mounted camera by solving the AX=XB equation in two stages — modified…

Viola–Jones Object Detector
Algorithm·Detection·2001

Real-time frontal-face detection by sliding a fixed 24×24 sub-window across a grayscale image at multiple scales, scoring each position with…

compared withDeformable Part·HOG

Partition an image into perceptually coherent regions by a Kruskal-style greedy merge over a pixel graph, accepting an inter-component edge …

GrabCut Iterative Segmentation
Algorithm·Image segmentation·2004

Extract a foreground from a colour image using a single bounding rectangle as the only required input by alternating Gaussian mixture compon…

Graph-Cut Interactive Segmentation
Algorithm·Image segmentation·2001

Compute the global minimum of a binary region-and-boundary MRF energy as a single s-t min-cut on a pixel graph; user-marked seeds enter as h…

extended byGrabCut Iterative
Fischler–Bolles RANSAC
Algorithm·Geometry·1981

Founding random-sample-consensus paradigm: fit a parametric model to data containing an unknown fraction of gross outliers by drawing minima…

extended byUSAC·MAGSAC

Robust estimator that eliminates the user-tuned inlier threshold by treating the noise scale σ as a random variable on [0, σ_max] and margin…

compared withUSAC
USAC: Universal RANSAC Framework
Algorithm·Geometry·2013

Engineering decomposition of practical RANSAC into four pluggable stages — sampling (PROSAC), model verification (SPRT), local optimisation …

compared withMAGSAC

Rectifies a monocular image to a metric overhead (bird's-eye) view by constructing the rectifying homography in closed form from two CNN-reg…

Deformable Part Models
Algorithm·Detection·2010

Detect a target object class in arbitrary images by scoring every position and scale in a HOG feature pyramid with a mixture of star-structu…

compared withViola
Longuet-Higgins Linear Eight-Point Algorithm
Algorithm·Fundamental-matrix estimation·1981

1981 closed-form linear method for relative orientation of two viewpoints from eight calibrated point correspondences, introducing the bilin…

Yang Parametric-Model Sub-Pixel Corner Fit
Algorithm·Corner detection·2018

Refine pixel-level chessboard corner positions to sub-pixel accuracy by nonlinear least-squares fitting a seven-parameter ideal blurred-corn…

SURF: Speeded Up Robust Features
Algorithm·Feature detection·2006

Detects scale- and rotation-invariant blob keypoints as scale-space maxima of the Hessian determinant, approximated with box filters on an i…

compared withSIFT·FAST Corner·Harris Corner+3
Canny Edge Detector
Algorithm·Features·1986

Detect thin step edges in greyscale images by smoothing with a Gaussian, computing gradient magnitude and direction, suppressing non-maxima …

Scaramuzza Omnidirectional Camera Calibration
Algorithm·Camera calibration·2006

Calibrate any central catadioptric or fisheye camera from a few planar checkerboard views by fitting a radially-symmetric Taylor-polynomial …

EPnP: O(n) Perspective-n-Point
Algorithm·Geometry·2009

Non-iterative O(n) solver for the calibrated Perspective-n-Point problem: express the n reference points as weighted sums of four virtual co…

Fast Radial Symmetry Transform
Algorithm·Feature detection·2003

Gradient-vote operator that highlights pixels of high local radial symmetry — bright/dark blobs and approximately circular features. Each pi…

extended byGeneralised Fast
Generalised Fast Radial Symmetry
Algorithm·Feature detection·2012

Affine extension of FRST: each pixel votes along a corrected direction $\hat V = G M G^{-1} M^{-1} \nabla I$ at radius $n$, where $G = R D \…

Gao Dual-Homography Stitching
Algorithm·Image stitching·2011

Stitch two-plane outdoor panoramas by clustering SIFT correspondences into a ground group and a distant group via spatial K-means, fitting o…

Geiger Chessboard Corner Detector
Algorithm·Chessboard detection·2012

Detect checkerboard X-corners by computing a four-quadrant corner likelihood at each pixel using axis-aligned and 45°-rotated prototype filt…

GP Checkerboard Enhancement (PyCBD)
Algorithm·Corner detection·2023

Post-process a partially detected checkerboard by training two Gaussian processes (one per pixel coordinate) on the allocated (boardXY, boar…

compared withOCPAD
Lin Smoothly Varying Affine Stitching
Algorithm·Image stitching·2011

Stitch two images under moderate parallax by replacing the global affine with a per-feature deviation field, regularised to be smooth via a …

Normalised Eight-Point Algorithm
Algorithm·Fundamental-matrix estimation·1997

Compute the fundamental matrix from n ≥ 8 point correspondences by conditioning the linear DLT system via a similarity normalisation, recove…

Sturm-Maybank Plane-Based Calibration
Algorithm·Camera calibration·1999

Recover camera intrinsics from one or more views of one or more planar targets via the same two IAC-on-homography constraints as Zhang's met…

compared withZhang's Planar
Tsai's Versatile Camera Calibration
Algorithm·Camera calibration·1987

Two-stage 1987 camera calibration that uses the radial alignment constraint to recover extrinsics and image scale linearly from a precision …

extended byZhang's Planar
As-Projective-As-Possible Image Stitching
Algorithm·Image stitching·2013

Replace a global homography with a spatially varying field of homographies, each fit by a per-cell weighted DLT (Moving DLT) on the same poi…

Kumar-Ahuja Generalized Radial Alignment Constraint
Algorithm·Camera calibration·2014

Extend Tsai's radial alignment constraint to a non-frontal sensor by modelling lens–sensor tilt as a 2-DoF rotation, projecting observations…

Localized Radon Checkerboard Corners
Algorithm·Corner detection·2018

Detect checkerboard X-junctions by approximating a localized Radon transform with 1-D box filters on rotated copies of the image; the per-pi…

Zhang's Planar Camera Calibration
Algorithm·Camera calibration·2000

Recover camera intrinsics, radial distortion, and per-view extrinsics from at least three images of a planar pattern at different orientatio…

Pyramidal Blur-Aware X-Corner Chessboard Detector
Algorithm·Corner detection·2021

Detect chessboard X-junctions in heavily blurred or high-resolution images by computing a 16-sample circular x-corner intensity at every lev…

OCPAD: Occluded Checkerboard Pattern Detection
Algorithm·Corner detection·2016

Recover the largest visible checkerboard subgraph from a partially occluded pattern by running VF2 subgraph isomorphism against a model grap…

compared withGP Checkerboard
Topological Grid Finding
Algorithm·Chessboard detection·2009

Recover the integer $(i, j)$ grid coordinate of every corner in a checkerboard calibration image by Delaunay-triangulating the corners, merg…

ChESS Corners
Algorithm·Corner detection·2013

A chessboard-specific corner detector: scores each pixel by how well its local neighborhood matches an alternating bright-dark X-junction pa…

FAST Corner Detector
Algorithm·Corner detection·2006

Segment-test corner detector on a 16-pixel Bresenham ring of radius 3 around each candidate; classifies a point as a corner when N contiguou…

compared withHarris Corner·SIFT·SURF
HOG: Histograms of Oriented Gradients
Algorithm·Detection·2005

Compute a fixed-length descriptor for an image window by binning pixel gradients into 8×8 cells of 9 unsigned-orientation histograms, normal…

compared withViola

Encodes a Gaussian-smoothed image patch around a detected keypoint as a 128/256/512-bit binary string by running a fixed table of pairwise p…

extended byORB
ORB: Oriented FAST and Rotated BRIEF
Algorithm·Feature detection·2011

Detects rotation-invariant oriented keypoints by running FAST-9 on a √2 image pyramid, ranking by Harris cornerness, and assigning orientati…

compared withSIFT·SURF
ROCHADE: Robust Checkerboard Advanced Detection
Algorithm·Corner detection·2014

Detect a full planar checkerboard in an image by reducing the gradient-magnitude edge set to a single-pixel centreline graph, extracting inn…

Chessboard Detection via X-Corners and Topology
Algorithm·Chessboard detection·2013

Detect every corner of a chessboard calibration pattern and assign it an integer grid coordinate by counting ring-alternations to locate X-j…

PuzzleBoard
Algorithm·Chessboard detection·2024

Detect and decode a self-identifying checkerboard calibration pattern: saddle-point corners from a Hessian response, grid reconstruction via…

compared withChESS Corners
Harris Corner Detector
Algorithm·Corner detection·1988

Scores each pixel by the Harris response R = det(M) − k·tr(M)², where M is the gradient covariance matrix summed over a Gaussian window; ret…

Shi-Tomasi Corner Detector
Algorithm·Corner detection·1994

Scores each pixel by the smaller eigenvalue of the gradient structure tensor M; returns integer pixel locations where that eigenvalue exceed…

compared withHarris Corner·SIFT·SURF
Black-Anandan Robust Optical Flow
Algorithm·Features·1996

Optical flow that replaces the quadratic data and smoothness penalties of variational flow with redescending M-estimators, solved by SOR wit…

Horn-Schunck Optical Flow
Algorithm·Features·1981

Dense optical flow recovered by minimising a variational energy that combines the brightness-constancy constraint with a global smoothness p…

extended byBlack-Anandan Robust
Lucas-Kanade Image Registration
Algorithm·Features·1981

Iterative Newton-Raphson method that estimates the parametric warp between two images by linearising the residual and solving the resulting …

SIFT: Scale-Invariant Feature Transform
Algorithm·Feature detection·2004

Detects keypoints as scale-space extrema in a Difference-of-Gaussian image pyramid, refines location and scale by 3D quadratic interpolation…

Models18

Faster R-CNN
Model·Detection·2015

Two-stage CNN object detector that replaces external Selective Search / EdgeBoxes proposals with a learned Region Proposal Network sharing c…

extended byMask R-CNN
YOLOv1
Model·Detection·2016

Single-stage CNN object detector that frames detection as one regression problem from full-image pixels to a 7×7×30 tensor of grid-cell box …

compared withFaster R-CNN
DeepLab
Model·Image segmentation·2018

Dense semantic segmentation by repurposing an ImageNet classifier with atrous (dilated) convolution to preserve spatial resolution, an Atrou…

compared withU-Net
Mask R-CNN
Model·Image segmentation·2017

Two-stage instance segmentation by adding a parallel FCN mask branch to Faster R-CNN — per-class binary masks predicted at each RoI under a …

U-Net
Model·Image segmentation·2015

Symmetric encoder-decoder fully-convolutional network for dense pixel-wise biomedical image segmentation — contracting path with channel-dou…

compared withDeepLab
FCN: Fully Convolutional Networks
Model·Image segmentation·2015

Encoder-decoder CNN for dense pixel-wise classification — converts ImageNet classifiers into fully convolutional networks via 1×1-conv reint…

extended byU-Net·DeepLab
ResNet
Model·Image classification·2016

Family of very deep CNN image classifiers (18 to 152 layers) built from residual blocks $y = \mathcal{F}(x, \{W_i\}) + x$ that reformulate e…

compared withGoogLeNet
AlexNet
Model·Image classification·2012

Eight-layer convolutional neural network for 1000-class image classification on ImageNet, trained end-to-end on two GPUs with ReLU activatio…

extended byVGG
GoogLeNet
Model·Image classification·2015

Twenty-two-layer CNN built from Inception modules — parallel 1×1, 3×3, 5×5 convolutions and 3×3 max-pool concatenated along the channel axis…

compared withVGG·AlexNet·ResNet
VGG
Model·Image classification·2014

Family of very deep CNN image classifiers (11 to 19 weight layers) built from stacked 3×3 convolutions with stride 1 and 2×2 max-pool stride…

extended byResNet
MATE
Model·Corner detection·2016

First learned per-pixel checkerboard X-corner detector: a three-convolutional-layer CNN with 2,939 parameters trained with mean-squared-erro…

compared withCCDN·CCS
SuperPoint
Model·Feature detection·2018

Fully-convolutional CNN that jointly detects interest points and computes 256-D descriptors in a single forward pass, trained without human …

compared withXFeat
CCDN
Model·Corner detection·2023

Fully convolutional network that regresses a per-pixel checkerboard-corner response map; trained with weighted cross-entropy and paired with…

compared withCCS·MATE
XFeat
Model·Feature detection·2024

Lightweight CNN that jointly detects keypoints, extracts 64-D dense descriptors, and refines semi-dense matches from coarse descriptor pairs…

compared withLoFTR·SuperPoint
CCS
Model·Camera calibration·2022

Three-stage learning-based camera calibration pipeline: a CNN regresses radial-distortion-correction parameters, a UNet predicts per-corner …

compared withCCDN·MATE
LightGlue
Model·Feature matching·2023

Adaptive-depth Transformer matcher for sparse local features: stacks 9 self+cross-attention layers with rotary positional encoding and a per…

compared withLoFTR
LoFTR
Model·Feature matching·2021

Detector-free dense feature matcher: shared CNN backbone produces coarse and fine feature maps, a Linear Transformer with interleaved self- …

compared withXFeat·LightGlue·SuperGlue
SuperGlue
Model·Feature matching·2020

Graph neural network that matches two sets of sparse local features by jointly finding correspondences and rejecting unmatched keypoints in …

extended byLightGlue

Concepts26

Attention Mechanism
Concept·Features·2020

Computes each output element as a learned, input-dependent weighted average of value vectors, letting every element aggregate information fr…

Convolution
Concept·Image formation·1986

The linear, shift-invariant operation that produces each output pixel as a kernel-weighted sum of input pixels in a local neighbourhood.

Energy Minimization
Concept·Segmentation·2001

The framework that poses image labelling and segmentation as minimising an objective combining a per-pixel data term and a pairwise smoothne…

Image Pyramid
Concept·Image formation·2004

A discrete multi-resolution representation — a sequence of images at progressively coarser resolution, each smoothed and downsampled from it…

Integral Image
Concept·Features·1984

A precomputed prefix-sum array that returns the sum of pixel values over any axis-aligned rectangle in constant time with four array reads.

Non-Maximum Suppression
Concept·Features·1986

Reducing a dense response map or a set of overlapping detections to a sparse set of local maxima by discarding every element that is not str…

Pinhole Camera Model
Concept·Image formation·2000

The projective map from 3-D scene points to 2-D image pixels through a single centre of projection, parameterised by an intrinsic matrix and…

SVD Null-Space Estimation
Concept·Geometry·1981

Estimating a geometric entity defined only up to scale by stacking constraints into a homogeneous linear system and taking the smallest righ…

RANSAC
Concept·Geometry

Random sample consensus — a paradigm for fitting a parametric model to data containing an unknown fraction of gross outliers, by drawing min…

Topological Grid Recovery
Concept·Features

Verify candidate calibration-pattern corners by constructing a graph over them (Delaunay triangulation, k-nearest-neighbours, or proximity) …

Bundle Adjustment
Concept·Calibration·2000

Joint nonlinear least-squares refinement of all camera parameters — and, in structure-from-motion, all 3-D points — that minimises the total…

Convolutional Neural Network
Concept·Features·2012

A feed-forward network that builds a spatial hierarchy of learned features by alternating weight-shared convolution layers, pointwise nonlin…

Pose Estimation
Concept·Geometry·2009

Recovery of the 6-DOF rigid transformation — rotation and translation — relating a camera to a scene, an object, or a second camera.

Camera Distortion Models
Concept·Image formation

Mathematical models for departures from the ideal pinhole projection — radial barrel/pincushion, tangential decentering, thin-prism — and th…

DLT Normalisation
Concept·Geometry

A two-line similarity transform — translate the point centroid to the origin, isotropically scale so the average distance is √2 — that condi…

Epipolar Geometry
Concept·Geometry

The intrinsic projective geometry of two views of a scene, encoding the constraint that a point visible in one image must lie on a specific …

Homography
Concept·Geometry

An invertible projective transformation of the plane, represented by a 3×3 matrix defined up to a non-zero scalar, mapping points between tw…

Image Gradient
Concept·Features

The 2-vector of partial derivatives of image intensity with respect to spatial coordinates, measuring the rate and direction of brightness c…

Scale Space
Concept·Image formation

A one-parameter family of images obtained by progressively blurring an input image with Gaussians of increasing standard deviation, providin…

Feature Descriptors
Concept·Features·2004

Fixed-length vectors encoding the local image appearance around a keypoint, built so the same physical point yields similar descriptors acro…

Twenty-five years of methods for finding the inner corners of a planar checkerboard calibration target — from Harris-on-thresholded-images t…

Hessian Saddle Response
Concept·Features

A scalar response computed from the determinant of the image Hessian, negative at saddle points (X-corners) and zero at flat regions, edges,…

A 2011–2013 lineage of stitching methods that replace the single global homography with a spatially varying warp field — fitted as either tw…

Structure Tensor
Concept·Features

A symmetric 2×2 matrix formed by summing the outer products of the image gradient over a local window, encoding the dominant orientation and…

Feature Matching
Concept·Features·2004

Establishing keypoint correspondences between two images by comparing descriptors and resolving them into a consistent partial assignment — …

Optical Flow
Concept·Features·1981

The apparent 2-D velocity field of image brightness between consecutive frames, recovered from the spatio-temporal gradient under the bright…