Atlas 117

Practical computer vision atlas — algorithms, models, and concepts.

Algorithms 47

Daniilidis Dual-Quaternion Hand-Eye Calibration

Algorithm·Hand–eye calibration·1999

Solve the hand-eye equation AX=XB jointly for rotation and translation by parametrising rigid motions as unit dual quaternions and extractin…

→ compared with Tsai-Lenz Hand-Eye

Tsai-Lenz Hand-Eye Calibration

Algorithm·Hand–eye calibration·1989

Recover the constant rigid transform from a robot gripper to a rigidly mounted camera by solving the AX=XB equation in two stages — modified…

→ compared with Daniilidis Dual-Quaternion

Viola–Jones Object Detector

Algorithm·Detection·2001

Real-time frontal-face detection by sliding a fixed 24×24 sub-window across a grayscale image at multiple scales, scoring each position with…

→ compared with Deformable Part·HOG

Felzenszwalb–Huttenlocher Graph-Based Image Segmentation

Algorithm·Image segmentation·2004

Partition an image into perceptually coherent regions by a Kruskal-style greedy merge over a pixel graph, accepting an inter-component edge …

GrabCut Iterative Segmentation

Algorithm·Image segmentation·2004

Extract a foreground from a colour image using a single bounding rectangle as the only required input by alternating Gaussian mixture compon…

Graph-Cut Interactive Segmentation

Algorithm·Image segmentation·2001

Compute the global minimum of a binary region-and-boundary MRF energy as a single s-t min-cut on a pixel graph; user-marked seeds enter as h…

→ extended by GrabCut Iterative

Fischler–Bolles RANSAC

Algorithm·Geometry·1981

Founding random-sample-consensus paradigm: fit a parametric model to data containing an unknown fraction of gross outliers by drawing minima…

→ extended by USAC·MAGSAC

MAGSAC: Marginalising Sample Consensus

Algorithm·Geometry·2019

Robust estimator that eliminates the user-tuned inlier threshold by treating the noise scale σ as a random variable on [0, σ_max] and margin…

→ compared with USAC

USAC: Universal RANSAC Framework

Algorithm·Geometry·2013

Engineering decomposition of practical RANSAC into four pluggable stages — sampling (PROSAC), model verification (SPRT), local optimisation …

→ compared with MAGSAC

Geometric Bird's-Eye-View Rectification

Algorithm·Geometry·2019

Rectifies a monocular image to a metric overhead (bird's-eye) view by constructing the rectifying homography in closed form from two CNN-reg…

Deformable Part Models

Algorithm·Detection·2010

Detect a target object class in arbitrary images by scoring every position and scale in a HOG feature pyramid with a mixture of star-structu…

→ compared with Viola

Longuet-Higgins Linear Eight-Point Algorithm

Algorithm·Fundamental-matrix estimation·1981

1981 closed-form linear method for relative orientation of two viewpoints from eight calibrated point correspondences, introducing the bilin…

→ extended by Normalised Eight-Point

Yang Parametric-Model Sub-Pixel Corner Fit

Algorithm·Corner detection·2018

Refine pixel-level chessboard corner positions to sub-pixel accuracy by nonlinear least-squares fitting a seven-parameter ideal blurred-corn…

→ compared with ROCHADE·Geiger Chessboard·Pyramidal Blur-Aware (+1 more)

SURF: Speeded Up Robust Features

Algorithm·Feature detection·2006

Detects scale- and rotation-invariant blob keypoints as scale-space maxima of the Hessian determinant, approximated with box filters on an i…

→ compared with SIFT·FAST Corner·Harris Corner (+3 more)

Canny Edge Detector

Algorithm·Features·1986

Detect thin step edges in greyscale images by smoothing with a Gaussian, computing gradient magnitude and direction, suppressing non-maxima …

Scaramuzza Omnidirectional Camera Calibration

Algorithm·Camera calibration·2006

Calibrate any central catadioptric or fisheye camera from a few planar checkerboard views by fitting a radially-symmetric Taylor-polynomial …

EPnP: O(n) Perspective-n-Point

Algorithm·Geometry·2009

Non-iterative O(n) solver for the calibrated Perspective-n-Point problem: express the n reference points as weighted sums of four virtual co…

Fast Radial Symmetry Transform

Algorithm·Feature detection·2003

Gradient-vote operator that highlights pixels of high local radial symmetry — bright/dark blobs and approximately circular features. Each pi…

→ extended by Generalised Fast

Generalised Fast Radial Symmetry

Algorithm·Feature detection·2012

Affine extension of FRST: each pixel votes along a corrected direction $\hat V = G M G^{-1} M^{-1} \nabla I$ at radius $n$, where $G = R D \…

Gao Dual-Homography Stitching

Algorithm·Image stitching·2011

Stitch two-plane outdoor panoramas by clustering SIFT correspondences into a ground group and a distant group via spatial K-means, fitting o…

→ extended by As-Projective-As-Possible Image

Geiger Chessboard Corner Detector

Algorithm·Chessboard detection·2012

Detect checkerboard X-corners by computing a four-quadrant corner likelihood at each pixel using axis-aligned and 45°-rotated prototype filt…

→ compared with Pyramidal Blur-Aware·Chessboard Detection·Topological Grid (+1 more)

GP Checkerboard Enhancement (PyCBD)

Algorithm·Corner detection·2023

Post-process a partially detected checkerboard by training two Gaussian processes (one per pixel coordinate) on the allocated (boardXY, boar…

→ compared with OCPAD

Lin Smoothly Varying Affine Stitching

Algorithm·Image stitching·2011

Stitch two images under moderate parallax by replacing the global affine with a per-feature deviation field, regularised to be smooth via a …

→ extended by As-Projective-As-Possible Image

Normalised Eight-Point Algorithm

Algorithm·Fundamental-matrix estimation·1997

Compute the fundamental matrix from n ≥ 8 point correspondences by conditioning the linear DLT system via a similarity normalisation, recove…

Sturm-Maybank Plane-Based Calibration

Algorithm·Camera calibration·1999

Recover camera intrinsics from one or more views of one or more planar targets via the same two IAC-on-homography constraints as Zhang's met…

→ compared with Zhang's Planar

Tsai's Versatile Camera Calibration

Algorithm·Camera calibration·1987

Two-stage 1987 camera calibration that uses the radial alignment constraint to recover extrinsics and image scale linearly from a precision …

→ extended by Zhang's Planar

As-Projective-As-Possible Image Stitching

Algorithm·Image stitching·2013

Replace a global homography with a spatially varying field of homographies, each fit by a per-cell weighted DLT (Moving DLT) on the same poi…

Kumar-Ahuja Generalized Radial Alignment Constraint

Algorithm·Camera calibration·2014

Extend Tsai's radial alignment constraint to a non-frontal sensor by modelling lens–sensor tilt as a 2-DoF rotation, projecting observations…

Localized Radon Checkerboard Corners

Algorithm·Corner detection·2018

Detect checkerboard X-junctions by approximating a localized Radon transform with 1-D box filters on rotated copies of the image; the per-pi…

→ compared with ChESS Corners·Yang Parametric-Model

Zhang's Planar Camera Calibration

Algorithm·Camera calibration·2000

Recover camera intrinsics, radial distortion, and per-view extrinsics from at least three images of a planar pattern at different orientatio…

→ compared with Sturm-Maybank Plane-Based

Pyramidal Blur-Aware X-Corner Chessboard Detector

Algorithm·Corner detection·2021

Detect chessboard X-junctions in heavily blurred or high-resolution images by computing a 16-sample circular x-corner intensity at every lev…

→ compared with ChESS Corners·Geiger Chessboard·ROCHADE (+1 more)

OCPAD: Occluded Checkerboard Pattern Detection

Algorithm·Corner detection·2016

Recover the largest visible checkerboard subgraph from a partially occluded pattern by running VF2 subgraph isomorphism against a model grap…

→ compared with GP Checkerboard

Topological Grid Finding

Algorithm·Chessboard detection·2009

Recover the integer $(i, j)$ grid coordinate of every corner in a checkerboard calibration image by Delaunay-triangulating the corners, merg…

→ compared with Chessboard Detection·Geiger Chessboard

ChESS Corners

Algorithm·Corner detection·2013

A chessboard-specific corner detector: scores each pixel by how well its local neighborhood matches an alternating bright-dark X-junction pa…

→ compared with ROCHADE·Pyramidal Blur-Aware·PuzzleBoard (+2 more)

FAST Corner Detector

Algorithm·Corner detection·2006

Segment-test corner detector on a 16-pixel Bresenham ring of radius 3 around each candidate; classifies a point as a corner when N contiguou…

→ compared with Harris Corner·SIFT·SURF

HOG: Histograms of Oriented Gradients

Algorithm·Detection·2005

Compute a fixed-length descriptor for an image window by binning pixel gradients into 8×8 cells of 9 unsigned-orientation histograms, normal…

→ compared with Viola

BRIEF: Binary Robust Independent Elementary Features

Algorithm·Feature matching·2010

Encodes a Gaussian-smoothed image patch around a detected keypoint as a 128/256/512-bit binary string by running a fixed table of pairwise p…

→ extended by ORB

ORB: Oriented FAST and Rotated BRIEF

Algorithm·Feature detection·2011

Detects rotation-invariant oriented keypoints by running FAST-9 on a √2 image pyramid, ranking by Harris cornerness, and assigning orientati…

→ compared with SIFT·SURF

ROCHADE: Robust Checkerboard Advanced Detection

Algorithm·Corner detection·2014

Detect a full planar checkerboard in an image by reducing the gradient-magnitude edge set to a single-pixel centreline graph, extracting inn…

→ compared with Pyramidal Blur-Aware·ChESS Corners·Yang Parametric-Model

Chessboard Detection via X-Corners and Topology

Algorithm·Chessboard detection·2013

Detect every corner of a chessboard calibration pattern and assign it an integer grid coordinate by counting ring-alternations to locate X-j…

→ compared with Geiger Chessboard·Topological Grid

PuzzleBoard

Algorithm·Chessboard detection·2024

Detect and decode a self-identifying checkerboard calibration pattern: saddle-point corners from a Hessian response, grid reconstruction via…

→ compared with ChESS Corners

Harris Corner Detector

Algorithm·Corner detection·1988

Scores each pixel by the Harris response R = det(M) − k·tr(M)², where M is the gradient covariance matrix summed over a Gaussian window; ret…

→ compared with Shi-Tomasi Corner·FAST Corner·ChESS Corners (+2 more)

Shi-Tomasi Corner Detector

Algorithm·Corner detection·1994

Scores each pixel by the smaller eigenvalue of the gradient structure tensor M; returns integer pixel locations where that eigenvalue exceed…

→ compared with Harris Corner·SIFT·SURF

Black-Anandan Robust Optical Flow

Algorithm·Features·1996

Optical flow that replaces the quadratic data and smoothness penalties of variational flow with redescending M-estimators, solved by SOR wit…

Horn-Schunck Optical Flow

Algorithm·Features·1981

Dense optical flow recovered by minimising a variational energy that combines the brightness-constancy constraint with a global smoothness p…

→ extended by Black-Anandan Robust

Lucas-Kanade Image Registration

Algorithm·Features·1981

Iterative Newton-Raphson method that estimates the parametric warp between two images by linearising the residual and solving the resulting …

→ extended by Shi-Tomasi Corner·Black-Anandan Robust

SIFT: Scale-Invariant Feature Transform

Algorithm·Feature detection·2004

Detects keypoints as scale-space extrema in a Difference-of-Gaussian image pyramid, refines location and scale by 3D quadratic interpolation…

→ compared with Harris Corner·Shi-Tomasi Corner·FAST Corner (+3 more)

Models 42

FocalClick

Model·Image segmentation·2022

Practical click-based interactive segmentation that runs each click as a small local-crop forward pass (Segmentor on a Target Crop, Refiner …

→ compared with MobileSAM·SAM

RITM

Model·Image segmentation·2021

Feedforward click-based interactive segmentation: HRNet+OCR encoder-decoder taking RGB + positive/negative disk-encoded clicks + previous ma…

→ extended by FocalClick

Faster R-CNN

Model·Detection·2015

Two-stage CNN object detector that replaces external Selective Search / EdgeBoxes proposals with a learned Region Proposal Network sharing c…

→ extended by Mask R-CNN

YOLOv1

Model·Detection·2016

Single-stage CNN object detector that frames detection as one regression problem from full-image pixels to a 7×7×30 tensor of grid-cell box …

→ compared with Faster R-CNN

DeepLab

Model·Image segmentation·2018

Dense semantic segmentation by repurposing an ImageNet classifier with atrous (dilated) convolution to preserve spatial resolution, an Atrou…

→ compared with U-Net·BiSeNet·Fast-SCNN (+1 more)

Mask R-CNN

Model·Image segmentation·2017

Two-stage instance segmentation by adding a parallel FCN mask branch to Faster R-CNN — per-class binary masks predicted at each RoI under a …

→ compared with Mask2Former·SAM

U-Net

Model·Image segmentation·2015

Symmetric encoder-decoder fully-convolutional network for dense pixel-wise biomedical image segmentation — contracting path with channel-dou…

→ compared with DeepLab·SegFormer

FCN: Fully Convolutional Networks

Model·Image segmentation·2015

Encoder-decoder CNN for dense pixel-wise classification — converts ImageNet classifiers into fully convolutional networks via 1×1-conv reint…

→ extended by U-Net·DeepLab

NEW MiDaS

Model·Depth·2019

A monocular depth network trained for zero-shot cross-dataset transfer by mixing incompatible depth datasets under a scale-and-shift-invaria…

NEW MnasNet

Model·Image classification·2019

Mobile-CPU CNN image classifier discovered by platform-aware neural architecture search: an RNN controller trained with reinforcement learni…

NEW MobileNetV2

Model·Image classification·2018

Efficient mobile CNN backbone built from inverted-residual blocks with a linear bottleneck — depthwise-separable convolution expanded to a w…

NEW MobileNetV3

Model·Image classification·2019

Mobile-CPU-latency-targeted CNN backbone found by combined platform-aware NAS and NetAdapt, built from MobileNetV2 inverted-residual blocks …

→ compared with Fast-SCNN·BiSeNet

NEW RF-DETR

Model·Detection·2025

Light-weight specialist real-time detection transformer that discovers an accuracy-latency Pareto frontier for any target dataset from a sin…

BiSeNet

Model·Image segmentation·2018

Two-branch (bilateral) CNN for real-time semantic segmentation: a wide shallow path preserves spatial detail while a deep narrow path with g…

→ compared with SegFormer·DeepLab·HRNet (+2 more)

Fast-SCNN

Model·Image segmentation·2019

Real-time semantic segmentation CNN whose shared shallow 'Learning to Downsample' prefix feeds both a deep low-resolution global-feature bra…

→ compared with BiSeNet·DeepLab·SegFormer (+2 more)

SegFormer

Model·Image segmentation·2021

Hierarchical Transformer encoder (MiT) producing multi-scale features at $1/4, 1/8, 1/16, 1/32$ without positional encodings, plus an all-ML…

→ compared with FCN·DeepLab·U-Net (+4 more)

DETR

Model·Detection·2020

End-to-end object detector that recasts detection as direct set prediction — CNN backbone (ResNet-50/101) extracts $H/32 \times W/32$ featur…

→ compared with Faster R-CNN

HRNet

Model·Features·2019

CNN backbone family for dense prediction that maintains a high-resolution branch throughout the network and runs four parallel multi-resolut…

→ compared with ResNet·BiSeNet·Fast-SCNN (+1 more)

MAE

Model·Image classification·2021

Masked Autoencoder — self-supervised pretraining for Vision Transformers: randomly mask 75 % of input patches, feed the visible 25 % through…

Mask2Former

Model·Image segmentation·2022

Universal image segmentation family — MaskFormer (v1, NeurIPS 2021) reframes semantic segmentation as mask classification: predict a set…

→ compared with Mask R-CNN·SegFormer

SAM

Model·Image segmentation·2023

Promptable segmentation foundation model family — SAM (v1, 2023) introduces image-prompt segmentation with a heavy ViT-H encoder and lightwe…

→ extended by MobileSAM

ViT

Model·Image classification·2020

Vision Transformer — a pure-transformer image classification backbone that treats an image as a sequence of fixed-size patches: split RGB im…

→ compared with ResNet

ResNet

Model·Image classification·2016

Family of very deep CNN image classifiers (18 to 152 layers) built from residual blocks $y = \mathcal{F}(x, \{W_i\}) + x$ that reformulate e…

→ compared with GoogLeNet·HRNet·ViT

AlexNet

Model·Image classification·2012

Eight-layer convolutional neural network for 1000-class image classification on ImageNet, trained end-to-end on two GPUs with ReLU activatio…

→ extended by VGG

GoogLeNet

Model·Image classification·2015

Twenty-two-layer CNN built from Inception modules — parallel 1×1, 3×3, 5×5 convolutions and 3×3 max-pool concatenated along the channel axis…

→ compared with VGG·AlexNet·ResNet

VGG

Model·Image classification·2014

Family of very deep CNN image classifiers (11 to 19 weight layers) built from stacked 3×3 convolutions with stride 1 and 2×2 max-pool stride…

→ extended by ResNet

MATE

Model·Corner detection·2016

First learned per-pixel checkerboard X-corner detector: a three-convolutional-layer CNN with 2,939 parameters trained with mean-squared-erro…

→ compared with CCDN·CCS

SuperPoint

Model·Feature detection·2018

Fully-convolutional CNN that jointly detects interest points and computes 256-D descriptors in a single forward pass, trained without human …

→ compared with XFeat

CCDN

Model·Corner detection·2023

Fully convolutional network that regresses a per-pixel checkerboard-corner response map; trained with weighted cross-entropy and paired with…

→ compared with CCS·MATE

XFeat

Model·Feature detection·2024

Lightweight CNN that jointly detects keypoints, extracts 64-D dense descriptors, and refines semi-dense matches from coarse descriptor pairs…

→ compared with LoFTR·SuperPoint

NEW Depth Anything

Model·Depth·2024

A monocular relative-depth foundation model that scales training to 62M unlabeled images via a teacher pseudo-labeling data engine, a CutMix…

→ extended by Depth Anything

NEW Depth Anything 3

Model·Geometry·2025

A single plain-transformer model that predicts spatially consistent geometry — depth plus camera rays — from one to many images, with or wit…

NEW Depth Anything V2

Model·Depth·2024

A monocular depth foundation model that trains its teacher purely on synthetic images for label precision, then distills to a student over 6…

→ extended by Depth Anything

NEW DINOv2

Model·Features·2023

A self-supervised ViT trained on a curated 142M-image dataset that yields general-purpose visual features usable frozen — via kNN or linear …

NEW DUSt3R

Model·Geometry·2023

A feed-forward network that regresses two dense pointmaps in a shared coordinate frame from an uncalibrated, unposed image pair, jointly rec…

→ extended by MASt3R

NEW VGGT (Visual Geometry Grounded Transformer)

Model·Geometry·2025

A large feed-forward transformer that predicts cameras, depth maps, point maps and 3D point tracks for one to hundreds of views in a single …

→ extended by Depth Anything

MobileSAM

Model·Image segmentation·2023

Lightweight SAM family — replaces SAM's heavy ViT-H image encoder (632M params, ~452 ms on a single GPU) with a distilled TinyViT encoder (5…

→ compared with FocalClick

CCS

Model·Camera calibration·2022

Three-stage learning-based camera calibration pipeline: a CNN regresses radial-distortion-correction parameters, a UNet predicts per-corner …

→ compared with CCDN·MATE

NEW MASt3R

Model·Feature matching·2024

A 3D-grounded image matcher that adds a dense local-descriptor head and an InfoNCE matching loss on top of DUSt3R's pointmap regression, wit…

→ compared with LoFTR·LightGlue

LightGlue

Model·Feature matching·2023

Adaptive-depth Transformer matcher for sparse local features: stacks 9 self+cross-attention layers with rotary positional encoding and a per…

→ compared with LoFTR·MASt3R

LoFTR

Model·Feature matching·2021

Detector-free dense feature matcher: shared CNN backbone produces coarse and fine feature maps, a Linear Transformer with interleaved self- …

→ compared with XFeat·LightGlue·MASt3R (+1 more)

SuperGlue

Model·Feature matching·2020

Graph neural network that matches two sets of sparse local features by jointly finding correspondences and rejecting unmatched keypoints in …

→ extended by LightGlue

Concepts 28

Attention Mechanism

Concept·Features·2020

Computes each output element as a learned, input-dependent weighted average of value vectors, letting every element aggregate information fr…

Convolution

Concept·Image formation·1986

The linear, shift-invariant operation that produces each output pixel as a kernel-weighted sum of input pixels in a local neighbourhood.
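A minimal sketch of the kernel-weighted sum described above, assuming a grayscale image stored as nested lists and a "valid" output region with no padding. The function name `convolve2d` is illustrative and does not come from the atlas entry.

```python
def convolve2d(image, kernel):
    """'Valid' 2-D convolution of nested lists (no padding)."""
    kh, kw = len(kernel), len(kernel[0])
    ih, iw = len(image), len(image[0])
    # True convolution flips the kernel in both axes before sliding it.
    flipped = [row[::-1] for row in kernel[::-1]]
    out = []
    for r in range(ih - kh + 1):
        out_row = []
        for c in range(iw - kw + 1):
            acc = 0.0
            # Each output pixel is a weighted sum over a local neighbourhood.
            for i in range(kh):
                for j in range(kw):
                    acc += flipped[i][j] * image[r + i][c + j]
            out_row.append(acc)
        out.append(out_row)
    return out


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
box = [[0.25, 0.25],
       [0.25, 0.25]]          # 2x2 averaging kernel
smoothed = convolve2d(image, box)
```

The same kernel is applied at every position, which is what makes the operation shift-invariant. Many deep-learning libraries skip the flip and compute cross-correlation instead; for symmetric kernels such as the box filter here the two coincide.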

Energy Minimization

Concept·Segmentation·2001

The framework that poses image labelling and segmentation as minimising an objective combining a per-pixel data term and a pairwise smoothne…

Image Pyramid

Concept·Image formation·2004

A discrete multi-resolution representation — a sequence of images at progressively coarser resolution, each smoothed and downsampled from it…

Integral Image

Concept·Features·1984

A precomputed prefix-sum array that returns the sum of pixel values over any axis-aligned rectangle in constant time with four array reads.
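The four-read lookup can be sketched as follows, assuming an image stored as nested lists and a prefix-sum table padded with one leading row and column of zeros. The names `integral_image` and `rect_sum` and the half-open rectangle convention are choices made for this illustration.

```python
def integral_image(image):
    """Prefix-sum table with an extra zero row and column at the top-left."""
    h, w = len(image), len(image[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        row_sum = 0
        for c in range(w):
            row_sum += image[r][c]
            # Sum of everything above plus the running sum of this row.
            ii[r + 1][c + 1] = ii[r][c + 1] + row_sum
    return ii


def rect_sum(ii, top, left, bottom, right):
    """Sum over rows top..bottom-1 and columns left..right-1 (four reads)."""
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
ii = integral_image(image)
lower_right = rect_sum(ii, 1, 1, 3, 3)   # 5 + 6 + 8 + 9
total = rect_sum(ii, 0, 0, 3, 3)
```

Building the table costs one pass over the image; afterwards the cost of a rectangle sum does not depend on the rectangle's size.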

Non-Maximum Suppression

Concept·Features·1986

Reducing a dense response map or a set of overlapping detections to a sparse set of local maxima by discarding every element that is not str…

Pinhole Camera Model

Concept·Image formation·2000

The projective map from 3-D scene points to 2-D image pixels through a single centre of projection, parameterised by an intrinsic matrix and…

SVD Null-Space Estimation

Concept·Geometry·1981

Estimating a geometric entity defined only up to scale by stacking constraints into a homogeneous linear system and taking the smallest righ…

RANSAC

Concept·Geometry

Random sample consensus — a paradigm for fitting a parametric model to data containing an unknown fraction of gross outliers, by drawing min…

Topological Grid Recovery

Concept·Features

Verify candidate calibration-pattern corners by constructing a graph over them (Delaunay triangulation, k-nearest-neighbours, or proximity) …

NEW Monocular Depth Estimation

Concept·Depth·2019

Predicting per-pixel scene depth from a single image — the scale ambiguity that forces relative (affine-invariant) versus metric formulation…

Bundle Adjustment

Concept·Calibration·2000

Joint nonlinear least-squares refinement of all camera parameters — and, in structure-from-motion, all 3-D points — that minimises the total…

Convolutional Neural Network

Concept·Features·2012

A feed-forward network that builds a spatial hierarchy of learned features by alternating weight-shared convolution layers, pointwise nonlin…

Pose Estimation

Concept·Geometry·2009

Recovery of the 6-DOF rigid transformation — rotation and translation — relating a camera to a scene, an object, or a second camera.

Camera Distortion Models

Concept·Image formation

Mathematical models for departures from the ideal pinhole projection — radial barrel/pincushion, tangential decentering, thin-prism — and th…

DLT Normalisation

Concept·Geometry

A two-line similarity transform — translate the point centroid to the origin, isotropically scale so the average distance is √2 — that condi…
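The two steps named in the entry (centroid to the origin, then isotropic scaling to an average distance of √2) might look like this in plain Python. This is a sketch under stated assumptions: points are `(x, y)` tuples, at least two of them are distinct, and the function name `normalise_points` is hypothetical.

```python
import math


def normalise_points(points):
    """Return normalised points and the 3x3 similarity matrix T that maps
    homogeneous (x, y, 1) to the normalised frame."""
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    # Mean Euclidean distance from the centroid (assumed non-zero).
    mean_dist = sum(math.hypot(x - cx, y - cy) for x, y in points) / n
    s = math.sqrt(2) / mean_dist
    T = [[s, 0.0, -s * cx],
         [0.0, s, -s * cy],
         [0.0, 0.0, 1.0]]
    normalised = [(s * (x - cx), s * (y - cy)) for x, y in points]
    return normalised, T


pixels = [(100, 200), (300, 200), (300, 400), (100, 400)]
normalised, T = normalise_points(pixels)
```

An estimate computed in the normalised frame has to be mapped back through `T` afterwards; the exact de-normalisation formula depends on whether the entity is a homography or a fundamental matrix.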

Epipolar Geometry

Concept·Geometry

The intrinsic projective geometry of two views of a scene, encoding the constraint that a point visible in one image must lie on a specific …

Homography

Concept·Geometry

An invertible projective transformation of the plane, represented by a 3×3 matrix defined up to a non-zero scalar, mapping points between tw…

Image Gradient

Concept·Features

The 2-vector of partial derivatives of image intensity with respect to spatial coordinates, measuring the rate and direction of brightness c…

Scale Space

Concept·Image formation

A one-parameter family of images obtained by progressively blurring an input image with Gaussians of increasing standard deviation, providin…

NEW Feed-Forward 3D Reconstruction

Concept·Geometry·2023

Recovering 3D geometry — point maps, depth, and camera poses — directly from images in a single network pass, replacing the detect-match-tri…

Feature Descriptors

Concept·Features·2004

Fixed-length vectors encoding the local image appearance around a keypoint, built so the same physical point yields similar descriptors acro…

Chessboard X-Corner Detection

Concept·Features

Twenty-five years of methods for finding the inner corners of a planar checkerboard calibration target — from Harris-on-thresholded-images t…

Hessian Saddle Response

Concept·Features

A scalar response computed from the determinant of the image Hessian, negative at saddle points (X-corners) and zero at flat regions, edges,…

Spatially Varying Image Stitching

Concept·Geometry

A 2011–2013 lineage of stitching methods that replace the single global homography with a spatially varying warp field — fitted as either tw…

Structure Tensor

Concept·Features

A symmetric 2×2 matrix formed by summing the outer products of the image gradient over a local window, encoding the dominant orientation and…

Feature Matching

Concept·Features·2004

Establishing keypoint correspondences between two images by comparing descriptors and resolving them into a consistent partial assignment — …

Optical Flow

Concept·Features·1981

The apparent 2-D velocity field of image brightness between consecutive frames, recovered from the spatio-temporal gradient under the bright…