From Photos to 3D
Structure from Motion (SfM) is the algorithmic backbone of modern photogrammetry. Given a collection of overlapping photographs — potentially unordered and taken with unknown cameras — SfM automatically detects visual features, matches them across images, estimates camera poses, and triangulates a sparse 3D point cloud. The process loosely mirrors how human vision recovers depth from motion parallax: the way scene points shift against one another as the viewpoint moves encodes their 3D structure.
Feature Detection and Matching
SfM begins by detecting distinctive visual features (corners, blobs) in each image using algorithms like SIFT or SuperPoint. These features are encoded as high-dimensional descriptors and matched across image pairs. RANSAC-based geometric verification eliminates false matches by fitting the fundamental matrix — only matches consistent with a valid geometric relationship survive. The quality of these matches directly determines reconstruction accuracy.
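The descriptor-matching step can be sketched in a few lines. Below is a minimal nearest-neighbour matcher with Lowe's ratio test (the standard heuristic used with SIFT descriptors); the descriptors here are tiny synthetic vectors rather than real 128-D SIFT output, and the geometric verification stage is omitted:

```python
import numpy as np

def match_descriptors(desc_a, desc_b, ratio=0.75):
    """Nearest-neighbour matching with Lowe's ratio test.

    desc_a, desc_b: (N, D) arrays of feature descriptors (e.g. 128-D SIFT).
    Returns a list of (i, j) index pairs that pass the ratio test.
    """
    matches = []
    for i, d in enumerate(desc_a):
        # Squared Euclidean distance to every descriptor in image B.
        dists = np.sum((desc_b - d) ** 2, axis=1)
        j1, j2 = np.argsort(dists)[:2]
        # Ratio test: the best match must be clearly better than the runner-up,
        # otherwise the feature is ambiguous and the match is discarded.
        if dists[j1] < ratio ** 2 * dists[j2]:
            matches.append((i, j1))
    return matches

# Toy example: three 4-D descriptors, permuted and slightly perturbed.
rng = np.random.default_rng(0)
a = rng.normal(size=(3, 4))
b = a[[1, 0, 2]] + rng.normal(scale=0.01, size=(3, 4))
print(match_descriptors(a, b))  # [(0, 1), (1, 0), (2, 2)]
```

In a real pipeline these raw matches would then be fed to RANSAC-based fundamental-matrix estimation, as described above, to discard the remaining outliers.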
Incremental Reconstruction
Most SfM pipelines build the reconstruction incrementally: starting from a well-matched initial pair, they triangulate an initial point cloud, then register additional cameras one by one. Each new camera is localized against existing 3D points (Perspective-n-Point, PnP), new points are triangulated from the new viewpoint, and bundle adjustment periodically refines everything. This incremental approach scales to thousands of images but can accumulate drift over long sequences.
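The triangulation step at the core of this loop can be illustrated with the classic linear (DLT) method: each observation of a point contributes two rows to a homogeneous system, whose least-squares solution is the right singular vector with the smallest singular value. This is a minimal sketch with identity intrinsics and noise-free observations, not a production triangulator:

```python
import numpy as np

def triangulate_dlt(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one point from two views.

    P1, P2: (3, 4) camera projection matrices.
    x1, x2: (2,) observations of the same 3D point in normalized image coords.
    Returns the 3D point in Euclidean coordinates.
    """
    # Each view contributes two rows of the homogeneous system A X = 0.
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # Least-squares null vector of A via SVD.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]  # dehomogenize

# Toy setup: identity-intrinsics cameras separated by a unit baseline in x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])
X_true = np.array([0.5, 0.2, 4.0])

def project(P, X):
    h = P @ np.append(X, 1.0)
    return h[:2] / h[2]

X_est = triangulate_dlt(P1, P2, project(P1, X_true), project(P2, X_true))
print(np.allclose(X_est, X_true))  # True: the noise-free case is recovered exactly
```

With noisy observations the DLT answer is only an algebraic approximation, which is one reason periodic bundle adjustment (below) is essential.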
Bundle Adjustment
Bundle adjustment is the mathematical heart of SfM — a massive nonlinear optimization that simultaneously refines all camera parameters and 3D point positions to minimize total reprojection error. The Jacobian of this system is extremely sparse (each observation involves only one camera and one point), enabling efficient solution via the Schur complement. Modern solvers like Ceres handle millions of observations in seconds, making high-quality reconstruction practical even on consumer hardware.
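The Schur-complement trick that exploits this sparsity can be sketched on a toy dense system (the sizes and values here are illustrative, not from a real reconstruction). The normal equations split into a camera block B, a block-diagonal point block C that is cheap to invert, and cross terms E; eliminating the points first leaves a much smaller reduced camera system:

```python
import numpy as np

# Normal equations of one Gauss-Newton step in bundle adjustment:
#   [B   E] [dc]   [v]
#   [E'  C] [dp] = [w]
# B couples cameras, C couples points (block-diagonal), E holds cross terms.
rng = np.random.default_rng(1)
nc, npts = 4, 9                       # toy camera- and point-parameter dims
B = np.eye(nc) * 5 + rng.normal(size=(nc, nc)) * 0.1
B = B @ B.T                           # make the camera block symmetric PD
C = np.diag(rng.uniform(1, 2, npts))  # point block: trivially invertible
E = rng.normal(size=(nc, npts)) * 0.1
v = rng.normal(size=nc)
w = rng.normal(size=npts)

# Reduced camera system: (B - E C^-1 E') dc = v - E C^-1 w
Cinv = np.diag(1.0 / np.diag(C))
S = B - E @ Cinv @ E.T                # the Schur complement
dc = np.linalg.solve(S, v - E @ Cinv @ w)
dp = Cinv @ (w - E.T @ dc)            # back-substitute for the point updates

# Sanity check against solving the full system directly.
full = np.block([[B, E], [E.T, C]])
ref = np.linalg.solve(full, np.concatenate([v, w]))
print(np.allclose(np.concatenate([dc, dp]), ref))  # True
```

The payoff is that the expensive factorization is performed only on the small camera system S, while the (much larger) point block is handled by cheap per-point inversions; this is essentially what solvers such as Ceres do with their sparse Schur linear solvers.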