VGGT-CD: Training-Free Robust Registration for 3D Change Detection

Wei Zhang (1); Songhua Li (1); Yihang Wu (1); Qiang Li (1); Qi Wang (1) ((1) Northwestern Polytechnical University; Xi'an; China)

arXiv:2605.16859·cs.CV·May 19, 2026

VGGT-CD: Training-Free Robust Registration for 3D Change Detection

Wei Zhang (1), Songhua Li (1), Yihang Wu (1), Qiang Li (1), Qi Wang (1) ((1) Northwestern Polytechnical University, Xi'an, China)

PDF

TL;DR

VGGT-CD is a training-free, robust 3D change detection pipeline that improves registration accuracy and efficiency using a decoupled, multi-stage approach with theoretical guarantees.

Contribution

It introduces a novel training-free method that separates registration from change detection, addressing scale ambiguity and noise issues in 3D scene analysis.

Findings

01

Reduces Absolute Trajectory Error by 44% outdoors and 59% indoors.

02

Over 6 times faster registration process.

03

Produces high-purity 3D change maps without task-specific training.

Abstract

3D change detection from multi-view images is essential for urban monitoring, disaster assessment, and autonomous driving. However, existing methods predominantly operate in the 2D domain, where viewpoint variations are mistaken for physical changes and depth is unavailable. While visual geometry foundation models like VGGT rapidly produce dense point clouds from unposed images, independent per-epoch reconstruction encounters fundamental obstacles: unpredictable inter-epoch scale ambiguity, registration-change paradox where scene changes corrupt alignment, and pervasive edge-flying noise. To address these challenges, we present VGGT-CD, a training-free pipeline decoupling cross-temporal registration from dynamic-change interference. In the Coarse Stage, sparse keyframe joint inference establishes a unified metric space and yields an initial Sim(3) prior. In the Fine Stage, dense…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.