VGGT-CD: Training-Free Robust Registration for 3D Change Detection
Wei Zhang (1), Songhua Li (1), Yihang Wu (1), Qiang Li (1), Qi Wang (1) ((1) Northwestern Polytechnical University, Xi'an, China)

TL;DR
VGGT-CD is a training-free, robust 3D change detection pipeline that improves registration accuracy and efficiency using a decoupled, multi-stage approach with theoretical guarantees.
Contribution
It introduces a novel training-free method that separates registration from change detection, addressing scale ambiguity and noise issues in 3D scene analysis.
Findings
Reduces Absolute Trajectory Error by 44% outdoors and 59% indoors.
Over 6 times faster registration process.
Produces high-purity 3D change maps without task-specific training.
Abstract
3D change detection from multi-view images is essential for urban monitoring, disaster assessment, and autonomous driving. However, existing methods predominantly operate in the 2D domain, where viewpoint variations are mistaken for physical changes and depth is unavailable. While visual geometry foundation models like VGGT rapidly produce dense point clouds from unposed images, independent per-epoch reconstruction encounters fundamental obstacles: unpredictable inter-epoch scale ambiguity, registration-change paradox where scene changes corrupt alignment, and pervasive edge-flying noise. To address these challenges, we present VGGT-CD, a training-free pipeline decoupling cross-temporal registration from dynamic-change interference. In the Coarse Stage, sparse keyframe joint inference establishes a unified metric space and yields an initial Sim(3) prior. In the Fine Stage, dense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
