TrajVG: 3D Trajectory-Coupled Visual Geometry Learning
Xingyu Miao, Weiguang Zhao, Tao Lu, Linning Xu, Mulin Yu, Yang Long, Jiangmiao Pang, Junting Dong

TL;DR
TrajVG introduces a novel 3D trajectory-based framework for multi-frame 3D reconstruction that improves accuracy and robustness in videos with object motion by explicitly modeling cross-frame correspondences.
Contribution
It proposes a new method that predicts 3D trajectories to enhance multi-frame 3D reconstruction, incorporating geometric consistency and self-supervised learning from pseudo 2D tracks.
Findings
Outperforms existing feedforward models in 3D tracking and pose estimation.
Reduces drift and misalignment in dynamic scenes.
Achieves superior pointmap reconstruction and video depth results.
Abstract
Feed-forward multi-frame 3D reconstruction models often degrade on videos with object motion. Global-reference becomes ambiguous under multiple motions, while the local pointmap relies heavily on estimated relative poses and can drift, causing cross-frame misalignment and duplicated structures. We propose TrajVG, a reconstruction framework that makes cross-frame 3D correspondence an explicit prediction by estimating camera-coordinate 3D trajectories. We couple sparse trajectories, per-frame local point maps, and relative camera poses with geometric consistency objectives: (i) bidirectional trajectory-pointmap consistency with controlled gradient flow, and (ii) a pose consistency objective driven by static track anchors that suppresses gradients from dynamic regions. To scale training to in-the-wild videos where 3D trajectory labels are scarce, we reformulate the same coupling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
