TL;DR
Online3R is an online learning framework for sequential 3D reconstruction that adapts to new scenes using lightweight prompts and self-supervised strategies, improving consistency and efficiency.
Contribution
It introduces a novel online learning approach with lightweight prompts and self-supervised constraints to enhance scene reconstruction consistency in real-time.
Findings
Outperforms previous state-of-the-art methods on various benchmarks.
Effectively adapts to new environments with minimal computational overhead.
Maintains high reconstruction quality through local and global consistency constraints.
Abstract
We present Online3R, a new sequential reconstruction framework that is capable of adapting to new scenes through online learning, effectively resolving inconsistency issues. Specifically, we introduce a set of learnable lightweight visual prompts into a pretrained, frozen geometry foundation model to capture the knowledge of new environments while preserving the fundamental capability of the foundation model for geometry prediction. To solve the problems of missing groundtruth and the requirement of high efficiency when updating these visual prompts at test time, we introduce a local-global self-supervised learning strategy by enforcing the local and global consistency constraints on predictions. The local consistency constraints are conducted on intermediate and previously local fused results, enabling the model to be trained with high-quality pseudo groundtruth signals; the global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
