TL;DR
This paper introduces a new approach for 3D reconstruction from disjoint views, leveraging foundation models to synthesize intermediate perspectives and improve geometric accuracy in scenarios lacking visual overlap.
Contribution
It proposes GLADOS, a modular framework for generative reconstruction from disjoint views, and establishes benchmarks and metrics for zero-overlap 3D reconstruction tasks.
Findings
Existing methods fail on disjoint view reconstruction, producing incoherent geometries.
GLADOS effectively synthesizes intermediate views to connect disjoint inputs.
The framework improves geometric coherence in zero-overlap scenarios.
Abstract
3D vision systems are fundamentally constrained by their reliance on visual overlap: reconstruction methods require it for geometric alignment, while generative models use it to enforce multi-view consistency. This limitation is particularly acute in real-world scenarios such as distributed swarm robotics or crowd-sourced data collection, where capturing overlapping perspectives, both in terms of spatial and appearance overlap, is often impossible. We introduce Generative Reconstruction from Disjoint Views as a new paradigm, establish a comprehensive dataset, and propose specialized evaluation metrics for zero-overlap scenarios. Our benchmarking demonstrates that existing state-of-the-art methods fail catastrophically on this task, producing disconnected geometries or semantically incoherent reconstructions. To address these limitations, we propose GLADOS, a general, modular framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
