DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast   Scenes

Hao Li; Yuanyuan Gao; Haosong Peng; Chenming Wu; Weicai Ye; Yufeng; Zhan; Chen Zhao; Dingwen Zhang; Jingdong Wang; Junwei Han

arXiv:2411.12309·cs.CV·November 21, 2024

DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast Scenes

Hao Li, Yuanyuan Gao, Haosong Peng, Chenming Wu, Weicai Ye, Yufeng, Zhan, Chen Zhao, Dingwen Zhang, Jingdong Wang, Junwei Han

PDF

Open Access

TL;DR

DGTR introduces a distributed Gaussian-based framework for efficient, high-quality sparse-view reconstruction of large scenes, significantly reducing training time and improving scalability in novel-view synthesis tasks.

Contribution

The paper proposes a novel distributed Gaussian reconstruction method that enables fast, scalable, and high-quality scene reconstruction from sparse views, addressing limitations of existing methods.

Findings

01

Achieves high-quality large-scale scene reconstruction within minutes.

02

Outperforms existing approaches in speed and scalability.

03

Effectively integrates synthetic views and depth priors for enhanced results.

Abstract

Novel-view synthesis (NVS) approaches play a critical role in vast scene reconstruction. However, these methods rely heavily on dense image inputs and prolonged training times, making them unsuitable where computational resources are limited. Additionally, few-shot methods often struggle with poor reconstruction quality in vast environments. This paper presents DGTR, a novel distributed framework for efficient Gaussian reconstruction for sparse-view vast scenes. Our approach divides the scene into regions, processed independently by drones with sparse image inputs. Using a feed-forward Gaussian model, we predict high-quality Gaussian primitives, followed by a global alignment algorithm to ensure geometric consistency. Synthetic views and depth priors are incorporated to further enhance training, while a distillation-based model aggregation mechanism enables efficient reconstruction. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Medical Image Segmentation Techniques · Computer Graphics and Visualization Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings