Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
Junsheng Zhou, Yu-Shen Liu, Zhizhong Han

TL;DR
This paper introduces deep prior assembly, a zero-shot framework that combines large model priors for accurate scene reconstruction from single images without additional training.
Contribution
It proposes a novel method to assemble diverse deep priors for scene reconstruction, eliminating the need for 3D or 2D data-driven training.
Findings
Demonstrates superior performance on various datasets.
Achieves robust scene reconstruction without extra data.
Outperforms recent methods in visual and numerical evaluations.
Abstract
Large language and vision models have been leading a revolution in visual computing. By greatly scaling up sizes of data and model parameters, the large models learn deep priors which lead to remarkable performance in various tasks. In this work, we present deep prior assembly, a novel framework that assembles diverse deep priors from large models for scene reconstruction from single images in a zero-shot manner. We show that this challenging task can be done without extra knowledge but just simply generalizing one deep prior in one sub-task. To this end, we introduce novel methods related to poses, scales, and occlusion parsing which are keys to enable deep priors to work together in a robust way. Deep prior assembly does not require any 3D or 2D data-driven training in the task and demonstrates superior performance in generalizing priors to open-world scenes. We conduct evaluations on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced X-ray and CT Imaging · Medical Imaging Techniques and Applications · Advanced Optical Sensing Technologies
