One at a Time: Progressive Multi-step Volumetric Probability Learning for Reliable 3D Scene Perception
Bohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin,, Wenjun Zeng

TL;DR
This paper introduces a progressive multi-step diffusion-based framework for 3D scene perception, improving volumetric probability learning in challenging regions and surpassing LiDAR methods on SemanticKITTI.
Contribution
It proposes a novel multi-step diffusion framework, VPD, for refining 3D volumetric probabilities, addressing limitations of single-step methods in complex scene regions.
Findings
Outperforms existing methods in multi-view stereo and semantic scene completion.
First to surpass LiDAR-based methods on SemanticKITTI for SSC.
Demonstrates improved accuracy in occluded and reflective regions.
Abstract
Numerous studies have investigated the pivotal role of reliable 3D volume representation in scene perception tasks, such as multi-view stereo (MVS) and semantic scene completion (SSC). They typically construct 3D probability volumes directly with geometric correspondence, attempting to fully address the scene perception tasks in a single forward pass. However, such a single-step solution makes it hard to learn accurate and convincing volumetric probability, especially in challenging regions like unexpected occlusions and complicated light reflections. Therefore, this paper proposes to decompose the complicated 3D volume representation learning into a sequence of generative steps to facilitate fine and reliable scene perception. Considering the recent advances achieved by strong generative diffusion models, we introduce a multi-step learning framework, dubbed as VPD, dedicated to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
MethodsDiffusion
