OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow
Simon Boeder, Fabian Gigengack, Benjamin Risse

TL;DR
This paper introduces OccFlowNet, a self-supervised method for 3D occupancy estimation using differentiable rendering and occupancy flow, reducing reliance on costly 3D labels while achieving state-of-the-art results.
Contribution
It proposes a novel self-supervised approach leveraging 2D labels, differentiable volumetric rendering, and occupancy flow to improve 3D occupancy estimation accuracy.
Findings
Achieves state-of-the-art performance with only 2D supervision.
Outperforms methods using 3D labels when combined with additional techniques.
Significantly surpasses previous occupancy estimation models.
Abstract
Semantic occupancy has recently gained significant traction as a prominent 3D scene representation. However, most existing methods rely on large and costly datasets with fine-grained 3D voxel labels for training, which limits their practicality and scalability, increasing the need for self-monitored learning in this domain. In this work, we present a novel approach to occupancy estimation inspired by neural radiance field (NeRF) using only 2D labels, which are considerably easier to acquire. In particular, we employ differentiable volumetric rendering to predict depth and semantic maps and train a 3D network based on 2D supervision only. To enhance geometric accuracy and increase the supervisory signal, we introduce temporal rendering of adjacent time steps. Additionally, we introduce occupancy flow as a mechanism to handle dynamic objects in the scene and ensure their temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Data Quality and Management · Human Mobility and Location-Based Analysis
