RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye   View Segmentation

Henrique Pi\~neiro Monteagudo; Leonardo Taccari; Aurel Pjetri,; Francesco Sambo; Samuele Salti

arXiv:2502.14792·cs.CV·April 11, 2025

RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation

Henrique Pi\~neiro Monteagudo, Leonardo Taccari, Aurel Pjetri,, Francesco Sambo, Samuele Salti

PDF

Open Access

TL;DR

RendBEV introduces a self-supervised approach for bird's eye view semantic segmentation using differentiable volumetric rendering, enabling zero-shot performance and improving low-annotation regime results.

Contribution

The paper presents RendBEV, a novel self-supervised training method for BEV segmentation that leverages volumetric rendering and semantic perspective views, reducing reliance on annotated data.

Findings

01

Achieves competitive zero-shot BEV segmentation results.

02

Significantly improves performance when fine-tuned with limited labels.

03

Sets new state-of-the-art results with full labeled data.

Abstract

Bird's Eye View (BEV) semantic maps have recently garnered a lot of attention as a useful representation of the environment to tackle assisted and autonomous driving tasks. However, most of the existing work focuses on the fully supervised setting, training networks on large annotated datasets. In this work, we present RendBEV, a new method for the self-supervised training of BEV semantic segmentation networks, leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. Our method enables zero-shot BEV semantic segmentation, and already delivers competitive results in this challenging setting. When used as pretraining to then fine-tune on labeled BEV ground-truth, our method significantly boosts performance in low-annotation regimes, and sets a new state of the art when fine-tuning on all available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsSoftmax · Attention Is All You Need