Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images
Nikhil Gosala, Abhinav Valada

TL;DR
This paper introduces the first monocular frontal view to Bird's-Eye-View panoptic segmentation method, using a novel dense transformer architecture to improve scene understanding and object identification in BEV space.
Contribution
It presents a novel BEV panoptic segmentation approach from a single monocular image, incorporating a dense transformer module and a sensitivity-based pixel weighting scheme.
Findings
Outperforms state-of-the-art in PQ metric on KITTI-360 and nuScenes datasets.
Introduces a novel dense transformer module for BEV mapping.
Provides a mathematical formulation for FV-BEV transformation sensitivity.
Abstract
Bird's-Eye-View (BEV) maps have emerged as one of the most powerful representations for scene understanding due to their ability to provide rich spatial context while being easy to interpret and process. Such maps have found use in many real-world tasks that extensively rely on accurate scene segmentation as well as object instance identification in the BEV space for their operation. However, existing segmentation algorithms only predict the semantics in the BEV space, which limits their use in applications where the notion of object instances is also critical. In this work, we present the first BEV panoptic segmentation approach for directly predicting dense panoptic segmentation maps in the BEV, given a single monocular image in the frontal view (FV). Our architecture follows the top-down paradigm and incorporates a novel dense transformer module consisting of two distinct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Face Recognition and Perception
