BEV-Seg: Bird's Eye View Semantic Segmentation Using Geometry and Semantic Point Cloud
Mong H. Ng, Kaahan Radia, Jianfei Chen, Dequan Wang, Ionel Gog, and, Joseph E. Gonzalez

TL;DR
This paper introduces a novel two-stage perception pipeline for bird's-eye-view semantic segmentation from side RGB images, leveraging explicit depth prediction and geometric features to improve accuracy and domain transferability.
Contribution
The work presents a new pipeline that explicitly predicts pixel depths and uses geometric features, outperforming existing methods in BEV segmentation and domain transfer.
Findings
Improves state-of-the-art by 24% mIoU on BEVSEG-Carla dataset.
Effective domain transfer with high accuracy in unseen environments.
Introduces a new dataset for BEV segmentation in simulated driving scenes.
Abstract
Bird's-eye-view (BEV) is a powerful and widely adopted representation for road scenes that captures surrounding objects and their spatial locations, along with overall context in the scene. In this work, we focus on bird's eye semantic segmentation, a task that predicts pixel-wise semantic segmentation in BEV from side RGB images. This task is made possible by simulators such as Carla, which allow for cheap data collection, arbitrary camera placements, and supervision in ways otherwise not possible in the real world. There are two main challenges to this task: the view transformation from side view to bird's eye view, as well as transfer learning to unseen domains. Existing work transforms between views through fully connected layers and transfer learns via GANs. This suffers from a lack of depth reasoning and performance degradation across domains. Our novel 2-staged perception…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications
