OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping
Jiale Wei, Junwei Zheng, Ruiping Liu, Jie Hu, Jiaming Zhang, and, Rainer Stiefelhagen

TL;DR
OneBEV introduces a novel approach for bird's-eye-view semantic mapping using a single panoramic image, employing a distortion-aware module to handle spatial distortions and achieve state-of-the-art results in autonomous driving scenarios.
Contribution
The paper presents a new BEV mapping method that simplifies the process by using only one panoramic image and a specialized distortion-aware module, along with two new datasets.
Findings
Achieves 51.1% mIoU on nuScenes-360
Achieves 36.1% mIoU on DeepAccident-360
Outperforms existing methods in BEV semantic mapping
Abstract
In the field of autonomous driving, Bird's-Eye-View (BEV) perception has attracted increasing attention in the community since it provides more comprehensive information compared with pinhole front-view images and panoramas. Traditional BEV methods, which rely on multiple narrow-field cameras and complex pose estimations, often face calibration and synchronization issues. To break the wall of the aforementioned challenges, in this work, we introduce OneBEV, a novel BEV semantic mapping approach using merely a single panoramic image as input, simplifying the mapping process and reducing computational complexities. A distortion-aware module termed Mamba View Transformation (MVT) is specifically designed to handle the spatial distortions in panoramas, transforming front-view features into BEV features without leveraging traditional attention mechanisms. Apart from the efficient framework,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage · Robotics and Sensor-Based Localization
MethodsSoftmax · Attention Is All You Need · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
