Benchmarking and Improving Bird's Eye View Perception Robustness in   Autonomous Driving

Shaoyuan Xie; Lingdong Kong; Wenwei Zhang; Jiawei Ren and; Liang Pan; Kai Chen; Ziwei Liu

arXiv:2405.17426·cs.CV·February 4, 2025

Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving

Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren and, Liang Pan, Kai Chen, Ziwei Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces RoboBEV, a comprehensive benchmark suite for evaluating the robustness of bird's eye view perception models in autonomous driving under diverse conditions, and proposes strategies to enhance their resilience.

Contribution

We present RoboBEV, the first extensive benchmark for BEV perception robustness, and demonstrate effective methods like pre-training and temporal information to improve model resilience.

Findings

01

Model performance correlates with robustness to out-of-distribution data.

02

Pre-training and depth-free BEV transformations enhance robustness.

03

Temporal information significantly improves model resilience.

Abstract

Recent advancements in bird's eye view (BEV) representations have shown remarkable promise for in-vehicle 3D perception. However, while these methods have achieved impressive results on standard benchmarks, their robustness in varied conditions remains insufficiently assessed. In this study, we present RoboBEV, an extensive benchmark suite designed to evaluate the resilience of BEV algorithms. This suite incorporates a diverse set of camera corruption types, each examined over three severity levels. Our benchmarks also consider the impact of complete sensor failures that occur when using multi-modal models. Through RoboBEV, we assess 33 state-of-the-art BEV-based perception models spanning tasks like detection, map segmentation, depth estimation, and occupancy prediction. Our analyses reveal a noticeable correlation between the model's performance on in-distribution datasets and its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Daniel-xsy/RoboBEV
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Robotics and Sensor-Based Localization · Autonomous Vehicle Technology and Safety

MethodsSparse Evolutionary Training · Contrastive Language-Image Pre-training