Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop   End-To-End Autonomous Driving

Xiaosong Jia; Zhenjie Yang; Qifeng Li; Zhiyuan Zhang; Junchi Yan

arXiv:2406.03877·cs.RO·November 28, 2024·2 cites

Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving

Xiaosong Jia, Zhenjie Yang, Qifeng Li, Zhiyuan Zhang, Junchi Yan

PDF

Open Access 4 Repos 1 Models 1 Video

TL;DR

Bench2Drive introduces a comprehensive, fair, and realistic benchmark for evaluating multi-ability, closed-loop end-to-end autonomous driving systems across diverse scenarios, weather, and locations.

Contribution

It is the first benchmark to evaluate E2E-AD systems' multiple abilities in a realistic closed-loop setting with extensive, diverse data and standardized evaluation protocols.

Findings

01

Current models show varied performance across scenarios.

02

Benchmark reveals strengths and weaknesses of state-of-the-art E2E-AD models.

03

Provides a foundation for future research and development in autonomous driving.

Abstract

In an era marked by the rapid scaling of foundation models, autonomous driving technologies are approaching a transformative threshold where end-to-end autonomous driving (E2E-AD) emerges due to its potential of scaling up in the data-driven manner. However, existing E2E-AD methods are mostly evaluated under the open-loop log-replay manner with L2 errors and collision rate as metrics (e.g., in nuScenes), which could not fully reflect the driving performance of algorithms as recently acknowledged in the community. For those E2E-AD methods evaluated under the closed-loop protocol, they are tested in fixed routes (e.g., Town05Long and Longest6 in CARLA) with the driving score as metrics, which is known for high variance due to the unsmoothed metric function and large randomness in the long route. Besides, these methods usually collect their own data for training, which makes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
andreas122001/temporal_tfpp
model

Videos

Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving· slideslive

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Robotic Path Planning Algorithms · Real-time simulation and control systems

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator