UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning   Leveraging Planning

Christopher Diehl; Timo Sievernich; Martin Kr\"uger; Frank Hoffmann,; Torsten Bertram

arXiv:2111.11097·cs.RO·June 29, 2022·1 cites

UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning

Christopher Diehl, Timo Sievernich, Martin Kr\"uger, Frank Hoffmann,, Torsten Bertram

PDF

Open Access

TL;DR

UMBRELLA introduces an uncertainty-aware model-based offline RL method that improves decision-making in automated driving by jointly modeling stochastic traffic dynamics and leveraging planning for safety and explainability.

Contribution

It presents a novel approach integrating aleatoric uncertainty into model-based offline RL for autonomous driving, enhancing safety and interpretability.

Findings

01

Effective in challenging simulated driving scenarios

02

Demonstrates improved decision quality on real-world datasets

03

Outperforms existing offline RL methods in safety-critical tasks

Abstract

Offline reinforcement learning (RL) provides a framework for learning decision-making from offline data and therefore constitutes a promising approach for real-world applications as automated driving. Self-driving vehicles (SDV) learn a policy, which potentially even outperforms the behavior in the sub-optimal data set. Especially in safety-critical applications as automated driving, explainability and transferability are key to success. This motivates the use of model-based offline RL approaches, which leverage planning. However, current state-of-the-art methods often neglect the influence of aleatoric uncertainty arising from the stochastic behavior of multi-agent systems. This work proposes a novel approach for Uncertainty-aware Model-Based Offline REinforcement Learning Leveraging plAnning (UMBRELLA), which solves the prediction, planning, and control problem of the SDV jointly in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management