Uncertainty-Aware Decision Transformer for Stochastic Driving   Environments

Zenan Li; Fan Nie; Qiao Sun; Fang Da; Hang Zhao

arXiv:2309.16397·cs.LG·October 8, 2024·2 cites

Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

Zenan Li, Fan Nie, Qiao Sun, Fang Da, Hang Zhao

PDF

Open Access

TL;DR

This paper presents UNREST, a novel uncertainty-aware decision transformer for stochastic driving environments, improving offline RL by estimating uncertainties and using truncated returns for more reliable planning.

Contribution

Introduces UNREST, which estimates uncertainties via mutual information and employs truncated returns, enhancing offline RL in stochastic driving scenarios without complex models.

Findings

01

UNREST outperforms existing methods in various driving scenarios.

02

Uncertainty estimation improves planning safety and reliability.

03

Truncated returns reduce the impact of environment stochasticity.

Abstract

Offline Reinforcement Learning (RL) enables policy learning without active interactions, making it especially appealing for self-driving tasks. Recent successes of Transformers inspire casting offline RL as sequence modeling, which, however, fails in stochastic environments with incorrect assumptions that identical actions can consistently achieve the same goal. In this paper, we introduce an UNcertainty-awaRE deciSion Transformer (UNREST) for planning in stochastic driving environments without introducing additional transition or complex generative models. Specifically, UNREST estimates uncertainties by conditional mutual information between transitions and returns. Discovering 'uncertainty accumulation' and 'temporal locality' properties of driving environments, we replace the global returns in decision transformers with truncated returns less affected by environments to learn from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Absolute Position Encodings · Dense Connections · Layer Normalization · Multi-Head Attention · Byte Pair Encoding