Planning on the fast lane: Learning to interact using attention mechanisms in path integral inverse reinforcement learning
Sascha Rosbach, Xing Li, Simon Gro{\ss}johann, Silviu Homoceanu and, Stefan Roth

TL;DR
This paper introduces a neural network architecture with attention mechanisms for path integral inverse reinforcement learning, enabling extended horizon reward prediction and improved interaction modeling in automated driving scenarios.
Contribution
It proposes a novel neural network with policy and temporal attention mechanisms for better sequential reward prediction in dynamic driving environments.
Findings
Policy attention focuses on collision-free trajectories.
Temporal attention captures persistent interactions over time.
Model performs well in complex simulated driving situations.
Abstract
General-purpose trajectory planning algorithms for automated driving utilize complex reward functions to perform a combined optimization of strategic, behavioral, and kinematic features. The specification and tuning of a single reward function is a tedious task and does not generalize over a large set of traffic situations. Deep learning approaches based on path integral inverse reinforcement learning have been successfully applied to predict local situation-dependent reward functions using features of a set of sampled driving policies. Sample-based trajectory planning algorithms are able to approximate a spatio-temporal subspace of feasible driving policies that can be used to encode the context of a situation. However, the interaction with dynamic objects requires an extended planning horizon, which depends on sequential context modeling. In this work, we are concerned with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
