Driving Style Encoder: Situational Reward Adaptation for General-Purpose Planning in Automated Driving
Sascha Rosbach, Vinit James, Simon Gro{\ss}johann, Silviu Homoceanu,, Xing Li, Stefan Roth

TL;DR
This paper introduces a deep learning method that generates situational reward functions for automated driving, improving adaptability and reducing manual tuning compared to traditional linear reward functions.
Contribution
It presents a novel inverse reinforcement learning approach that creates situation-dependent reward functions for general-purpose planning in automated driving.
Findings
Deep network outperforms clustered linear reward functions.
Achieves comparable results to linear reward functions with prior situational knowledge.
Reduces manual effort in reward function design.
Abstract
General-purpose planning algorithms for automated driving combine mission, behavior, and local motion planning. Such planning algorithms map features of the environment and driving kinematics into complex reward functions. To achieve this, planning experts often rely on linear reward functions. The specification and tuning of these reward functions is a tedious process and requires significant experience. Moreover, a manually designed linear reward function does not generalize across different driving situations. In this work, we propose a deep learning approach based on inverse reinforcement learning that generates situation-dependent reward functions. Our neural network provides a mapping between features and actions of sampled driving policies of a model-predictive control-based planner and predicts reward functions for upcoming planning cycles. In our evaluation, we compare the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
