Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving

Hendrik Surmann; Jorge de Heuvel; Maren Bennewitz

arXiv:2505.05223·cs.RO·July 21, 2025

Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving

Hendrik Surmann, Jorge de Heuvel, Maren Bennewitz

PDF

Open Access

TL;DR

This paper introduces a multi-objective reinforcement learning approach for autonomous driving that dynamically adapts to individual user preferences in real-time without retraining, improving user satisfaction and safety.

Contribution

It presents a novel preference-driven MORL method enabling real-time adaptation of autonomous driving behavior along multiple style objectives without policy retraining.

Findings

01

The agent successfully adapts to changing preferences in urban scenarios.

02

Maintains safety and efficiency while adjusting driving style.

03

Operates effectively in complex mixed-traffic environments.

Abstract

Human drivers exhibit individual preferences regarding driving style. Adapting autonomous vehicles to these preferences is essential for user trust and satisfaction. However, existing end-to-end driving approaches often rely on predefined driving styles or require continuous user feedback for adaptation, limiting their ability to support dynamic, context-dependent preferences. We propose a novel approach using multi-objective reinforcement learning (MORL) with preference-driven optimization for end-to-end autonomous driving that enables runtime adaptation to driving style preferences. Preferences are encoded as continuous weight vectors to modulate behavior along interpretable style objectives $\unicode x 2013$ including efficiency, comfort, speed, and aggressiveness $\unicode x 2013$ without requiring policy retraining. Our single-policy agent integrates vision-based perception in complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Social Robot Interaction and HRI

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator