Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways
Branka Mirchevska, Maria H\"ugle, Gabriel Kalweit, Moritz Werling,, Joschka Boedecker

TL;DR
This paper presents a reinforcement learning approach combined with trajectory planning to optimize long-term highway driving strategies, outperforming several benchmark methods in realistic traffic simulations.
Contribution
It introduces a novel RL-based framework that integrates model-based action proposals with trajectory planning for improved long-term autonomous driving.
Findings
Outperforms four benchmark approaches in SUMO simulations
Achieves more optimal long-term driving strategies
Balances continuous and discrete action spaces effectively
Abstract
Well-established optimization-based methods can guarantee an optimal trajectory for a short optimization horizon, typically no longer than a few seconds. As a result, choosing the optimal trajectory for this short horizon may still result in a sub-optimal long-term solution. At the same time, the resulting short-term trajectories allow for effective, comfortable and provable safe maneuvers in a dynamic traffic environment. In this work, we address the question of how to ensure an optimal long-term driving strategy, while keeping the benefits of classical trajectory planning. We introduce a Reinforcement Learning based approach that coupled with a trajectory planner, learns an optimal long-term decision-making strategy for driving on highways. By online generating locally optimal maneuvers as actions, we balance between the infinite low-level continuous action space, and the limited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
