Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways

Branka Mirchevska; Maria Hügle; Gabriel Kalweit; Moritz Werling; Joschka Boedecker

arXiv:2012.03234·cs.LG·December 8, 2020

TL;DR

This paper presents a reinforcement learning approach combined with trajectory planning to optimize long-term highway driving strategies, outperforming several benchmark methods in realistic traffic simulations.

Contribution

It introduces a novel RL-based framework that integrates model-based action proposals with trajectory planning for improved long-term autonomous driving.

Findings

01

Outperforms four benchmark approaches in SUMO simulations

02

Achieves better long-term driving strategies

03

Balances continuous and discrete action spaces effectively

Abstract

Well-established optimization-based methods can guarantee an optimal trajectory for a short optimization horizon, typically no longer than a few seconds. As a result, choosing the optimal trajectory for this short horizon may still result in a sub-optimal long-term solution. At the same time, the resulting short-term trajectories allow for effective, comfortable and provably safe maneuvers in a dynamic traffic environment. In this work, we address the question of how to ensure an optimal long-term driving strategy while keeping the benefits of classical trajectory planning. We introduce a Reinforcement Learning based approach that, coupled with a trajectory planner, learns an optimal long-term decision-making strategy for driving on highways. By generating locally optimal maneuvers online as actions, we balance between the infinite low-level continuous action space and the limited…
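The idea of treating planner-generated maneuvers as a finite set of actions and letting a learned value function choose among them can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Maneuver` type, the fixed proposal menu in `propose_maneuvers`, and the hand-written `toy_q` scoring function are hypothetical stand-ins, not the authors' planner, network, or training procedure.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Maneuver:
    """Hypothetical stand-in for one locally optimal maneuver from a planner."""
    name: str
    lane_offset: int      # -1 = left, 0 = keep lane, +1 = right
    target_speed: float   # m/s


def propose_maneuvers(current_speed: float) -> List[Maneuver]:
    """Toy proposal step: a fixed menu instead of a real trajectory planner."""
    return [
        Maneuver("keep_lane", 0, current_speed),
        Maneuver("accelerate", 0, current_speed + 2.0),
        Maneuver("change_left", -1, current_speed),
        Maneuver("change_right", 1, current_speed),
    ]


def select_action(
    state: Dict[str, float],
    proposals: List[Maneuver],
    q_value: Callable[[Dict[str, float], Maneuver], float],
) -> Maneuver:
    """Pick the proposal with the highest estimated long-term value."""
    return max(proposals, key=lambda m: q_value(state, m))


def toy_q(state: Dict[str, float], m: Maneuver) -> float:
    """Hand-written placeholder for a learned Q-function (assumption):
    rewards being close to the desired speed, penalizes lane changes."""
    speed_gap = abs(state["desired_speed"] - m.target_speed)
    return -speed_gap - 0.5 * abs(m.lane_offset)


state = {"speed": 25.0, "desired_speed": 30.0}
best = select_action(state, propose_maneuvers(state["speed"]), toy_q)
print(best.name)
```

In a learned setting, `toy_q` would be replaced by a trained Q-network and the proposal menu by maneuvers produced by a trajectory planner; the selection step itself stays a maximization over the finite proposal set.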
