Learning robust driving policies without online exploration

Daniel Graves; Nhat M. Nguyen; Kimia Hassanzadeh; Jun Jin; Jun Luo

arXiv:2103.08070 · cs.RO · March 16, 2021

Open Access

TL;DR

This paper introduces a multi-time-scale predictive representation learning approach for offline reinforcement learning, enabling robust driving policies that generalize to new road conditions without online exploration.

Contribution

The paper presents an offline learning method that improves the generalization and robustness of driving policies under conditions not covered by the training data, without requiring online exploration.

Findings

1. Generalizes to novel road geometries

2. Robust to damaged and distracting lane conditions

3. Performs well in both simulation and real-world tests

Abstract

We propose a multi-time-scale predictive representation learning method to efficiently learn robust driving policies offline that generalize well to novel road geometries and to damaged and distracting lane conditions that are not covered in the offline training data. We show that the proposed representation learning method can be applied easily in an offline (batch) reinforcement learning setting, demonstrating that it generalizes well and efficiently under novel conditions compared to standard batch RL methods. The method uses training data collected entirely offline in the real world, which removes the need for the intensive online exploration that impedes applying deep reinforcement learning to real-world robot training. Various experiments were conducted in both simulator and real-world scenarios for the purpose of evaluation and analysis of our proposed…
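
To make the phrase "multi-time-scale predictive representation" more concrete, the sketch below computes discounted sums of a logged signal at several discount factors from a single offline trajectory. It is an illustration only, not the authors' implementation: the function name `multi_timescale_targets`, the choice of discount factors, and the example signal (imagined here as a per-step lane-centeredness score) are all assumptions, and the returns are simply truncated at the end of the logged episode.

```python
import numpy as np


def multi_timescale_targets(signal, gammas):
    """Return an array of shape (T, len(gammas)).

    Entry [t, j] is the discounted sum of `signal` from step t to the
    end of the logged trajectory, using discount factor gammas[j].
    Hypothetical helper for illustration; not taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    T = len(signal)
    targets = np.zeros((T, len(gammas)))
    for j, gamma in enumerate(gammas):
        running = 0.0
        # Sweep backwards so each step reuses the sum from the next step.
        for t in reversed(range(T)):
            running = signal[t] + gamma * running
            targets[t, j] = running
    return targets


# Assumed example: a short logged signal and two time scales.
logged_signal = [1.0, 1.0, 1.0]
targets = multi_timescale_targets(logged_signal, gammas=[0.0, 0.5])
print(targets)
```

A small discount factor gives a short-horizon prediction (with `gamma = 0.0` the target is just the current signal), while a larger one summarizes more of the future. Stacking several such predictions per step is one plausible way to form a feature vector for a batch RL policy, under the assumptions stated above.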

Taxonomy

Topics: Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management