Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World   Reinforcement Learning

Ari Viitala; Rinu Boney; Yi Zhao; Alexander Ilin; Juho Kannala

arXiv:2008.00715·cs.RO·November 9, 2020

Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning

Ari Viitala, Rinu Boney, Yi Zhao, Alexander Ilin, Juho Kannala

PDF

TL;DR

L2D is a simple, reproducible benchmark for real-world reinforcement learning using a Donkey car, demonstrating that RL algorithms can quickly learn to drive from scratch with minimal interaction.

Contribution

The paper introduces L2D, a low-cost, open-source benchmark for autonomous driving with a Donkey car, enabling easy application of RL algorithms to real-world driving tasks.

Findings

01

RL algorithms can learn to drive in less than five minutes.

02

RL can outperform imitation learning and human operators in this setup.

03

The benchmark is accessible and reproducible for the research community.

Abstract

We present Learning to Drive (L2D), a low-cost benchmark for real-world reinforcement learning (RL). L2D involves a simple and reproducible experimental setup where an RL agent has to learn to drive a Donkey car around three miniature tracks, given only monocular image observations and speed of the car. The agent has to learn to drive from disengagements, which occurs when it drives off the track. We present and open-source our training pipeline, which makes it straightforward to apply any existing RL algorithm to the task of autonomous driving with a Donkey car. We test imitation learning, state-of-the-art model-free, and model-based algorithms on the proposed L2D benchmark. Our results show that existing RL algorithms can learn to drive the car from scratch in less than five minutes of interaction. We demonstrate that RL algorithms can learn from sparse and noisy disengagement to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.