Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning
Ari Viitala, Rinu Boney, Yi Zhao, Alexander Ilin, Juho Kannala

TL;DR
L2D is a simple, reproducible benchmark for real-world reinforcement learning using a Donkey car, demonstrating that RL algorithms can quickly learn to drive from scratch with minimal interaction.
Contribution
The paper introduces L2D, a low-cost, open-source benchmark for autonomous driving with a Donkey car, enabling easy application of RL algorithms to real-world driving tasks.
Findings
RL algorithms can learn to drive in less than five minutes.
RL can outperform imitation learning and human operators in this setup.
The benchmark is accessible and reproducible for the research community.
Abstract
We present Learning to Drive (L2D), a low-cost benchmark for real-world reinforcement learning (RL). L2D involves a simple and reproducible experimental setup where an RL agent has to learn to drive a Donkey car around three miniature tracks, given only monocular image observations and speed of the car. The agent has to learn to drive from disengagements, which occurs when it drives off the track. We present and open-source our training pipeline, which makes it straightforward to apply any existing RL algorithm to the task of autonomous driving with a Donkey car. We test imitation learning, state-of-the-art model-free, and model-based algorithms on the proposed L2D benchmark. Our results show that existing RL algorithms can learn to drive the car from scratch in less than five minutes of interaction. We demonstrate that RL algorithms can learn from sparse and noisy disengagement to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
