TL;DR
This paper demonstrates that a vision-based reinforcement learning controller trained solely in simulation can effectively perform vehicle control tasks like lane following and collision avoidance in the real world, using domain randomization.
Contribution
It introduces a simple convolutional network trained with Proximal Policy Optimization and domain randomization for sim-to-real transfer in vehicle control.
Findings
Successful real-world lane following on a small-scale robot
Effective sim-to-real transfer demonstrated through performance metrics
Salient object maps provide insights into learned behaviors
Abstract
In this work, we study vision-based end-to-end reinforcement learning on vehicle control problems, such as lane following and collision avoidance. Our controller policy is able to control a small-scale robot to follow the right-hand lane of a real two-lane road, while its training was solely carried out in a simulation. Our model, realized by a simple, convolutional network, only relies on images of a forward-facing monocular camera and generates continuous actions that directly control the vehicle. To train this policy we used Proximal Policy Optimization, and to achieve the generalization capability required for real performance we used domain randomization. We carried out thorough analysis of the trained policy, by measuring multiple performance metrics and comparing these to baselines that rely on other methods. To assess the quality of the simulation-to-reality transfer learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
