Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning
Rohan Banerjee, Prishita Ray, Mark Campbell

TL;DR
This paper introduces a Bayesian optimization-based curriculum learning method to enhance the environmental robustness of deep reinforcement learning agents in autonomous racing, outperforming traditional and hand-engineered curricula.
Contribution
It proposes a novel Bayesian optimization approach to automatically select curricula, improving robustness in deep RL for autonomous racing tasks.
Findings
Bayesian optimization-based curriculum outperforms vanilla deep RL agents.
The method surpasses hand-engineered curricula in robustness.
Code availability facilitates reproducibility and further research.
Abstract
Deep reinforcement learning (RL) approaches have been broadly applied to a large number of robotics tasks, such as robot manipulation and autonomous driving. However, an open problem in deep RL is learning policies that are robust to variations in the environment, which is an important condition for such systems to be deployed into real-world, unstructured settings. Curriculum learning is one approach that has been applied to improve generalization performance in both supervised and reinforcement learning domains, but selecting the appropriate curriculum to achieve robustness can be a user-intensive process. In our work, we show that performing probabilistic inference of the underlying curriculum-reward function using Bayesian Optimization can be a promising technique for finding a robust curriculum. We demonstrate that a curriculum found with Bayesian optimization can outperform a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Data Classification · Software Reliability and Analysis Research
