Derivative-Free Reinforcement Learning: A Review
Hong Qian, Yang Yu

TL;DR
This paper reviews the recent developments in derivative-free reinforcement learning, highlighting methods, challenges, and future directions in this emerging field that combines optimization and decision-making in unknown environments.
Contribution
It provides a comprehensive summary and organization of derivative-free reinforcement learning methods, addressing current limitations and suggesting future research directions.
Findings
Summarizes various derivative-free RL methods and their applications.
Identifies key challenges and limitations in current approaches.
Proposes future research directions for the field.
Abstract
Reinforcement learning is about learning agent models that make the best sequential decisions in unknown environments. In an unknown environment, the agent needs to explore the environment while exploiting the collected information, which usually forms a sophisticated problem to solve. Derivative-free optimization, meanwhile, is capable of solving sophisticated problems. It commonly uses a sampling-and-updating framework to iteratively improve the solution, where exploration and exploitation are also needed to be well balanced. Therefore, derivative-free optimization deals with a similar core issue as reinforcement learning, and has been introduced in reinforcement learning approaches, under the names of learning classifier systems and neuroevolution/evolutionary reinforcement learning. Although such methods have been developed for decades, recently, derivative-free reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
