A survey of benchmarking frameworks for reinforcement learning
Belinda Stapelberg, Katherine M. Malan

TL;DR
This survey reviews various reinforcement learning benchmarking frameworks, emphasizing their role in ensuring reproducibility, fair comparison, and standardization to advance research in the field.
Contribution
It provides a comprehensive overview of recent and widely used RL benchmarks, highlighting their implementation, tasks, and algorithm support to aid researchers.
Findings
Highlights the importance of benchmarks for reproducibility and fair comparison.
Summarizes recent popular RL benchmarking frameworks and their features.
Encourages standardization and wider adoption of benchmarking practices.
Abstract
Reinforcement learning has recently experienced increased prominence in the machine learning community. There are many approaches to solving reinforcement learning problems with new techniques developed constantly. When solving problems using reinforcement learning, there are various difficult challenges to overcome. To ensure progress in the field, benchmarks are important for testing new algorithms and comparing with other approaches. The reproducibility of results for fair comparison is therefore vital in ensuring that improvements are accurately judged. This paper provides an overview of different contributions to reinforcement learning benchmarking and discusses how they can assist researchers to address the challenges facing reinforcement learning. The contributions discussed are the most used and recent in the literature. The paper discusses the contributions in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
