A survey of benchmarking frameworks for reinforcement learning

Belinda Stapelberg; Katherine M. Malan

arXiv:2011.13577·cs.LG·December 14, 2020

A survey of benchmarking frameworks for reinforcement learning

Belinda Stapelberg, Katherine M. Malan

PDF

TL;DR

This survey reviews various reinforcement learning benchmarking frameworks, emphasizing their role in ensuring reproducibility, fair comparison, and standardization to advance research in the field.

Contribution

It provides a comprehensive overview of recent and widely used RL benchmarks, highlighting their implementation, tasks, and algorithm support to aid researchers.

Findings

01

Highlights the importance of benchmarks for reproducibility and fair comparison.

02

Summarizes recent popular RL benchmarking frameworks and their features.

03

Encourages standardization and wider adoption of benchmarking practices.

Abstract

Reinforcement learning has recently experienced increased prominence in the machine learning community. There are many approaches to solving reinforcement learning problems with new techniques developed constantly. When solving problems using reinforcement learning, there are various difficult challenges to overcome. To ensure progress in the field, benchmarks are important for testing new algorithms and comparing with other approaches. The reproducibility of results for fair comparison is therefore vital in ensuring that improvements are accurately judged. This paper provides an overview of different contributions to reinforcement learning benchmarking and discusses how they can assist researchers to address the challenges facing reinforcement learning. The contributions discussed are the most used and recent in the literature. The paper discusses the contributions in terms of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.