Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang, Yuxin Wu, Joelle Pineau

TL;DR
This paper introduces three new families of benchmark RL domains that incorporate some of the complexity of the natural world, enabling better evaluation of algorithm robustness and generalization in more realistic settings.
Contribution
The paper proposes three new families of benchmark RL domains that contain some of the complexity of the natural world while still supporting fast, extensive data acquisition and a fair evaluation of generalization.
Findings
The new benchmark domains contain some of the complexity of the natural world.
The domains permit fair train/test separation, so generalization can be characterized.
The domains make RL results easy to compare and replicate.
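The train/test separation named in the findings can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes each environment instance is built from an asset drawn from a pool, and the asset identifiers and the helpers `split_assets` and `generalization_gap` are hypothetical names introduced only for this example.

```python
import random


def split_assets(asset_ids, test_fraction=0.2, seed=0):
    """Partition a pool of asset identifiers into disjoint train and test sets.

    Assumption: asset_ids name the natural-world inputs (hypothetical here)
    from which individual environment instances are built.
    """
    ids = sorted(asset_ids)            # sort first so the split is reproducible
    rng = random.Random(seed)          # local RNG, leaves global state untouched
    rng.shuffle(ids)
    n_test = max(1, int(len(ids) * test_fraction))
    return ids[n_test:], ids[:n_test]  # (train, test)


def generalization_gap(train_returns, test_returns):
    """Mean episodic return on train assets minus mean return on test assets."""
    mean_train = sum(train_returns) / len(train_returns)
    mean_test = sum(test_returns) / len(test_returns)
    return mean_train - mean_test


# Hypothetical pool of ten assets; the agent would train only on train_ids
# and be evaluated on environments built from test_ids.
train_ids, test_ids = split_assets([f"asset_{i:03d}" for i in range(10)])
```

Fixing the seed and sorting before shuffling keeps the split identical across runs, which is what makes results comparable between papers. A large positive gap would suggest the policy has overfit to the training assets.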
Abstract
While current benchmark reinforcement learning (RL) tasks have been useful to drive progress in the field, they are in many ways poor substitutes for learning with real-world data. By testing increasingly complex RL algorithms on low-complexity simulation environments, we often end up with brittle RL policies that generalize poorly beyond the very specific domain. To combat this, we propose three new families of benchmark RL domains that contain some of the complexity of the natural world, while still supporting fast and extensive data acquisition. The proposed domains also permit a characterization of generalization through fair train/test separation, and easy comparison and replication of results. Through this work, we challenge the RL research community to develop more robust algorithms that meet high standards of evaluation.
Taxonomy
Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Smart Grid Energy Management
