Escape Room: A Configurable Testbed for Hierarchical Reinforcement Learning
Jacob Menashe, Peter Stone

TL;DR
The paper introduces the Escape Room Domain (ERD), a new flexible and scalable benchmark for Hierarchical Reinforcement Learning that addresses the limitations of existing testbeds by providing moderate complexity challenges.
Contribution
It presents ERD as a new open-source testbed that bridges the gap between simple and complex HRL environments, facilitating better evaluation and development of HRL algorithms.
Findings
ERD offers scalable difficulty levels for HRL testing.
ERD is compatible with existing RL frameworks and interfaces.
ERD fills the gap between simple and complex HRL benchmarks.
Abstract
Recent successes in Reinforcement Learning have encouraged a fast-growing network of RL researchers and a number of breakthroughs in RL research. As the RL community and the body of RL work grows, so does the need for widely applicable benchmarks that can fairly and effectively evaluate a variety of RL algorithms. This need is particularly apparent in the realm of Hierarchical Reinforcement Learning (HRL). While many existing test domains may exhibit hierarchical action or state structures, modern RL algorithms still exhibit great difficulty in solving domains that necessitate hierarchical modeling and action planning, even when such domains are seemingly trivial. These difficulties highlight both the need for more focus on HRL algorithms themselves, and the need for new testbeds that will encourage and validate HRL research. Existing HRL testbeds exhibit a Goldilocks problem; they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Software Engineering Research
