Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa, Tomoyuki Kaneko

TL;DR
Rogue-Gym is a roguelike environment designed to evaluate the generalization capabilities of reinforcement learning agents, revealing limitations of current methods and the need for improved generalization techniques.
Contribution
This paper introduces Rogue-Gym as a new benchmark for testing RL generalization, highlighting the challenges and limitations of existing methods like PPO.
Findings
Standard PPO struggles with overfitting in Rogue-Gym.
Some enhancements slightly improve generalization, but overall challenges remain.
Rogue-Gym provides a diverse and fair environment for assessing RL generalization.
Abstract
In this paper, we propose Rogue-Gym, a simple and classic style roguelike game built for evaluating generalization in reinforcement learning (RL). Combined with the recent progress of deep neural networks, RL has successfully trained human-level agents without human knowledge in many games such as those for Atari 2600. However, it has been pointed out that agents trained with RL methods often overfit the training environment, and they work poorly in slightly different environments. To investigate this problem, some research environments with procedural content generation have been proposed. Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents. In our Rogue-Gym, agents need to explore dungeons that are structured differently each time they start a new game. Thanks to the very diverse structures of the dungeons, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Artificial Intelligence in Games
MethodsEntropy Regularization · Proximal Policy Optimization
