Robust Agents in Open-Ended Worlds
Mikayel Samvelyan

TL;DR
This paper introduces new frameworks and methods for training and evaluating robust AI agents across diverse, open-ended environments, focusing on generalisation, adversarial robustness, and multi-agent interactions.
Contribution
It presents MiniHack for environment generation, Maestro for adversarial curriculum creation, and applies quality-diversity and evolutionary methods to improve robustness in multi-agent and language models.
Findings
MiniHack enables diverse environment creation for RL training.
Maestro improves agent robustness through adversarial curricula.
Evolutionary search identifies vulnerabilities in LLMs.
Abstract
The growing prevalence of artificial intelligence (AI) in various applications underscores the need for agents that can successfully navigate and adapt to an ever-changing, open-ended world. A key challenge is ensuring these AI agents are robust, excelling not only in familiar settings observed during training but also effectively generalising to previously unseen and varied scenarios. In this thesis, we harness methodologies from open-endedness and multi-agent learning to train and evaluate robust AI agents capable of generalising to novel environments, out-of-distribution inputs, and interactions with other co-player agents. We begin by introducing MiniHack, a sandbox framework for creating diverse environments through procedural content generation. Based on the game of NetHack, MiniHack enables the construction of new tasks for reinforcement learning (RL) agents with a focus on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Games · Reinforcement Learning in Robotics
