Robust Agents in Open-Ended Worlds

Mikayel Samvelyan

arXiv:2512.08139·cs.LG·December 10, 2025

Robust Agents in Open-Ended Worlds

Mikayel Samvelyan

PDF

Open Access

TL;DR

This paper introduces new frameworks and methods for training and evaluating robust AI agents across diverse, open-ended environments, focusing on generalisation, adversarial robustness, and multi-agent interactions.

Contribution

It presents MiniHack for environment generation, Maestro for adversarial curriculum creation, and applies quality-diversity and evolutionary methods to improve robustness in multi-agent and language models.

Findings

01

MiniHack enables diverse environment creation for RL training.

02

Maestro improves agent robustness through adversarial curricula.

03

Evolutionary search identifies vulnerabilities in LLMs.

Abstract

The growing prevalence of artificial intelligence (AI) in various applications underscores the need for agents that can successfully navigate and adapt to an ever-changing, open-ended world. A key challenge is ensuring these AI agents are robust, excelling not only in familiar settings observed during training but also effectively generalising to previously unseen and varied scenarios. In this thesis, we harness methodologies from open-endedness and multi-agent learning to train and evaluate robust AI agents capable of generalising to novel environments, out-of-distribution inputs, and interactions with other co-player agents. We begin by introducing MiniHack, a sandbox framework for creating diverse environments through procedural content generation. Based on the game of NetHack, MiniHack enables the construction of new tasks for reinforcement learning (RL) agents with a focus on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Games · Reinforcement Learning in Robotics