An Optimisation Framework for Unsupervised Environment Design

Nathan Monette; Alistair Letcher; Michael Beukman; Matthew T. Jackson; Alexander Rutherford; Alexander D. Goldie; Jakob N. Foerster

arXiv:2505.20659·cs.LG·July 10, 2025

An Optimisation Framework for Unsupervised Environment Design

Nathan Monette, Alistair Letcher, Michael Beukman, Matthew T. Jackson, Alexander Rutherford, Alexander D. Goldie, Jakob N. Foerster

PDF

Open Access

TL;DR

This paper introduces an optimization framework for unsupervised environment design in reinforcement learning, providing theoretical guarantees and demonstrating improved robustness of agents across diverse environments.

Contribution

It offers a new optimization-based approach with provable convergence for UED, advancing beyond prior methods reliant on convergence guarantees.

Findings

01

Outperforms prior UED methods in various environments

02

Provides theoretical convergence guarantees for the proposed algorithm

03

Enhances agent robustness in high-risk settings

Abstract

For reinforcement learning agents to be deployed in high-risk settings, they must achieve a high level of robustness to unfamiliar scenarios. One method for improving robustness is unsupervised environment design (UED), a suite of methods aiming to maximise an agent's generalisability across configurations of an environment. In this work, we study UED from an optimisation perspective, providing stronger theoretical guarantees for practical settings than prior work. Whereas previous methods relied on guarantees if they reach convergence, our framework employs a nonconvex-strongly-concave objective for which we provide a provably convergent algorithm in the zero-sum setting. We empirically verify the efficacy of our method, outperforming prior methods in a number of environments with varying difficulties.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBIM and Construction Integration