Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting   Diversity

Robby Costales; Stefanos Nikolaidis

arXiv:2411.04466·cs.LG·November 8, 2024

Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity

Robby Costales, Stefanos Nikolaidis

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces DIVA, an evolutionary method for generating diverse training tasks in complex simulators, enhancing adaptive agent training and overcoming limitations of traditional environment design techniques.

Contribution

DIVA uniquely combines evolutionary strategies with semi-supervised environment design, allowing flexible and effective task generation in complex, open-ended simulators.

Findings

01

DIVA outperforms prior methods in training adaptive agents.

02

It effectively handles complex parameterizations.

03

Results demonstrate improved robustness of agents trained with DIVA.

Abstract

The wider application of end-to-end learning methods to embodied decision-making domains remains bottlenecked by their reliance on a superabundance of training data representative of the target domain. Meta-reinforcement learning (meta-RL) approaches abandon the aim of zero-shot generalization--the goal of standard reinforcement learning (RL)--in favor of few-shot adaptation, and thus hold promise for bridging larger generalization gaps. While learning this meta-level adaptive behavior still requires substantial data, efficient environment simulators approaching real-world complexity are growing in prevalence. Even so, hand-designing sufficiently diverse and numerous simulated training tasks for these complex domains is prohibitively labor-intensive. Domain randomization (DR) and procedural generation (PG), offered as solutions to this problem, require simulators to possess…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robbycostales/diva
pytorchOfficial

Videos

Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity· slideslive

Taxonomy

TopicsMulti-Agent Systems and Negotiation