Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales, Stefanos Nikolaidis

TL;DR
This paper introduces DIVA, an evolutionary method for generating diverse training tasks in complex simulators, enhancing adaptive agent training and overcoming limitations of traditional environment design techniques.
Contribution
DIVA uniquely combines evolutionary strategies with semi-supervised environment design, allowing flexible and effective task generation in complex, open-ended simulators.
Findings
DIVA outperforms prior methods in training adaptive agents.
It effectively handles complex parameterizations.
Results demonstrate improved robustness of agents trained with DIVA.
Abstract
The wider application of end-to-end learning methods to embodied decision-making domains remains bottlenecked by their reliance on a superabundance of training data representative of the target domain. Meta-reinforcement learning (meta-RL) approaches abandon the aim of zero-shot generalization--the goal of standard reinforcement learning (RL)--in favor of few-shot adaptation, and thus hold promise for bridging larger generalization gaps. While learning this meta-level adaptive behavior still requires substantial data, efficient environment simulators approaching real-world complexity are growing in prevalence. Even so, hand-designing sufficiently diverse and numerous simulated training tasks for these complex domains is prohibitively labor-intensive. Domain randomization (DR) and procedural generation (PG), offered as solutions to this problem, require simulators to possess…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMulti-Agent Systems and Negotiation
