Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models
Aaron Dharna, Cong Lu, Jeff Clune

TL;DR
This paper introduces Foundation-Model Self-Play (FMSP), a novel approach leveraging foundation models to enhance self-play in multi-agent settings, enabling diverse strategy discovery and overcoming local optima.
Contribution
The paper proposes FMSP and its variants, combining foundation models with self-play to improve strategy diversity and quality in complex multi-agent environments.
Findings
FMSP surpasses human-designed strategies in policy quality.
FMSP successfully jailbreaks and patches defenses in AI safety simulations.
FMSP enables automatic discovery of diverse, high-quality strategies.
Abstract
Multi-agent interactions have long fueled innovation, from natural predator-prey dynamics to the space race. Self-play (SP) algorithms try to harness these dynamics by pitting agents against ever-improving opponents, thereby creating an implicit curriculum toward learning high-quality solutions. However, SP often fails to produce diverse solutions and can get stuck in locally optimal behaviors. We introduce Foundation-Model Self-Play (FMSP), a new direction that leverages the code-generation capabilities and vast knowledge of foundation models (FMs) to overcome these challenges by leaping across local optima in policy space. We propose a family of approaches: (1) Vanilla Foundation-Model Self-Play (vFMSP) continually refines agent policies via competitive self-play; (2) Novelty-Search Self-Play (NSSP) builds a diverse population of strategies, ignoring performance; and…
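The two variants named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the environment is rock-paper-scissors, policies are mixed strategies rather than generated code, and `propose_policy` is a hypothetical stand-in that randomly perturbs a parent policy where FMSP would query a foundation model. The function names, the acceptance rule, and the novelty threshold are all assumptions made for illustration.

```python
import random

# Row player's payoff in rock-paper-scissors (toy stand-in for a real environment).
# Index 0 = rock, 1 = paper, 2 = scissors.
PAYOFF = [[0, -1, 1],
          [1, 0, -1],
          [-1, 1, 0]]


def expected_payoff(p, q):
    """Expected payoff of mixed strategy p playing against mixed strategy q."""
    return sum(p[i] * q[j] * PAYOFF[i][j] for i in range(3) for j in range(3))


def propose_policy(parent, rng):
    """HYPOTHETICAL placeholder for a foundation-model call.

    Here it only perturbs the parent at random and renormalizes;
    in FMSP the new policy would come from a foundation model.
    """
    raw = [max(1e-6, x + rng.uniform(-0.3, 0.3)) for x in parent]
    total = sum(raw)
    return [x / total for x in raw]


def distance(p, q):
    """Euclidean distance between two policies, used as a novelty measure."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def vanilla_self_play(steps, rng):
    """vFMSP-style loop (assumed form): each side keeps a proposal only if it
    scores better against the opponent's current policy."""
    a = [0.8, 0.1, 0.1]
    b = [0.1, 0.8, 0.1]
    for _ in range(steps):
        cand = propose_policy(a, rng)
        if expected_payoff(cand, b) > expected_payoff(a, b):
            a = cand
        cand = propose_policy(b, rng)
        if expected_payoff(cand, a) > expected_payoff(b, a):
            b = cand
    return a, b


def novelty_search(steps, rng, threshold=0.2):
    """NSSP-style loop (assumed form): keep a proposal only if it is far from
    every archived policy; performance is never consulted."""
    archive = [[1 / 3, 1 / 3, 1 / 3]]
    for _ in range(steps):
        parent = rng.choice(archive)
        cand = propose_policy(parent, rng)
        if min(distance(cand, member) for member in archive) >= threshold:
            archive.append(cand)
    return archive
```

Calling `vanilla_self_play(50, random.Random(0))` returns the two refined policies, and `novelty_search(200, random.Random(1))` returns the archive. The contrast to notice is the acceptance test: the first loop compares payoffs against the current opponent, while the second compares only distances to previously archived policies.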
Taxonomy
Topics: Artificial Intelligence in Games · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
