Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models

Aaron Dharna; Cong Lu; Jeff Clune

arXiv:2507.06466·cs.LG·July 10, 2025

Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models

Aaron Dharna, Cong Lu, Jeff Clune

PDF

Open Access

TL;DR

This paper introduces Foundation-Model Self-Play (FMSP), a novel approach leveraging foundation models to enhance self-play in multi-agent settings, enabling diverse strategy discovery and overcoming local optima.

Contribution

The paper proposes FMSP and its variants, combining foundation models with self-play to improve strategy diversity and quality in complex multi-agent environments.

Findings

01

FMSP surpasses human-designed strategies in policy quality.

02

FMSP successfully jailbreaks and patches defenses in AI safety simulations.

03

FMSP enables automatic discovery of diverse, high-quality strategies.

Abstract

Multi-agent interactions have long fueled innovation, from natural predator-prey dynamics to the space race. Self-play (SP) algorithms try to harness these dynamics by pitting agents against ever-improving opponents, thereby creating an implicit curriculum toward learning high-quality solutions. However, SP often fails to produce diverse solutions and can get stuck in locally optimal behaviors. We introduce Foundation-Model Self-Play (FMSP), a new direction that leverages the code-generation capabilities and vast knowledge of foundation models (FMs) to overcome these challenges by leaping across local optima in policy space. We propose a family of approaches: (1) \textbf{Vanilla Foundation-Model Self-Play (vFMSP)} continually refines agent policies via competitive self-play; (2) \textbf{Novelty-Search Self-Play (NSSP)} builds a diverse population of strategies, ignoring performance; and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics