DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play

Akash Karthikeyan; Yash Vardhan Pant

arXiv:2511.13186·cs.LG·November 18, 2025

DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play

Akash Karthikeyan, Yash Vardhan Pant

PDF

Open Access

TL;DR

DiffFP introduces a diffusion-based fictitious play framework that enables learning robust, adaptive strategies in continuous multi-agent environments, achieving faster convergence and higher success rates than traditional reinforcement learning methods.

Contribution

This paper presents a novel diffusion-based fictitious play approach that estimates best responses to unseen opponents, improving convergence speed and robustness in continuous multi-agent games.

Findings

01

Converges to $$-Nash equilibria in continuous zero-sum games.

02

Achieves up to 3x faster convergence than baseline RL methods.

03

Attains 30x higher success rates against diverse opponents.

Abstract

Self-play reinforcement learning has demonstrated significant success in learning complex strategic and interactive behaviors in competitive multi-agent games. However, achieving such behaviors in continuous decision spaces remains challenging. Ensuring adaptability and generalization in self-play settings is critical for achieving competitive performance in dynamic multi-agent environments. These challenges often cause methods to converge slowly or fail to converge at all to a Nash equilibrium, making agents vulnerable to strategic exploitation by unseen opponents. To address these challenges, we propose DiffFP, a fictitious play (FP) framework that estimates the best response to unseen opponents while learning a robust and multimodal behavioral policy. Specifically, we approximate the best response using a diffusion policy that leverages generative modeling to learn adaptive and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Experimental Behavioral Economics Studies