Coordinated Diffusion: Generating Multi-Agent Behavior Without Multi-Agent Demonstrations
Lasse Peters, Laura Ferranti, Andrea Bajcsy, Javier Alonso-Mora

TL;DR
CoDi introduces a diffusion-based framework that learns multi-agent coordination from single-agent demonstrations and a user-defined cost function, avoiding the need for multi-agent demonstration data.
Contribution
This work presents a novel diffusion-based method to generate multi-agent behaviors using only single-agent data and a cost function, bypassing the data bottleneck.
Findings
CoDi effectively learns coordinated multi-agent behaviors from single-agent data.
The method is more data-efficient than existing multi-agent baselines.
CoDi performs well in simulation and hardware experiments, demonstrating robustness and coordination.
Abstract
Imitation learning powered by generative models has proven effective for modeling complex single-agent behaviors. However, teaching multi-agent systems, like multiple arms or vehicles, to coordinate through imitation learning is hindered by a fundamental data bottleneck: as the joint state-action space grows exponentially with the number of agents, collecting a sufficient amount of coordinated multi-agent demonstrations becomes extremely costly. In this work, we ask: how can we leverage single-agent demonstration data to learn multi-agent policies? We present Coordinated Diffusion (CoDi), a framework that couples independently trained single-agent diffusion policies through a user-defined multi-agent cost function, without requiring any coordinated demonstrations. We derive a new diffusion-based sampling scheme wherein the diffusion score function decomposes into independent,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
