TL;DR
CoFlow introduces a novel architecture for offline multi-agent reinforcement learning that achieves high coordination with minimal inference steps by native joint-coupled velocity fields.
Contribution
The paper proposes CoFlow, combining Coordinated Velocity Attention and Adaptive Coordination Gating, enabling single-pass multi-agent generation with superior coordination and efficiency.
Findings
CoFlow matches or surpasses various baseline methods on multiple benchmarks.
Single-pass inference suffices for high-quality coordination across configurations.
CoFlow achieves state-of-the-art coordination with 1-3 denoising steps.
Abstract
Generative models have emerged as a promising paradigm for offline multi-agent reinforcement learning (MARL), but existing approaches require many iterative sampling steps. Recent few-step acceleration methods either distill a joint teacher into independent students or apply averaged velocity fields independently to each agent. Unfortunately, these few-step approaches hurt inter-agent coordination. We show that the efficiency-coordination trade-off is not inherent: single-pass multi-agent generation can preserve coordination when the velocity field is natively joint-coupled. We propose Coordinated few-step Flow (CoFlow), an architecture that combines Coordinated Velocity Attention (CVA) with Adaptive Coordination Gating. A finite-difference consistency surrogate further replaces memory-prohibitive Jacobian-vector product backpropagation through the averaged velocity field with two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
