R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

Connor Mattson; Varun Raveendra; Ellen Novoseller; Nicholas Waytowich; Vernon J. Lawhern; Daniel S. Brown

arXiv:2510.18085·cs.RO·October 22, 2025

R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

Connor Mattson, Varun Raveendra, Ellen Novoseller, Nicholas Waytowich, Vernon J. Lawhern, Daniel S. Brown

PDF

Open Access

TL;DR

This paper introduces R2BC, a method enabling a single human to train multi-robot systems through sequential demonstrations, effectively teaching multi-agent behaviors without requiring joint action demonstrations.

Contribution

The paper presents R2BC, a novel approach for multi-agent imitation learning that uses single-agent demonstrations to train entire robot teams, simplifying data collection.

Findings

01

R2BC matches or surpasses oracle methods in simulated tasks.

02

R2BC successfully deployed on physical robots with real human demonstrations.

03

Sequential single-agent training effectively teaches multi-agent behaviors.

Abstract

Imitation Learning (IL) is a natural way for humans to teach robots, particularly when high-quality demonstrations are easy to obtain. While IL has been widely applied to single-robot settings, relatively few studies have addressed the extension of these methods to multi-agent systems, especially in settings where a single human must provide demonstrations to a team of collaborating robots. In this paper, we introduce and study Round-Robin Behavior Cloning (R2BC), a method that enables a single human operator to effectively train multi-robot systems through sequential, single-agent demonstrations. Our approach allows the human to teleoperate one agent at a time and incrementally teach multi-agent behavior to the entire system, without requiring demonstrations in the joint multi-agent action space. We show that R2BC methods match, and in some cases surpass, the performance of an oracle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Social Robot Interaction and HRI