Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies

Mattes Kraus; Jonas Kuckling

arXiv:2603.02783·cs.RO·March 4, 2026

Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies

Mattes Kraus, Jonas Kuckling

PDF

Open Access

TL;DR

This paper introduces a generative adversarial imitation learning framework for swarm robotics, enabling robots to learn collective behaviors from human demonstrations and trained policies, with successful real-world deployment.

Contribution

It presents a novel imitation learning approach for swarm robotics that learns from human and policy-derived demonstrations, validated through real-robot experiments.

Findings

01

Behaviors learned are qualitatively meaningful and similar to demonstrations.

02

Policies perform comparably in simulation and real-world experiments.

03

The framework effectively transfers learned behaviors to physical robot swarms.

Abstract

In imitation learning, robots are supposed to learn from demonstrations of the desired behavior. Most of the work in imitation learning for swarm robotics provides the demonstrations as rollouts of an existing policy. In this work, we provide a framework based on generative adversarial imitation learning that aims to learn collective behaviors from human demonstrations. Our framework is evaluated across six different missions, learning both from manual demonstrations and demonstrations derived from a PPO-trained policy. Results show that the imitation learning process is able to learn qualitatively meaningful behaviors that perform similarly well as the provided demonstrations. Additionally, we deploy the learned policies on a swarm of TurtleBot 4 robots in real-robot experiments. The exhibited behaviors preserved their visually recognizable character and their performance is comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Social Robot Interaction and HRI