Inverse Reinforcement Learning in Swarm Systems

Adrian \v{S}o\v{s}i\'c; Wasiur R. KhudaBukhsh; Abdelhak M. Zoubir,; Heinz Koeppl

arXiv:1602.05450·stat.ML·March 27, 2017·32 cites

Inverse Reinforcement Learning in Swarm Systems

Adrian \v{S}o\v{s}i\'c, Wasiur R. KhudaBukhsh, Abdelhak M. Zoubir,, Heinz Koeppl

PDF

Open Access

TL;DR

This paper extends inverse reinforcement learning to large-scale multi-agent swarm systems by introducing a new framework, reducing the problem to single-agent IRL, and demonstrating effective local reward modeling.

Contribution

It introduces the swarMDP framework, reduces multi-agent IRL to single-agent, and proposes a novel learning scheme tailored for swarm systems.

Findings

01

Successfully models local rewards from swarm data

02

Replicates observed global dynamics using learned rewards

03

Framework applicable to large-scale homogeneous systems

Abstract

Inverse reinforcement learning (IRL) has become a useful tool for learning behavioral models from demonstration data. However, IRL remains mostly unexplored for multi-agent systems. In this paper, we show how the principle of IRL can be extended to homogeneous large-scale problems, inspired by the collective swarming behavior of natural systems. In particular, we make the following contributions to the field: 1) We introduce the swarMDP framework, a sub-class of decentralized partially observable Markov decision processes endowed with a swarm characterization. 2) Exploiting the inherent homogeneity of this framework, we reduce the resulting multi-agent IRL problem to a single-agent one by proving that the agent-specific value functions in this model coincide. 3) To solve the corresponding control problem, we propose a novel heterogeneous learning scheme that is particularly tailored to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Neural dynamics and brain function