Guided Deep Reinforcement Learning for Swarm Systems

Maximilian H\"uttenrauch; Adrian \v{S}o\v{s}i\'c; Gerhard; Neumann

arXiv:1709.06011·cs.MA·September 19, 2017·100 cites

Guided Deep Reinforcement Learning for Swarm Systems

Maximilian H\"uttenrauch, Adrian \v{S}o\v{s}i\'c, Gerhard, Neumann

PDF

Open Access 1 Repo

TL;DR

This paper presents a guided deep reinforcement learning approach for controlling swarm agents with limited sensing, using a central critic with global state access to improve policy learning for tasks like formation and target search.

Contribution

It introduces a novel actor-critic method where the critic has global state access during training, enhancing learning for decentralized swarm control policies.

Findings

01

Effective in simulated tasks of swarm formation and target localization

02

Demonstrates improved policy learning with guided critic approach

03

Applicable to cooperative agents with limited sensing capabilities

Abstract

In this paper, we investigate how to learn to control a group of cooperative agents with limited sensing capabilities such as robot swarms. The agents have only very basic sensor capabilities, yet in a group they can accomplish sophisticated tasks, such as distributed assembly or search and rescue tasks. Learning a policy for a group of agents is difficult due to distributed partial observability of the state. Here, we follow a guided approach where a critic has central access to the global state during learning, which simplifies the policy evaluation problem from a reinforcement learning point of view. For example, we can get the positions of all robots of the swarm using a camera image of a scene. This camera image is only available to the critic and not to the control policies of the robots. We follow an actor-critic approach, where the actors base their decisions only on locally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hex-plex/KiloBot-MultiAgent-RL
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems