Guided Deep Reinforcement Learning for Swarm Systems
Maximilian H\"uttenrauch, Adrian \v{S}o\v{s}i\'c, Gerhard, Neumann

TL;DR
This paper presents a guided deep reinforcement learning approach for controlling swarm agents with limited sensing, using a central critic with global state access to improve policy learning for tasks like formation and target search.
Contribution
It introduces a novel actor-critic method where the critic has global state access during training, enhancing learning for decentralized swarm control policies.
Findings
Effective in simulated tasks of swarm formation and target localization
Demonstrates improved policy learning with guided critic approach
Applicable to cooperative agents with limited sensing capabilities
Abstract
In this paper, we investigate how to learn to control a group of cooperative agents with limited sensing capabilities such as robot swarms. The agents have only very basic sensor capabilities, yet in a group they can accomplish sophisticated tasks, such as distributed assembly or search and rescue tasks. Learning a policy for a group of agents is difficult due to distributed partial observability of the state. Here, we follow a guided approach where a critic has central access to the global state during learning, which simplifies the policy evaluation problem from a reinforcement learning point of view. For example, we can get the positions of all robots of the swarm using a camera image of a scene. This camera image is only available to the critic and not to the control policies of the robots. We follow an actor-critic approach, where the actors base their decisions only on locally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems
