Learning Scalable Policies over Graphs for Multi-Robot Task Allocation using Capsule Attention Networks
Steve Paul, Payam Ghassemi, Souma Chowdhury

TL;DR
This paper introduces CapAM, a graph reinforcement learning architecture with capsule attention, for scalable multi-robot task allocation, achieving faster decision-making and better generalization over traditional methods.
Contribution
The paper proposes a novel capsule attention-based neural architecture for multi-robot task allocation, improving scalability and performance over existing learning and non-learning methods.
Findings
CapAM outperforms non-learning methods in task completion.
Decision-making is over 10 times faster with CapAM.
CapAM generalizes well to larger, unseen scenarios.
Abstract
This paper presents a novel graph reinforcement learning (RL) architecture to solve multi-robot task allocation (MRTA) problems that involve tasks with deadlines and workload, and robot constraints such as work capacity. While drawing motivation from recent graph learning methods that learn to solve combinatorial optimization (CO) problems such as multi-Traveling Salesman and Vehicle Routing Problems using RL, this paper seeks to provide better performance (compared to non-learning methods) and important scalability (compared to existing learning architectures) for the stated class of MRTA problems. The proposed neural architecture, called Capsule Attention-based Mechanism or CapAM acts as the policy network, and includes three main components: 1) an encoder: a Capsule Network based node embedding model to represent each task as a learnable feature vector; 2) a decoder: an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
