Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination
Kevin Fu, Shalin Anand Jain, Pierce Howell, Harish Ravichandar

TL;DR
This paper introduces CASH, a hypernetwork-based architecture that enables flexible, efficient, and generalizable multi-robot coordination by encoding robot capabilities, outperforming existing methods across various tasks and platforms.
Contribution
CASH is a novel hypernetwork architecture that dynamically adapts shared policies to different robots, balancing expressivity and efficiency without trade-offs.
Findings
CASH outperforms baseline architectures in performance and sample efficiency.
CASH achieves zero-shot generalization to unseen robots and team compositions.
CASH uses 60-80% fewer learnable parameters than traditional methods.
Abstract
Recent advances have enabled heterogeneous multi-robot teams to learn complex and effective coordination skills. However, existing neural architectures that support heterogeneous teaming tend to force a trade-off between expressivity and efficiency. Shared-parameter designs prioritize sample efficiency by enabling a single network to be shared across all or a pre-specified subset of robots (via input augmentations), but tend to limit behavioral diversity. In contrast, recent designs employ a separate policy for each robot, enabling greater diversity and expressivity at the cost of efficiency and generalization. Our key insight is that such tradeoffs can be avoided by viewing these design choices as ends of a broad spectrum. Inspired by recent work in transfer and meta learning, and building on prior work in multi-robot task allocation, we propose Capability-Aware Shared Hypernetworks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
