Collision Avoidance and Navigation for a Quadrotor Swarm Using   End-to-end Deep Reinforcement Learning

Zhehui Huang; Zhaojing Yang; Rahul Krupani; Bask{\i}n; \c{S}enba\c{s}lar; Sumeet Batra; Gaurav S. Sukhatme

arXiv:2309.13285·cs.RO·May 7, 2024

Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning

Zhehui Huang, Zhaojing Yang, Rahul Krupani, Bask{\i}n, \c{S}enba\c{s}lar, Sumeet Batra, Gaurav S. Sukhatme

PDF

Open Access

TL;DR

This paper introduces an end-to-end deep reinforcement learning method for quadrotor swarms that effectively navigates obstacle-rich environments, demonstrating zero-shot transfer from simulation to real-world hardware with scalable multi-robot control.

Contribution

It presents the first successful use of attention mechanisms in DRL for swarm navigation, enabling obstacle and neighbor avoidance on hardware-constrained quadrotors.

Findings

01

Scales to 32 robots in simulation with 80% obstacle density

02

Successfully transfers policies to 8 real quadrotors in physical environments

03

Achieves obstacle and neighbor avoidance with end-to-end DRL

Abstract

End-to-end deep reinforcement learning (DRL) for quadrotor control promises many benefits -- easy deployment, task generalization and real-time execution capability. Prior end-to-end DRL-based methods have showcased the ability to deploy learned controllers onto single quadrotors or quadrotor teams maneuvering in simple, obstacle-free environments. However, the addition of obstacles increases the number of possible interactions exponentially, thereby increasing the difficulty of training RL policies. In this work, we propose an end-to-end DRL approach to control quadrotor swarms in environments with obstacles. We provide our agents a curriculum and a replay buffer of the clipped collision episodes to improve performance in obstacle-rich environments. We implement an attention mechanism to attend to the neighbor robots and obstacle interactions - the first successful demonstration of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Adaptive Dynamic Programming Control