Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling, Maxwell Hogan, Abdulla Tammam, Nabil Aouf

TL;DR
This paper introduces an integrated deep reinforcement learning framework for autonomous UAV search-and-rescue missions in indoor environments, demonstrating real-world deployment and competitive performance.
Contribution
It presents a novel end-to-end system combining DRL guidance, cooperative task allocation with GAT, and altitude stabilization techniques for indoor UAV SAR operations.
Findings
Successful real-world deployment on quad-rotors
Achieved first place in NATO-Sapience competition
Demonstrated reliable navigation in complex indoor environments
Abstract
This paper presents the first end-to-end framework that combines guidance, navigation, and centralised task allocation for multiple UAVs performing autonomous search-and-rescue (SAR) in GNSS-denied indoor environments. A Twin Delayed Deep Deterministic Policy Gradient controller is trained with an Artificial Potential Field (APF) reward that blends attractive and repulsive potentials with continuous control, accelerating convergence and yielding smoother, safer trajectories than distance-only baselines. Collaborative mission assignment is solved by a deep Graph Attention Network that, at each decision step, reasons over the drone-task graph to produce near-optimal allocations with negligible on-board compute. To arrest the notorious Z-drift of indoor LiDAR-SLAM, we fuse depth-camera altimetry with IMU vertical velocity in a lightweight complementary filter, giving centimetre-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Robotics and Sensor-Based Localization · Distributed Control Multi-Agent Systems
MethodsGraph Convolutional Network
