Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

Thomas Hickling; Maxwell Hogan; Abdulla Tammam; Nabil Aouf

arXiv:2502.20326·cs.RO·February 17, 2026

Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

Thomas Hickling, Maxwell Hogan, Abdulla Tammam, Nabil Aouf

PDF

Open Access

TL;DR

This paper introduces an integrated deep reinforcement learning framework for autonomous UAV search-and-rescue missions in indoor environments, demonstrating real-world deployment and competitive performance.

Contribution

It presents a novel end-to-end system combining DRL guidance, cooperative task allocation with GAT, and altitude stabilization techniques for indoor UAV SAR operations.

Findings

01

Successful real-world deployment on quad-rotors

02

Achieved first place in NATO-Sapience competition

03

Demonstrated reliable navigation in complex indoor environments

Abstract

This paper presents the first end-to-end framework that combines guidance, navigation, and centralised task allocation for multiple UAVs performing autonomous search-and-rescue (SAR) in GNSS-denied indoor environments. A Twin Delayed Deep Deterministic Policy Gradient controller is trained with an Artificial Potential Field (APF) reward that blends attractive and repulsive potentials with continuous control, accelerating convergence and yielding smoother, safer trajectories than distance-only baselines. Collaborative mission assignment is solved by a deep Graph Attention Network that, at each decision step, reasons over the drone-task graph to produce near-optimal allocations with negligible on-board compute. To arrest the notorious Z-drift of indoor LiDAR-SLAM, we fuse depth-camera altimetry with IMU vertical velocity in a lightweight complementary filter, giving centimetre-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Robotics and Sensor-Based Localization · Distributed Control Multi-Agent Systems

MethodsGraph Convolutional Network