Agile Interception of a Flying Target using Competitive Reinforcement Learning
Timoth\'ee Gavin (ENAC-LAB, LAAS-RIS), Simon Lacroix (LAAS-RIS), Murat Bronz (ENAC)

TL;DR
This paper introduces a competitive reinforcement learning approach for agile drone interception, utilizing a high-fidelity simulation and low-level control to outperform heuristic methods in both simulation and real-world tests.
Contribution
It presents a novel competitive RL framework with a realistic simulation environment for agile drone interception using PPO and low-level control.
Findings
Outperforms heuristic baselines in catch rate and time to catch
Demonstrates successful transfer from simulation to real-world indoor flights
Achieves high agility and reliability in drone interception tasks
Abstract
This article presents a solution to intercept an agile drone by another agile drone carrying a catching net. We formulate the interception as a Competitive Reinforcement Learning problem, where the interceptor and the target drone are controlled by separate policies trained with Proximal Policy Optimization (PPO). We introduce a high-fidelity simulation environment that integrates a realistic quadrotor dynamics model and a low-level control architecture implemented in JAX, which allows for fast parallelized execution on GPUs. We train the agents using low-level control, collective thrust and body rates, to achieve agile flights both for the interceptor and the target. We compare the performance of the trained policies in terms of catch rate, time to catch, and crash rate, against common heuristic baselines and show that our solution outperforms these baselines for interception of agile…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems
