Curriculum-Based Iterative Self-Play for Scalable Multi-Drone Racing
Onur Akg\"un

TL;DR
CRUISE introduces a scalable reinforcement learning framework using curriculum learning and self-play to train autonomous multi-drone racing agents, significantly improving speed, success rate, and scalability in simulation.
Contribution
It presents a novel curriculum-based self-play method that enhances scalability and performance of multi-drone racing policies in high-fidelity simulations.
Findings
CRUISE nearly doubles the racing speed compared to baselines.
It maintains high success rates across increasing agent densities.
Ablation studies highlight the importance of curriculum structure.
Abstract
The coordination of multiple autonomous agents in high-speed, competitive environments represents a significant engineering challenge. This paper presents CRUISE (Curriculum-Based Iterative Self-Play for Scalable Multi-Drone Racing), a reinforcement learning framework designed to solve this challenge in the demanding domain of multi-drone racing. CRUISE overcomes key scalability limitations by synergistically combining a progressive difficulty curriculum with an efficient self-play mechanism to foster robust competitive behaviors. Validated in high-fidelity simulation with realistic quadrotor dynamics, the resulting policies significantly outperform both a standard reinforcement learning baseline and a state-of-the-art game-theoretic planner. CRUISE achieves nearly double the planner's mean racing speed, maintains high success rates, and demonstrates robust scalability as agent density…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
