Mastering Diverse, Unknown, and Cluttered Tracks for Robust Vision-Based Drone Racing
Feng Yu, Yu Hu, Yang Su, Yang Deng, Linzuo Zhang, and Danping Zou

TL;DR
This paper introduces a two-phase reinforcement learning framework for drone racing that enhances generalization and robustness in unknown, cluttered environments by combining soft and hard collision training, adaptive curriculum, and perceptual constraints.
Contribution
It presents a novel two-phase learning approach with curriculum and constraints to improve drone racing in diverse, partially unknown environments, addressing prior limitations.
Findings
Achieves agile and robust drone flight in cluttered environments
Demonstrates effective transfer from simulation to real-world scenarios
Enhances generalization and obstacle avoidance capabilities
Abstract
Most reinforcement learning(RL)-based methods for drone racing target fixed, obstacle-free tracks, leaving the generalization to unknown, cluttered environments largely unaddressed. This challenge stems from the need to balance racing speed and collision avoidance, limited feasible space causing policy exploration trapped in local optima during training, and perceptual ambiguity between gates and obstacles in depth maps-especially when gate positions are only coarsely specified. To overcome these issues, we propose a two-phase learning framework: an initial soft-collision training phase that preserves policy exploration for high-speed flight, followed by a hard-collision refinement phase that enforces robust obstacle avoidance. An adaptive, noise-augmented curriculum with an asymmetric actor-critic architecture gradually shifts the policy's reliance from privileged gate-state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · UAV Applications and Optimization · Robotics and Sensor-Based Localization
