Simultaneous learning of state-to-state minimum-time planning and control
Swati Dantu, Robert P\v{e}ni\v{c}ka, Martin Saska

TL;DR
This paper introduces a reinforcement learning framework that enables UAVs to learn minimum-time flight policies capable of navigating between any start and goal states, balancing agility and stability, with validation in simulation and real-world outdoor environments.
Contribution
It presents a novel RL-based approach that simultaneously learns planning and control for minimum-time UAV flights, generalizing beyond predefined tracks using curriculum learning and PMM trajectories.
Findings
The method outperforms NMPC in simulation.
Curriculum learning enhances training efficiency and generalization.
The policy is robust in outdoor real-world tests.
Abstract
This paper tackles the challenge of learning a generalizable minimum-time flight policy for UAVs, capable of navigating between arbitrary start and goal states while balancing agile flight and stable hovering. Traditional approaches, particularly in autonomous drone racing, achieve impressive speeds and agility but are constrained to predefined track layouts, limiting real-world applicability. To address this, we propose a reinforcement learning-based framework that simultaneously learns state-to-state minimum-time planning and control and generalizes to arbitrary state-to-state flights. Our approach leverages Point Mass Model (PMM) trajectories as proxy rewards to approximate the true optimal flight objective and employs curriculum learning to scale the training process efficiently and to achieve generalization. We validate our method through simulation experiments, comparing it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAerospace and Aviation Technology · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics
