Time-optimal Flight in Cluttered Environments via Safe Reinforcement Learning
Wei Xiao, Zhaohan Feng, Ziyu Zhou, Jian Sun, Gang Wang, and Jie Chen

TL;DR
This paper introduces a safe reinforcement learning method enabling quadrotors to navigate cluttered environments quickly and safely, outperforming existing algorithms in time efficiency and obstacle avoidance.
Contribution
The paper proposes a novel reinforcement learning approach with safety and terminal rewards for time-optimal, collision-free drone navigation in complex environments.
Findings
Outperforms state-of-the-art algorithms in flight time and safety.
Achieves 66.7% success rate in unseen challenging environments.
Demonstrates effective obstacle avoidance and time minimization.
Abstract
This paper addresses the problem of guiding a quadrotor through a predefined sequence of waypoints in cluttered environments, aiming to minimize the flight time while avoiding collisions. Previous approaches either suffer from prolonged computational time caused by solving complex non-convex optimization problems or are limited by the inherent smoothness of polynomial trajectory representations, thereby restricting the flexibility of movement. In this work, we present a safe reinforcement learning approach for autonomous drone racing with time-optimal flight in cluttered environments. The reinforcement learning policy, trained using safety and terminal rewards specifically designed to enforce near time-optimal and collision-free flight, outperforms current state-of-the-art algorithms. Additionally, experimental results demonstrate the efficacy of the proposed approach in achieving both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAerospace and Aviation Technology · Biomimetic flight and propulsion mechanisms · Aeroelasticity and Vibration Control
