High-Speed Vision-Based Flight in Clutter with Safety-Shielded Reinforcement Learning

Jiarui Zhang; Chengyong Lei; Chengjiang Dai; Lijie Wang; Zhichao Han; Fei Gao

arXiv:2602.08653·cs.RO·February 10, 2026

High-Speed Vision-Based Flight in Clutter with Safety-Shielded Reinforcement Learning

Jiarui Zhang, Chengyong Lei, Chengjiang Dai, Lijie Wang, Zhichao Han, Fei Gao

PDF

Open Access

TL;DR

This paper introduces a hybrid reinforcement learning framework for quadrotor UAVs that combines physics-informed training and real-time safety filtering, enabling safe, high-speed navigation in cluttered environments.

Contribution

It presents a novel end-to-end RL approach with integrated safety mechanisms, bridging the gap between fast autonomous flight and formal safety guarantees.

Findings

01

Outperforms traditional planners and recent RL methods in obstacle avoidance.

02

Enables reliable high-speed navigation up to 7.5 m/s in dense clutter.

03

Demonstrates strong generalization in outdoor forest environments.

Abstract

Quadrotor unmanned aerial vehicles (UAVs) are increasingly deployed in complex missions that demand reliable autonomous navigation and robust obstacle avoidance. However, traditional modular pipelines often incur cumulative latency, whereas purely reinforcement learning (RL) approaches typically provide limited formal safety guarantees. To bridge this gap, we propose an end-to-end RL framework augmented with model-based safety mechanisms. We incorporate physical priors in both training and deployment. During training, we design a physics-informed reward structure that provides global navigational guidance. During deployment, we integrate a real-time safety filter that projects the policy outputs onto a provably safe set to enforce strict collision-avoidance constraints. This hybrid architecture reconciles high-speed flight with robust safety assurances. Benchmark evaluations demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety · Robotic Path Planning Algorithms