Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation

Fanxing Li; Shengyang Wang; Yuxiang Huang; Fangyu Sun; Shuyu Wu; Yufei Yan; Danping Zou; Wenxian Yu

arXiv:2604.10548·cs.RO·April 17, 2026

Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation

Fanxing Li, Shengyang Wang, Yuxiang Huang, Fangyu Sun, Shuyu Wu, Yufei Yan, Danping Zou, Wenxian Yu

PDF

1 Repo

TL;DR

This paper introduces an end-to-end reinforcement learning approach using high-fidelity differentiable simulation to enable quadrotors to perform obstacle avoidance at high speeds with stable, safe, and efficient control directly from depth images.

Contribution

The authors propose a novel low-level control policy trained via differentiable simulation that directly maps depth images to bodyrate commands, improving flight stability and generalization.

Findings

01

Achieves the highest success rate in obstacle avoidance benchmarks.

02

Demonstrates stable flight at speeds up to 7.5 m/s in outdoor environments.

03

Successfully deploys zero-shot in unseen, dense forest environments.

Abstract

Obstacle avoidance is a fundamental vision-based task essential for enabling quadrotors to perform advanced applications. When planning the trajectory, existing approaches both on optimization and learning typically regard quadrotor as a point-mass model, giving path or velocity commands then tracking the commands by outer-loop controller. However, at high speeds, planned trajectories sometimes become dynamically infeasible in actual flight, which beyond the capacity of controller. In this paper, we propose a novel end-to-end policy that directly maps depth images to low-level bodyrate commands by reinforcement learning via differentiable simulation. The high-fidelity simulation in training after parameter identification significantly reduces all the gaps between training, simulation and real world. Analytical process by differentiable simulation provides accurate gradient to ensure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Fanxing-LI/avoidance
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.