VisFly-Lab: Unified Differentiable Framework for First-Order Reinforcement Learning of Quadrotor Control

Fanxing Li; Fangyu Sun; Tianbao Zhang; Shuyu Wu; Dexin Zuo; yufei Yan; Wenxian Yu; Danping Zou

arXiv:2603.21123·cs.RO·March 24, 2026

VisFly-Lab: Unified Differentiable Framework for First-Order Reinforcement Learning of Quadrotor Control

Fanxing Li, Fangyu Sun, Tianbao Zhang, Shuyu Wu, Dexin Zuo, yufei Yan, Wenxian Yu, Danping Zou

PDF

Open Access

TL;DR

This paper introduces VisFly-Lab, a unified differentiable framework for multi-task quadrotor control using first-order reinforcement learning, addressing training challenges and demonstrating real-world transferability.

Contribution

It presents a comprehensive, extensible framework with a novel training algorithm, ABPT, to improve robustness and performance across diverse quadrotor tasks.

Findings

01

ABPT improves training stability in partially non-differentiable reward settings.

02

The framework achieves competitive results across four quadrotor control tasks.

03

Initial real-world deployments show promising transferability of learned policies.

Abstract

First-order reinforcement learning with differentiable simulation is promising for quadrotor control, but practical progress remains fragmented across task-specific settings. To support more systematic development and evaluation, we present a unified differentiable framework for multi-task quadrotor control. The framework is wrapped, extensible, and equipped with deployment-oriented dynamics, providing a common interface across four representative tasks: hovering, tracking, landing, and racing. We also present the suite of first-order learning algorithms, where we identify two practical bottlenecks of standard first-order training: limited state coverage caused by horizon initialization and gradient bias caused by partially non-differentiable rewards. To address these issues, we propose Amended Backpropagation Through Time (ABPT), which combines differentiable rollout optimization, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Control of Nonlinear Systems · Aerospace and Aviation Technology