What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Jiayu Chen; Chao Yu; Yuqing Xie; Feng Gao; Yinuo Chen; Shu'ang Yu; Wenhao Tang; Shilong Ji; Mo Mu; Yi Wu; Huazhong Yang; Yu Wang

arXiv:2412.11764·cs.RO·May 23, 2025

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Jiayu Chen, Chao Yu, Yuqing Xie, Feng Gao, Yinuo Chen, Shu'ang Yu, Wenhao Tang, Shilong Ji, Mo Mu, Yi Wu, Huazhong Yang, Yu Wang

PDF

Open Access

TL;DR

This paper presents SimpleFlight, a PPO-based training framework that significantly improves the robustness of zero-shot sim-to-real RL policies for quadrotor control, enabling precise flight in real-world conditions.

Contribution

The paper introduces SimpleFlight, a novel PPO-based framework that incorporates five key techniques to enhance zero-shot transfer of RL policies for quadrotor control.

Findings

01

Achieves over 50% reduction in trajectory tracking error.

02

Performs well on both smooth and challenging trajectories.

03

Outperforms existing RL baselines in real-world tests.

Abstract

Executing precise and agile flight maneuvers is critical for quadrotors in various applications. Traditional quadrotor control approaches are limited by their reliance on flat trajectories or time-consuming optimization, which restricts their flexibility. Recently, RL-based policy has emerged as a promising alternative due to its ability to directly map observations to actions, reducing the need for detailed system knowledge and actuation constraints. However, a significant challenge remains in bridging the sim-to-real gap, where RL-based policies often experience instability when deployed in real world. In this paper, we investigate key factors for learning robust RL-based control policies that are capable of zero-shot deployment in real-world quadrotors. We identify five critical factors and we develop a PPO-based training framework named SimpleFlight, which integrates these five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience · Power System Optimization and Stability