MAVEN: A Meta-Reinforcement Learning Framework for Varying-Dynamics Expertise in Agile Quadrotor Maneuvers
Jin Zhou, Dongcheng Cao, Xian Wang, and Shuo Li

TL;DR
MAVEN introduces a meta-reinforcement learning framework with a predictive context encoder that enables quadrotor policies to adapt rapidly to significant dynamic variations, achieving robust agile navigation in simulation and real-world scenarios.
Contribution
The paper presents MAVEN, a novel meta-RL framework with a predictive context encoder for fast adaptation across diverse quadrotor dynamics, reducing training time to less than an hour.
Findings
Achieves zero-shot sim-to-real transfer for high-speed maneuvers.
Handles mass variations up to 66.7% and thrust losses of 70%.
Converges in less than an hour using GPU-vectorized simulation.
Abstract
Reinforcement learning (RL) has emerged as a powerful paradigm for achieving online agile navigation with quadrotors. Despite this success, policies trained via standard RL typically fail to generalize across significant dynamic variations, exhibiting a critical lack of adaptability. This work introduces MAVEN, a meta-RL framework that enables a single policy to achieve robust end-to-end navigation across a wide range of quadrotor dynamics. Our approach features a novel predictive context encoder, which learns to infer a latent representation of the system dynamics from interaction history. We demonstrate our method in agile waypoint traversal tasks under two challenging scenarios: large variations in quadrotor mass and severe single-rotor thrust loss. We leverage a GPU-vectorized simulator to distribute tasks across thousands of parallel environments, overcoming the long training times…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Aerospace and Aviation Technology
