Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight
Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

TL;DR
This paper introduces a hybrid learning framework combining reinforcement learning and imitation learning to efficiently train vision-based policies for agile quadrotor flight, outperforming existing methods in robustness and performance.
Contribution
The paper presents a novel three-phase policy learning framework that integrates RL and IL, enabling effective training of vision-based drone racing policies from simulation to real-world.
Findings
Successfully trained policies that outperform existing IL methods
Able to navigate complex race courses using only visual inputs
Effective transfer from simulation to real-world scenarios
Abstract
Learning visuomotor policies for agile quadrotor flight presents significant difficulties, primarily from inefficient policy exploration caused by high-dimensional visual inputs and the need for precise and low-latency control. To address these challenges, we propose a novel approach that combines the performance of Reinforcement Learning (RL) and the sample efficiency of Imitation Learning (IL) in the task of vision-based autonomous drone racing. While RL provides a framework for learning high-performance controllers through trial and error, it faces challenges with sample efficiency and computational demands due to the high dimensionality of visual inputs. Conversely, IL efficiently learns from visual expert demonstrations, but it remains limited by the expert's performance and state distribution. To overcome these limitations, our policy learning framework integrates the strengths of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
MethodsFocus
