Bootstrapping Reinforcement Learning with Imitation for Vision-Based   Agile Flight

Jiaxu Xing; Angel Romero; Leonard Bauersfeld; Davide Scaramuzza

arXiv:2403.12203·cs.RO·November 13, 2024·1 cites

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

PDF

Open Access

TL;DR

This paper introduces a hybrid learning framework combining reinforcement learning and imitation learning to efficiently train vision-based policies for agile quadrotor flight, outperforming existing methods in robustness and performance.

Contribution

The paper presents a novel three-phase policy learning framework that integrates RL and IL, enabling effective training of vision-based drone racing policies from simulation to real-world.

Findings

01

Successfully trained policies that outperform existing IL methods

02

Able to navigate complex race courses using only visual inputs

03

Effective transfer from simulation to real-world scenarios

Abstract

Learning visuomotor policies for agile quadrotor flight presents significant difficulties, primarily from inefficient policy exploration caused by high-dimensional visual inputs and the need for precise and low-latency control. To address these challenges, we propose a novel approach that combines the performance of Reinforcement Learning (RL) and the sample efficiency of Imitation Learning (IL) in the task of vision-based autonomous drone racing. While RL provides a framework for learning high-performance controllers through trial and error, it faces challenges with sample efficiency and computational demands due to the high dimensionality of visual inputs. Conversely, IL efficiently learns from visual expert demonstrations, but it remains limited by the expert's performance and state distribution. To overcome these limitations, our policy learning framework integrates the strengths of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms

MethodsFocus