Learning Perception-Aware Agile Flight in Cluttered Environments

Yunlong Song; Kexin Shi; Robert Penicka; and Davide Scaramuzza

arXiv:2210.01841·cs.RO·March 6, 2023

Learning Perception-Aware Agile Flight in Cluttered Environments

Yunlong Song, Kexin Shi, Robert Penicka, and Davide Scaramuzza

PDF

TL;DR

This paper introduces a perception-aware learning system for agile quadrotor flight in cluttered environments, combining imitation and reinforcement learning to improve speed and success rate in vision-based navigation.

Contribution

It presents a novel framework that integrates perception-awareness into agile flight control by distilling a full-state RL policy into a vision-based policy using imitation learning.

Findings

01

10x faster computation speed

02

Higher success rate in cluttered environments

03

Effective perception-control coupling

Abstract

Recently, neural control policies have outperformed existing model-based planning-and-control methods for autonomously navigating quadrotors through cluttered environments in minimum time. However, they are not perception aware, a crucial requirement in vision-based navigation due to the camera's limited field of view and the underactuated nature of a quadrotor. We propose a learning-based system that achieves perception-aware, agile flight in cluttered environments. Our method combines imitation learning with reinforcement learning (RL) by leveraging a privileged learning-by-cheating framework. Using RL, we first train a perception-aware teacher policy with full-state information to fly in minimum time through cluttered environments. Then, we use imitation learning to distill its knowledge into a vision-based student policy that only perceives the environment via a camera. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings