Learning Perception-Aware Agile Flight in Cluttered Environments
Yunlong Song, Kexin Shi, Robert Penicka, and Davide Scaramuzza

TL;DR
This paper introduces a perception-aware learning system for agile quadrotor flight in cluttered environments, combining imitation and reinforcement learning to improve speed and success rate in vision-based navigation.
Contribution
It presents a novel framework that integrates perception-awareness into agile flight control by distilling a full-state RL policy into a vision-based policy using imitation learning.
Findings
10x faster computation speed
Higher success rate in cluttered environments
Effective perception-control coupling
Abstract
Recently, neural control policies have outperformed existing model-based planning-and-control methods for autonomously navigating quadrotors through cluttered environments in minimum time. However, they are not perception aware, a crucial requirement in vision-based navigation due to the camera's limited field of view and the underactuated nature of a quadrotor. We propose a learning-based system that achieves perception-aware, agile flight in cluttered environments. Our method combines imitation learning with reinforcement learning (RL) by leveraging a privileged learning-by-cheating framework. Using RL, we first train a perception-aware teacher policy with full-state information to fly in minimum time through cluttered environments. Then, we use imitation learning to distill its knowledge into a vision-based student policy that only perceives the environment via a camera. Our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Adversarial Robustness in Machine Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
