Learning to Fly via Deep Model-Based Reinforcement Learning

Philip Becker-Ehmck; Maximilian Karl; Jan Peters; Patrick van der; Smagt

arXiv:2003.08876·cs.RO·August 5, 2020·24 cites

Learning to Fly via Deep Model-Based Reinforcement Learning

Philip Becker-Ehmck, Maximilian Karl, Jan Peters, Patrick van der, Smagt

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that a quadrotor drone can learn to fly autonomously using deep model-based reinforcement learning with minimal training data, solely on onboard resources, without prior flight knowledge.

Contribution

It introduces a novel approach to drone control by learning a probabilistic model of dynamics directly from raw sensory data, enabling effective model-based RL for flight.

Findings

01

Achieved successful drone flight with less than 30 minutes of training.

02

Learned control policy using only onboard sensors and computation.

03

Demonstrated deployment on a self-built drone without prior knowledge.

Abstract

Learning to control robots without requiring engineered models has been a long-term goal, promising diverse and novel applications. Yet, reinforcement learning has only achieved limited impact on real-time robot control due to its high demand of real-world interactions. In this work, by leveraging a learnt probabilistic model of drone dynamics, we learn a thrust-attitude controller for a quadrotor through model-based reinforcement learning. No prior knowledge of the flight dynamics is assumed; instead, a sequential latent variable model, used generatively and as an online filter, is learnt from raw sensory input. The controller and value function are optimised entirely by propagating stochastic analytic gradients through generated latent trajectories. We show that "learning to fly" can be achieved with less than 30 minutes of experience with a single drone, and can be deployed solely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

argmax-ai/drone-hardware
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference · Robotics and Sensor-Based Localization