Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust

Thomas Chaffre; Julien Moras; Adrien Chan-Hon-Tong; Julien Marzat,; Karl Sammut; Gilles Le Chenadec; Benoit Clement

arXiv:2101.12501·cs.RO·July 7, 2021

Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust

Thomas Chaffre, Julien Moras, Adrien Chan-Hon-Tong, Julien Marzat,, Karl Sammut, Gilles Le Chenadec, Benoit Clement

PDF

Open Access

TL;DR

This paper compares learning-based and model-free adaptive control methods for a micro aerial vehicle under wind gusts, demonstrating the potential of learning-based approaches through realistic simulations.

Contribution

It introduces a simple learning-based adaptive control method using deep reinforcement learning and compares it with a model-free approach in MAV wind disturbance scenarios.

Findings

01

Learning-based control outperforms model-free in simulation.

02

Deep reinforcement learning effectively tunes the controller.

03

Learning-based methods show promise for real-world uncertainties.

Abstract

Navigation problems under unknown varying conditions are among the most important and well-studied problems in the control field. Classic model-based adaptive control methods can be applied only when a convenient model of the plant or environment is provided. Recent model-free adaptive control methods aim at removing this dependency by learning the physical characteristics of the plant and/or process directly from sensor feedback. Although there have been prior attempts at improving these techniques, it remains an open question as to whether it is possible to cope with real-world uncertainties in a control system that is fully based on either paradigm. We propose a conceptually simple learning-based approach composed of a full state feedback controller, tuned robustly by a deep reinforcement learning framework based on the Soft Actor-Critic algorithm. We compare it, in realistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control