Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous   Soaring

Erwan Lecarpentier; Sebastian Rapp; Marc Melo; Emmanuel Rachelson

arXiv:1707.05668·cs.LG·July 19, 2017·1 cites

Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring

Erwan Lecarpentier, Sebastian Rapp, Marc Melo, Emmanuel Rachelson

PDF

Open Access

TL;DR

This paper presents a real-time, model-free Q-learning algorithm for autonomous glider soaring, demonstrating improved endurance by adapting to atmospheric conditions without relying on detailed models.

Contribution

It introduces a Q-learning based control method for autonomous soaring that is online, data-driven, and computationally efficient, suitable for real-time deployment.

Findings

01

Q-learning effectively controls the glider in non-stationary conditions

02

The approach enhances mission endurance by exploiting thermal updrafts

03

The method operates with low computational requirements

Abstract

Autonomous unpowered flight is a challenge for control and guidance systems: all the energy the aircraft might use during flight has to be harvested directly from the atmosphere. We investigate the design of an algorithm that optimizes the closed-loop control of a glider's bank and sideslip angles, while flying in the lower convective layer of the atmosphere in order to increase its mission endurance. Using a Reinforcement Learning approach, we demonstrate the possibility for real-time adaptation of the glider's behaviour to the time-varying and noisy conditions associated with thermal soaring flight. Our approach is online, data-based and model-free, hence avoids the pitfalls of aerological and aircraft modelling and allow us to deal with uncertainties and non-stationarity. Additionally, we put a particular emphasis on keeping low computational requirements in order to make on-board…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAerospace and Aviation Technology · Model Reduction and Neural Networks · Air Traffic Management and Optimization

MethodsQ-Learning