Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring
Erwan Lecarpentier, Sebastian Rapp, Marc Melo, Emmanuel Rachelson

TL;DR
This paper presents a real-time, model-free Q-learning algorithm for autonomous glider soaring, demonstrating improved endurance by adapting to atmospheric conditions without relying on detailed models.
Contribution
It introduces a Q-learning based control method for autonomous soaring that is online, data-driven, and computationally efficient, suitable for real-time deployment.
Findings
Q-learning effectively controls the glider in non-stationary conditions
The approach enhances mission endurance by exploiting thermal updrafts
The method operates with low computational requirements
Abstract
Autonomous unpowered flight is a challenge for control and guidance systems: all the energy the aircraft might use during flight has to be harvested directly from the atmosphere. We investigate the design of an algorithm that optimizes the closed-loop control of a glider's bank and sideslip angles, while flying in the lower convective layer of the atmosphere in order to increase its mission endurance. Using a Reinforcement Learning approach, we demonstrate the possibility for real-time adaptation of the glider's behaviour to the time-varying and noisy conditions associated with thermal soaring flight. Our approach is online, data-based and model-free, hence avoids the pitfalls of aerological and aircraft modelling and allow us to deal with uncertainties and non-stationarity. Additionally, we put a particular emphasis on keeping low computational requirements in order to make on-board…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAerospace and Aviation Technology · Model Reduction and Neural Networks · Air Traffic Management and Optimization
MethodsQ-Learning
