Optimal control of point-to-point navigation in turbulent time-dependent   flows using Reinforcement Learning

Michele Buzzicotti; Luca Biferale; Fabio Bonaccorso; Patricio Clark di; Leoni; Kristian Gustavsson

arXiv:2103.00329·eess.SY·March 2, 2021

Optimal control of point-to-point navigation in turbulent time-dependent flows using Reinforcement Learning

Michele Buzzicotti, Luca Biferale, Fabio Bonaccorso, Patricio Clark di, Leoni, Kristian Gustavsson

PDF

TL;DR

This paper compares deterministic optimal navigation with reinforcement learning-based stochastic policies for point-to-point navigation in turbulent, time-dependent flows, demonstrating RL's ability to find near-optimal solutions where traditional methods fail.

Contribution

It introduces RL algorithms for navigation in complex flows, showing their effectiveness over classical optimal control in unstable, turbulent environments.

Findings

01

RL algorithms find quasi-optimal navigation paths in complex flows.

02

Optimal control solutions become unstable over typical navigation durations.

03

Allowing engine shutdown reduces energy consumption during navigation.

Abstract

We present theoretical and numerical results concerning the problem to find the path that minimizes the time to navigate between two given points in a complex fluid under realistic navigation constraints. We contrast deterministic Optimal Navigation (ON) control with stochastic policies obtained by Reinforcement Learning (RL) algorithms. We show that Actor-Critic RL algorithms are able to find quasi-optimal solutions in the presence of either time-independent or chaotically evolving flow configurations. For our application, ON solutions develop unstable behavior within the typical duration of the navigation process, and are therefore not useful in practice. We first explore navigation of turbulent flow using a constant propulsion speed. Based on a discretized phase-space, the propulsion direction is adjusted with the aim to minimize the time spent to reach the target. Further, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.