Single-step deep reinforcement learning for open-loop control of laminar   and turbulent flows

H. Ghraieb; J. Viquerat; A. Larcher; P. Meliga; E. Hachem

arXiv:2006.02979·math.OC·May 19, 2021

Single-step deep reinforcement learning for open-loop control of laminar and turbulent flows

H. Ghraieb, J. Viquerat, A. Larcher, P. Meliga, E. Hachem

PDF

1 Repo

TL;DR

This paper demonstrates the potential of a novel single-step deep reinforcement learning approach, based on a modified PPO algorithm, for optimizing complex fluid flow control problems in computational fluid dynamics.

Contribution

It introduces a new single-step PPO algorithm combined with a stabilized finite elements environment for effective flow control optimization.

Findings

01

Single-step PPO effectively optimizes fluid flow control.

02

Method outperforms traditional direct and adjoint methods in certain cases.

03

Applicable to complex unsteady and high-dimensional flow problems.

Abstract

This research gauges the ability of deep reinforcement learning (DRL) techniques to assist the optimization and control of fluid mechanical systems. It combines a novel, "degenerate" version of the proximal policy optimization (PPO) algorithm, that trains a neural network in optimizing the system only once per learning episode, and an in-house stabilized finite elements environment implementing the variational multiscale (VMS) method, that computes the numerical reward fed to the neural network. Three prototypical examples of separated flows in two dimensions are used as testbed for developing the methodology, each of which adds a layer of complexity due either to the unsteadiness of the flow solutions, or the sharpness of the objective function, or the dimension of the control parameter space. Relevance is carefully assessed by comparing systematically to reference data obtained by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jviquerat/pbo
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEntropy Regularization · Proximal Policy Optimization