Backpropagation through Time and Space: Learning Numerical Methods with   Multi-Agent Reinforcement Learning

Elliot Way; Dheeraj S.K. Kapilavai; Yiwei Fu; Lei Yu

arXiv:2203.08937·cs.LG·March 30, 2022·1 cites

Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning

Elliot Way, Dheeraj S.K. Kapilavai, Yiwei Fu, Lei Yu

PDF

Open Access

TL;DR

This paper presents BPTTS, a novel method for training neural networks to learn numerical schemes for PDEs in a multi-agent RL setting, enabling efficient and generalizable solutions for hyperbolic conservation laws.

Contribution

Introduction of BPTTS, a method for training spatio-temporal neural networks in MARL to learn numerical methods for PDEs, addressing non-stationarity via gradient flow across space and time.

Findings

01

Learned numerical policies match state-of-the-art methods.

02

Policies generalize well to different simulation setups.

03

Applicable to hyperbolic conservation laws like Burgers' and Euler equations.

Abstract

We introduce Backpropagation Through Time and Space (BPTTS), a method for training a recurrent spatio-temporal neural network, that is used in a homogeneous multi-agent reinforcement learning (MARL) setting to learn numerical methods for hyperbolic conservation laws. We treat the numerical schemes underlying partial differential equations (PDEs) as a Partially Observable Markov Game (POMG) in Reinforcement Learning (RL). Similar to numerical solvers, our agent acts at each discrete location of a computational space for efficient and generalizable learning. To learn higher-order spatial methods by acting on local states, the agent must discern how its actions at a given spatiotemporal location affect the future evolution of the state. The manifestation of this non-stationarity is addressed by BPTTS, which allows for the flow of gradients across both space and time. The learned numerical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Meteorological Phenomena and Simulations · Fluid Dynamics and Turbulent Flows