FiniteNet: A Fully Convolutional LSTM Network Architecture for Time-Dependent Partial Differential Equations
Ben Stevens, Tim Colonius

TL;DR
FiniteNet introduces a convolutional LSTM architecture that enhances traditional PDE solvers by reducing numerical errors, effectively capturing complex spatiotemporal dynamics across various PDE types.
Contribution
The paper presents a novel neural network architecture that improves PDE solution accuracy while maintaining convergence guarantees, extending the capabilities of existing numerical methods.
Findings
Reduces error by a factor of 2 to 3 compared to baseline methods.
Effectively models diverse PDE dynamics including wave propagation, shock formation, and chaos.
Demonstrates applicability across multiple PDE types with different behaviors.
Abstract
In this work, we present a machine learning approach for reducing the error when numerically solving time-dependent partial differential equations (PDE). We use a fully convolutional LSTM network to exploit the spatiotemporal dynamics of PDEs. The neural network serves to enhance finite-difference and finite-volume methods (FDM/FVM) that are commonly used to solve PDEs, allowing us to maintain guarantees on the order of convergence of our method. We train the network on simulation data, and show that our network can reduce error by a factor of 2 to 3 compared to the baseline algorithms. We demonstrate our method on three PDEs that each feature qualitatively different dynamics. We look at the linear advection equation, which propagates its initial conditions at a constant speed, the inviscid Burgers' equation, which develops shockwaves, and the Kuramoto-Sivashinsky (KS) equation, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Meteorological Phenomena and Simulations · Fluid Dynamics and Turbulent Flows
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
