Randomized Sparse Neural Galerkin Schemes for Solving Evolution Equations with Deep Networks
Jules Berman, Benjamin Peherstorfer

TL;DR
This paper introduces a neural Galerkin scheme that updates randomized sparse subsets of network parameters at each time step, significantly improving efficiency and accuracy in solving evolution equations.
Contribution
The work proposes a novel randomized sparse update method for neural Galerkin schemes, reducing computational costs and error accumulation in time-dependent PDE solutions.
Findings
Up to 100x more accurate at fixed computational cost
Up to 100x faster at fixed accuracy
Effective across various evolution equations
Abstract
Training neural networks sequentially in time to approximate solution fields of time-dependent partial differential equations can be beneficial for preserving causality and other physics properties; however, the sequential-in-time training is numerically challenging because training errors quickly accumulate and amplify over time. This work introduces Neural Galerkin schemes that update randomized sparse subsets of network parameters at each time step. The randomization avoids overfitting locally in time and so helps prevent the error from accumulating quickly over the sequential-in-time training, which is motivated by dropout that addresses a similar issue of overfitting due to neuron co-adaptation. The sparsity of the update reduces the computational costs of training without losing expressiveness because many of the network parameters are redundant locally at each time step. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Numerical Methods in Computational Mathematics · Numerical methods for differential equations
MethodsDropout
