Self-supervised Pretraining for Partial Differential Equations
Varun Madhavan, Amal S Sebastian, Bharath Ramsundar and, Venkatasubramanian Viswanathan

TL;DR
This paper introduces a self-supervised transformer-based neural PDE solver capable of generalizing across different PDE parameters without retraining, offering a flexible alternative to existing methods like Fourier Neural Operator.
Contribution
The work presents a novel self-supervised training approach for a transformer-based PDE solver that generalizes over parameters and scales with data and model size.
Findings
The model can generalize over PDE parameters without retraining.
Fine-tuning with small data improves specific parameter predictions.
The approach scales effectively with data and model size.
Abstract
In this work, we describe a novel approach to building a neural PDE solver leveraging recent advances in transformer based neural network architectures. Our model can provide solutions for different values of PDE parameters without any need for retraining the network. The training is carried out in a self-supervised manner, similar to pretraining approaches applied in language and vision tasks. We hypothesize that the model is in effect learning a family of operators (for multiple parameters) mapping the initial condition to the solution of the PDE at any future time step t. We compare this approach with the Fourier Neural Operator (FNO), and demonstrate that it can generalize over the space of PDE parameters, despite having a higher prediction error for individual parameter values compared to the FNO. We show that performance on a specific parameter can be improved by finetuning the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods for differential equations · Modeling and Simulation Systems
