Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers
Hang Zhou, Yuezhou Ma, Haixu Wu, Haowen Wang, Mingsheng Long

TL;DR
Unisolver introduces a Transformer-based neural PDE solver trained on diverse PDE data and conditioned on PDE components, achieving broad generalization and state-of-the-art results across multiple benchmarks.
Contribution
It presents a novel PDE-conditional Transformer model that leverages PDE components for universal solving capability, extending beyond traditional specialized neural PDE solvers.
Findings
Achieves state-of-the-art performance on large-scale PDE benchmarks.
Demonstrates strong generalization across diverse PDE types.
Integrates physical insights with Transformer architecture effectively.
Abstract
Deep models have recently emerged as promising tools to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve PDEs reasonably well, they are mainly restricted to a few instances of PDEs, e.g. a certain equation with a limited set of coefficients. This limits their generalization to diverse PDEs, preventing them from being practical surrogate models of numerical solvers. In this paper, we present Unisolver, a novel Transformer model trained on diverse data and conditioned on diverse PDEs, aiming towards a universal neural PDE solver capable of solving a wide scope of PDEs. Instead of purely scaling up data and parameters, Unisolver stems from the theoretical analysis of the PDE-solving process. Inspired by the mathematical structure of PDEs that a PDE solution is fundamentally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCloud Computing and Resource Management · Parallel Computing and Optimization Techniques · Quantum Computing Algorithms and Architecture
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout
