Test-time Generalization for Physics through Neural Operator Splitting
Louis Serrano, Jiequn Han, Edouard Oyallon, Shirley Ho, Rudy Morel

TL;DR
This paper introduces a test-time neural operator splitting method that composes trained operators to achieve zero-shot generalization for PDE solutions, outperforming prior approaches on out-of-distribution tasks.
Contribution
It proposes a novel test-time composition strategy for neural operators that enhances zero-shot generalization without retraining or fine-tuning.
Findings
State-of-the-art zero-shot generalization on out-of-distribution PDE tasks
Ability to recover underlying PDE parameters
Effective composition of trained operators for unseen physics
Abstract
Neural operators have shown promise in learning solution maps of partial differential equations (PDEs), but they often struggle to generalize when test inputs lie outside the training distribution, such as novel initial conditions, unseen PDE coefficients or unseen physics. Prior works address this limitation with large-scale multiple physics pretraining followed by fine-tuning, but this still requires examples from the new dynamics, falling short of true zero-shot generalization. In this work, we propose a method to enhance generalization at test time, i.e., without modifying pretrained weights. Building on DISCO, which provides a dictionary of neural operators trained across different dynamics, we introduce a neural operator splitting strategy that, at test time, searches over compositions of training operators to approximate unseen dynamics. On challenging out-of-distribution tasks…
Peer Reviews
Decision·Submitted to ICLR 2026
The problem that the paper is trying to solve is relevant and timely, and will have a significant impact. The paper is general is well written.
Despite the fact that the problem statement is extremely relevant, there are several problems as highlighted below: (a) The literature review is incomplete. There are works that has previously attempted to solve this problem. For example, ICON (https://www.pnas.org/doi/10.1073/pnas.2310142120), NCWNO (https://www.sciencedirect.com/science/article/pii/S0010465525003844). In fact the idea of combining previously learned solution is something NCWNO has explored previously (although the strategy of
- The core idea of combining a learned dictionary of neural operators with classical operator splitting at test time is, to my knowledge, highly novel. This is a well-motivated approach of bringing forth test-time computation to the realm of PDE surrogate modeling, drawing clever parallels to LLM inference techniques (beam search, best-of-N sampling). - A valuable feature of this formulation is the interpretability aspect. By analyzing the selected operator combinations, we can perform zero-shot
- While operator splitting has theoretical foundations for classical numerical methods, there's no analysis of when/why it works for learned neural operators. What's the role of the approximation error of the individual operators? Have you tried to study the convergence behavior of these splitting schemes (Lie or Strang)? How does the approximation error of the individual neural operators interact with the splitting error of the numerical scheme? - While computational complexity is stated, actua
- The test-time generalization in scientific ML is under-explored. This paper investigates a critical topic. - This paper is well-written. The motivation and formulation of test-time scaling are well presented.
This is a good topic, but I have a few concerns regarding the experiments part. - First, I think the OOD scenarios can be broader. Apart from the parameter extrapolation and operator composition, the authors might also consider unseen initial conditionals, boundary conditions, geometries, etc. Please refer to the unisolver paper [1]. - Second, it would be good to have a more explicit discussion of computational overhead. On Page 2, the authors also claimed that “This test-time strategy comes
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Quantum many-body systems · Machine Learning in Materials Science
