Test-time Generalization for Physics through Neural Operator Splitting

Louis Serrano; Jiequn Han; Edouard Oyallon; Shirley Ho; Rudy Morel

arXiv:2602.00884·cs.LG·February 3, 2026

Test-time Generalization for Physics through Neural Operator Splitting

Louis Serrano, Jiequn Han, Edouard Oyallon, Shirley Ho, Rudy Morel

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a test-time neural operator splitting method that composes trained operators to achieve zero-shot generalization for PDE solutions, outperforming prior approaches on out-of-distribution tasks.

Contribution

It proposes a novel test-time composition strategy for neural operators that enhances zero-shot generalization without retraining or fine-tuning.

Findings

01

State-of-the-art zero-shot generalization on out-of-distribution PDE tasks

02

Ability to recover underlying PDE parameters

03

Effective composition of trained operators for unseen physics

Abstract

Neural operators have shown promise in learning solution maps of partial differential equations (PDEs), but they often struggle to generalize when test inputs lie outside the training distribution, such as novel initial conditions, unseen PDE coefficients or unseen physics. Prior works address this limitation with large-scale multiple physics pretraining followed by fine-tuning, but this still requires examples from the new dynamics, falling short of true zero-shot generalization. In this work, we propose a method to enhance generalization at test time, i.e., without modifying pretrained weights. Building on DISCO, which provides a dictionary of neural operators trained across different dynamics, we introduce a neural operator splitting strategy that, at test time, searches over compositions of training operators to approximate unseen dynamics. On challenging out-of-distribution tasks…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 4

Strengths

The problem that the paper is trying to solve is relevant and timely, and will have a significant impact. The paper is general is well written.

Weaknesses

Despite the fact that the problem statement is extremely relevant, there are several problems as highlighted below: (a) The literature review is incomplete. There are works that has previously attempted to solve this problem. For example, ICON (https://www.pnas.org/doi/10.1073/pnas.2310142120), NCWNO (https://www.sciencedirect.com/science/article/pii/S0010465525003844). In fact the idea of combining previously learned solution is something NCWNO has explored previously (although the strategy of

Reviewer 02Rating 6Confidence 4

Strengths

- The core idea of combining a learned dictionary of neural operators with classical operator splitting at test time is, to my knowledge, highly novel. This is a well-motivated approach of bringing forth test-time computation to the realm of PDE surrogate modeling, drawing clever parallels to LLM inference techniques (beam search, best-of-N sampling). - A valuable feature of this formulation is the interpretability aspect. By analyzing the selected operator combinations, we can perform zero-shot

Weaknesses

- While operator splitting has theoretical foundations for classical numerical methods, there's no analysis of when/why it works for learned neural operators. What's the role of the approximation error of the individual operators? Have you tried to study the convergence behavior of these splitting schemes (Lie or Strang)? How does the approximation error of the individual neural operators interact with the splitting error of the numerical scheme? - While computational complexity is stated, actua

Reviewer 03Rating 4Confidence 4

Strengths

- The test-time generalization in scientific ML is under-explored. This paper investigates a critical topic. - This paper is well-written. The motivation and formulation of test-time scaling are well presented.

Weaknesses

This is a good topic, but I have a few concerns regarding the experiments part. - First, I think the OOD scenarios can be broader. Apart from the parameter extrapolation and operator composition, the authors might also consider unseen initial conditionals, boundary conditions, geometries, etc. Please refer to the unisolver paper [1]. - Second, it would be good to have a more explicit discussion of computational overhead. On Page 2, the authors also claimed that “This test-time strategy comes

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Quantum many-body systems · Machine Learning in Materials Science