Multi-Level Monte Carlo Training of Neural Operators

James Rowbottom; Stefania Fresca; Pietro Lio; Carola-Bibiane Sch\"onlieb; Nicolas Boull\'e

arXiv:2505.12940·cs.LG·February 4, 2026

Multi-Level Monte Carlo Training of Neural Operators

James Rowbottom, Stefania Fresca, Pietro Lio, Carola-Bibiane Sch\"onlieb, Nicolas Boull\'e

PDF

TL;DR

This paper introduces a Multi-Level Monte Carlo training method for neural operators that reduces computational costs by leveraging multi-resolution data, maintaining high accuracy for PDE-related operator learning.

Contribution

It presents a novel MLMC-based training framework applicable to various neural operator architectures, improving efficiency over traditional single-resolution methods.

Findings

01

Enhanced computational efficiency demonstrated across multiple models

02

Existence of a Pareto curve between accuracy and computational time

03

Effective use of multi-resolution data for training neural operators

Abstract

Operator learning is a rapidly growing field that aims to approximate nonlinear operators related to partial differential equations (PDEs) using neural operators. These rely on discretization of input and output functions and are, usually, expensive to train for large-scale problems at high-resolution. Motivated by this, we present a Multi-Level Monte Carlo (MLMC) approach to train neural operators by leveraging a hierarchy of resolutions of function discretization. Our framework relies on using gradient corrections from fewer samples of fine-resolution data to decrease the computational cost of training while maintaining a high level accuracy. The proposed MLMC training procedure can be applied to any architecture accepting multi-resolution data. Our numerical experiments on a range of state-of-the-art models and test-cases demonstrate improved computational efficiency compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.