Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation
J. Antonio Lara Benitez, Takashi Furuya, Florian Faucher, Anastasis, Kratsios, Xavier Tricoche, Maarten V. de Hoop

TL;DR
This paper introduces a new class of neural operators with stochastic depth, improving generalization for high-frequency wave problems like the Helmholtz equation, and provides theoretical out-of-distribution risk bounds.
Contribution
It proposes a stochastic depth neural operator framework inspired by transformers, with theoretical analysis and superior empirical performance on Helmholtz equation tasks.
Findings
Enhanced out-of-distribution generalization with stochastic depth
Theoretical upper bounds on Rademacher complexity and risk
Superior performance compared to standard neural operators
Abstract
Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator mapping wave speed to solution, or boundary values for the Helmholtz equation on a bounded domain. The latter operator is commonly referred to as the ''forward'' operator in the study of inverse problems. Our methodology draws inspiration from transformers and techniques such as stochastic depth. Our experiments reveal certain surprises in the generalization and the relevance of introducing stochastic depth. Our NOs show superior performance as compared with standard NOs, not only for testing within the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Non-Destructive Testing Techniques
MethodsHyperNetwork · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Test · Stochastic Depth · Dropout · Layer Normalization
