Data Complexity Estimates for Operator Learning
Nikola B. Kovachki, Samuel Lanthaler, Hrushikesh Mhaskar

TL;DR
This paper investigates the data requirements for operator learning, establishing theoretical bounds and demonstrating that parametric efficiency can lead to data efficiency, especially with neural operators like FNO.
Contribution
It derives lower bounds on data complexity for general operators and shows that parametric efficiency implies data efficiency for certain neural operator classes.
Findings
Lower bounds on n-widths demonstrate exponential data requirements for general classes.
Efficient parametric approximation with FNO leads to corresponding data efficiency.
Data complexity is fundamentally linked to the parametric complexity of the operator class.
Abstract
Operator learning has emerged as a new paradigm for the data-driven approximation of nonlinear operators. Despite its empirical success, the theoretical underpinnings governing the conditions for efficient operator learning remain incomplete. The present work develops theory to study the data complexity of operator learning, complementing existing research on the parametric complexity. We investigate the fundamental question: How many input/output samples are needed in operator learning to achieve a desired accuracy ? This question is addressed from the point of view of -widths, and this work makes two key contributions. The first contribution is to derive lower bounds on -widths for general classes of Lipschitz and Fr\'echet differentiable operators. These bounds rigorously demonstrate a ``curse of data-complexity'', revealing that learning on such general classes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Fault Detection and Control Systems
