TL;DR
DeepONet leverages the universal approximation theorem for operators to efficiently learn nonlinear operators from limited data, demonstrating high accuracy and convergence rates in modeling differential equations and dynamic systems.
Contribution
The paper introduces DeepONet, a novel neural network architecture that approximates nonlinear operators with theoretical guarantees and practical efficiency, outperforming traditional networks.
Findings
DeepONet achieves significantly lower generalization errors than fully-connected networks.
Theoretical analysis links approximation error to the number of sensors and input function type.
Computational results show polynomial and exponential convergence rates.
Abstract
While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors (branch net), and another for encoding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
