Efficient Training of Deep Neural Operator Networks via Randomized Sampling
Sharmila Karumuri, Lori Graham-Brady, Somdatta Goswami

TL;DR
This paper introduces a randomized sampling method for training DeepONet neural operators, which enhances generalization, reduces computational time, and maintains accuracy across various scientific applications.
Contribution
The paper proposes a novel random sampling technique during DeepONet training that improves efficiency and generalization, addressing limitations of traditional uniform grid sampling.
Findings
Significant reduction in training time across benchmarks.
Maintained or improved test error compared to traditional methods.
Reduced memory requirements during training.
Abstract
Neural operators (NOs) employ deep neural networks to learn mappings between infinite-dimensional function spaces. Deep operator network (DeepONet), a popular NO architecture, has demonstrated success in the real-time prediction of complex dynamics across various scientific and engineering applications. In this work, we introduce a random sampling technique to be adopted during the training of DeepONet, aimed at improving the generalization ability of the model, while significantly reducing the computational time. The proposed approach targets the trunk network of the DeepONet model that outputs the basis functions corresponding to the spatiotemporal locations of the bounded domain on which the physical system is defined. While constructing the loss function, DeepONet training traditionally considers a uniform grid of spatiotemporal points at which all the output functions are evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM
