Efficient Training of Deep Neural Operator Networks via Randomized Sampling

Sharmila Karumuri; Lori Graham-Brady; Somdatta Goswami

arXiv:2409.13280·cs.LG·June 3, 2025·World Sci. Annu. Rev. Artif. Intell.

TL;DR

This paper introduces a randomized sampling method for training DeepONet neural operators, which enhances generalization, reduces computational time, and maintains accuracy across various scientific applications.

Contribution

The paper proposes a novel random sampling technique during DeepONet training that improves efficiency and generalization, addressing limitations of traditional uniform grid sampling.

Findings

01. Significant reduction in training time across benchmarks.

02. Maintained or improved test error compared to traditional methods.

03. Reduced memory requirements during training.

Abstract

Neural operators (NOs) employ deep neural networks to learn mappings between infinite-dimensional function spaces. Deep operator network (DeepONet), a popular NO architecture, has demonstrated success in the real-time prediction of complex dynamics across various scientific and engineering applications. In this work, we introduce a random sampling technique to be adopted during the training of DeepONet, aimed at improving the generalization ability of the model, while significantly reducing the computational time. The proposed approach targets the trunk network of the DeepONet model that outputs the basis functions corresponding to the spatiotemporal locations of the bounded domain on which the physical system is defined. While constructing the loss function, DeepONet training traditionally considers a uniform grid of spatiotemporal points at which all the output functions are evaluated…
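The idea in the abstract can be illustrated with a small sketch. The abstract is truncated here, so the details below are assumptions rather than the authors' implementation: a fresh subset of grid points is drawn uniformly without replacement at each training iteration, and the loss is evaluated only at those points. The names `toy_trunk`, `full_grid_loss`, and `random_subset_loss` are hypothetical, and the polynomial "trunk" is only a stand-in for a trained trunk network. The example uses the Python standard library so that it runs anywhere; it does not reproduce the PyTorch code in the linked repository.

```python
import random


def deeponet_output(branch_coeffs, trunk_basis):
    """DeepONet prediction at one location: inner product of the
    branch coefficients with the trunk basis values (both length p)."""
    return sum(b * t for b, t in zip(branch_coeffs, trunk_basis))


def toy_trunk(y, p):
    """Hypothetical stand-in for the trunk network: returns p
    polynomial basis values [1, y, y^2, ...] at location y."""
    return [y ** k for k in range(p)]


def full_grid_loss(branch_coeffs, grid, targets):
    """Mean squared error over every grid point (the traditional setup
    described in the abstract). Costs len(grid) trunk evaluations."""
    p = len(branch_coeffs)
    errs = [
        (deeponet_output(branch_coeffs, toy_trunk(y, p)) - u) ** 2
        for y, u in zip(grid, targets)
    ]
    return sum(errs) / len(errs)


def random_subset_loss(branch_coeffs, grid, targets, n_samples, rng):
    """Mean squared error over a random subset of grid points.
    Assumption: uniform sampling without replacement, redrawn on each
    call. Costs only n_samples trunk evaluations."""
    idx = rng.sample(range(len(grid)), n_samples)
    p = len(branch_coeffs)
    errs = [
        (deeponet_output(branch_coeffs, toy_trunk(grid[i], p)) - targets[i]) ** 2
        for i in idx
    ]
    return sum(errs) / n_samples, idx


if __name__ == "__main__":
    rng = random.Random(0)
    grid = [i / 99 for i in range(100)]       # 100 locations in [0, 1]
    targets = [1 + 2 * y for y in grid]       # toy output function u(y)
    coeffs = [0.5, 2.0, 0.0]                  # imperfect branch output
    print("full grid loss:", full_grid_loss(coeffs, grid, targets))
    for step in range(3):                     # a new subset per iteration
        loss, idx = random_subset_loss(coeffs, grid, targets, 10, rng)
        print("step", step, "subset loss:", loss)
```

In this sketch each iteration evaluates the trunk at 10 locations instead of 100, which is where the time and memory savings would come from; because the subset changes every iteration, the whole grid is still visited over the course of training.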

Code & Models

Repositories

centrum-intelliphysics/efficient_deeponet_training (PyTorch, official)

Taxonomy

Topics: Machine Learning and ELM