Some Best Practices in Operator Learning

Dustin Enyeart; Guang Lin

arXiv:2412.06686·cs.LG·December 10, 2024

Some Best Practices in Operator Learning

Dustin Enyeart, Guang Lin

PDF

Open Access 1 Repo

TL;DR

This paper investigates hyperparameter choices and training methods for operator learning architectures like DeepONets, Fourier neural operators, and Koopman autoencoders, aiming to identify robust training trends across differential equations.

Contribution

It provides practical guidelines on hyperparameters and training strategies tailored for operator learning models, enhancing their robustness and efficiency.

Findings

01

Activation functions significantly affect performance.

02

Dropout and stochastic weight averaging improve robustness.

03

Certain hyperparameter settings are consistently effective across models.

Abstract

Hyperparameters searches are computationally expensive. This paper studies some general choices of hyperparameters and training methods specifically for operator learning. It considers the architectures DeepONets, Fourier neural operators and Koopman autoencoders for several differential equations to find robust trends. Some options considered are activation functions, dropout and stochastic weight averaging.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/dustin_lee/neural-operators
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Learning in Engineering · Intelligent Tutoring Systems and Adaptive Learning

MethodsDropout