Towards Size-Independent Generalization Bounds for Deep Operator Nets

Pulkit Gopalani; Sayar Karmakar; Dibyakanti Kumar; Anirbit Mukherjee

arXiv:2205.11359 · cs.LG · December 5, 2024

TL;DR

This paper derives generalization bounds for DeepONets, a neural network architecture used for solving PDEs, that do not explicitly depend on network size. It bounds the Rademacher complexity of a class of DeepONets, shows how to choose the Huber loss accordingly, and validates the resulting capacity measure experimentally.

Contribution

The paper proves a Rademacher complexity bound for a class of DeepONets that does not explicitly scale with the width of the nets, and shows how the Huber loss can be chosen so that the resulting generalization error bound has no explicit dependence on network size.

Findings

1. Size-independent Rademacher complexity bounds for DeepONets.
2. Generalization error bounds that do not depend on network size.
3. Experimental correlation between capacity measure and generalization error.

Abstract

In recent times, machine learning methods have made significant advances in becoming a useful tool for analyzing physical systems. A particularly active area in this theme has been "physics-informed machine learning", which focuses on using neural nets for numerically solving differential equations. In this work, we aim to advance the theory of measuring out-of-sample error while training DeepONets, which are among the most versatile ways to solve PDE systems in one shot. Firstly, for a class of DeepONets, we prove a bound on their Rademacher complexity which does not explicitly scale with the width of the nets involved. Secondly, we use this to show how the Huber loss can be chosen so that, for these DeepONet classes, generalization error bounds can be obtained that have no explicit dependence on the size of the nets. The effective capacity measure for DeepONets that we thus derive is…
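
For readers unfamiliar with the Huber loss mentioned in the abstract, the following is a minimal sketch of its standard textbook definition in plain Python. It is not taken from the paper or its repository: the function names `huber` and `mean_huber` and the default threshold `delta=1.0` are illustrative assumptions, and the abstract does not state how the authors set the threshold. One property that plausibly matters for Rademacher-based bounds is that the loss is quadratic near zero but grows only linearly for large residuals, so its slope never exceeds `delta`.

```python
def huber(residual: float, delta: float = 1.0) -> float:
    """Standard Huber loss: quadratic for |residual| <= delta, linear beyond.

    `delta` is a hypothetical default; the paper's choice is not given here.
    """
    if delta <= 0:
        raise ValueError("delta must be positive")
    a = abs(residual)
    if a <= delta:
        return 0.5 * a * a
    return delta * (a - 0.5 * delta)


def mean_huber(predictions, targets, delta: float = 1.0) -> float:
    """Average Huber loss over paired predictions and targets."""
    if len(predictions) != len(targets) or len(predictions) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(huber(p - t, delta) for p, t in zip(predictions, targets))
    return total / len(predictions)
```

A small residual such as 0.5 falls in the quadratic regime, while a residual of 3.0 with `delta=1.0` falls in the linear regime, which is what makes the loss less sensitive to outliers than squared error.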

Code & Models

Repositories

Dibyakanti/Towards-Size-Independent-Generalization-Bounds-for-Deep-Operator-Nets
JAX · Official

Taxonomy

Topics: Model Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Advancements in Semiconductor Devices and Circuit Design