SVD Perspectives for Augmenting DeepONet Flexibility and Interpretability
Simone Venturi, Tiernan Casey

TL;DR
This paper enhances DeepONet architectures by integrating SVD-based techniques and pre-transformation networks, improving their flexibility, interpretability, and efficiency in modeling complex dynamics with symmetries.
Contribution
It introduces SVD-DeepONet and flexDeepONet, novel extensions that leverage low-rank SVD methods and pre-transformation networks to improve DeepONet's design, training, and ability to handle symmetries.
Findings
flexDeepONet reduces trainable parameters by 95% in a combustion application
SVD-based methods improve DeepONet's interpretability and efficiency
Enhanced architectures better handle symmetries in dynamic systems
Abstract
Deep operator networks (DeepONets) are powerful architectures for fast and accurate emulation of complex dynamics. As their remarkable generalization capabilities are primarily enabled by their projection-based attribute, we investigate connections with low-rank techniques derived from the singular value decomposition (SVD). We demonstrate that some of the concepts behind proper orthogonal decomposition (POD)-neural networks can improve DeepONet's design and training phases. These ideas lead us to a methodology extension that we name SVD-DeepONet. Moreover, through multiple SVD analyses, we find that DeepONet inherits from its projection-based attribute strong inefficiencies in representing dynamics characterized by symmetries. Inspired by the work on shifted-POD, we develop flexDeepONet, an architecture enhancement that relies on a pre-transformation network for generating a moving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks
MethodsPrincipal Components Analysis
