Separable Operator Networks
Xinling Yu, Sean Hooten, Ziyue Liu, Yequan Zhao, Marco Fiorentino,, Thomas Van Vaerenbergh, Zheng Zhang

TL;DR
Separable Operator Networks (SepONet) significantly improve the efficiency of physics-informed operator learning for PDEs, offering faster training, reduced memory usage, and strong approximation capabilities, especially for complex, high-dimensional problems.
Contribution
We introduce SepONet, a novel framework with independent basis learning, proven universal approximation, and superior computational performance over existing methods like PI-DeepONet.
Findings
SepONet achieves up to 112x faster training than PI-DeepONet.
SepONet reduces GPU memory usage by up to 82x.
SepONet maintains accuracy on complex PDEs where PI-DeepONet fails.
Abstract
Operator learning has become a powerful tool in machine learning for modeling complex physical systems governed by partial differential equations (PDEs). Although Deep Operator Networks (DeepONet) show promise, they require extensive data acquisition. Physics-informed DeepONets (PI-DeepONet) mitigate data scarcity but suffer from inefficient training processes. We introduce Separable Operator Networks (SepONet), a novel framework that significantly enhances the efficiency of physics-informed operator learning. SepONet uses independent trunk networks to learn basis functions separately for different coordinate axes, enabling faster and more memory-efficient training via forward-mode automatic differentiation. We provide a universal approximation theorem for SepONet proving the existence of a separable approximation to any nonlinear continuous operator. Then, we comprehensively benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks Stability and Synchronization
MethodsDiffusion
