Training Deep Physics-Informed Kolmogorov-Arnold Networks
Spyros Rigas, Fotios Anagnostopoulos, Michalis Papachristou, Georgios Alexandridis

TL;DR
This paper introduces novel initialization and residual gating techniques for deep physics-informed Kolmogorov-Arnold Networks, significantly enhancing training stability and performance on PDE benchmarks.
Contribution
It proposes a basis-agnostic initialization scheme and Residual-Gated Adaptive KANs to improve deep cPIKAN training stability and accuracy.
Findings
RGA KANs outperform cPIKANs and PirateNets on PDE benchmarks
The new initialization scheme improves stability and accuracy
RGA KANs remain stable where other models diverge
Abstract
Since their introduction, Kolmogorov-Arnold Networks (KANs) have been successfully applied across several domains, with physics-informed machine learning (PIML) emerging as one of the areas where they have thrived. In the PIML setting, Chebyshev-based physics-informed KANs (cPIKANs) have become the standard due to their computational efficiency. However, like their multilayer perceptron-based counterparts, cPIKANs face significant challenges when scaled to depth, leading to training instabilities that limit their applicability to several PDE problems. To address this, we propose a basis-agnostic, Glorot-like initialization scheme that preserves activation variance and yields substantial improvements in stability and accuracy over the default initialization of cPIKANs. Inspired by the PirateNet architecture, we further introduce Residual-Gated Adaptive KANs (RGA KANs), designed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
