Variational Kolmogorov-Arnold Network
Francesco Alesiani, Henrik Christiansen, Federico Errica

TL;DR
InfinityKAN introduces a variational inference method for Kolmogorov-Arnold Networks that automatically learns the number of basis functions, removing the need for manual hyperparameter tuning across diverse datasets.
Contribution
It proposes a novel framework that models the basis count as a latent variable with a prior, enabling automatic capacity adaptation during training.
Findings
Matches or exceeds KAN performance across 18 datasets.
Eliminates manual hyperparameter tuning for basis functions.
Ensures stable training through Lipschitz continuity.
Abstract
Kolmogorov-Arnold Networks (KANs) offer a theoretically grounded alternative to multi-layer perceptrons by representing multivariate functions as compositions of univariate basis functions. However, a critical limitation of KANs is the need to manually specify the number of basis functions per layer -- a hyperparameter that directly controls model capacity and substantially impacts performance, yet whose optimal value varies unpredictably across tasks. We present InfinityKAN, a variational inference framework that eliminates this design choice by learning the number of basis functions during training. Our approach models the basis count as a latent variable with a truncated exponential prior, introducing a differentiable weighting function that enables gradient-based optimization. We establish the Lipschitz continuity of the variational objective, ensuring stable training dynamics.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
