MatrixKAN: Parallelized Kolmogorov-Arnold Network
Cale Coffman, Lizhong Chen

TL;DR
MatrixKAN introduces a parallelized approach to accelerate Kolmogorov-Arnold Networks by leveraging matrix operations, significantly reducing training and inference times especially for high-degree B-splines.
Contribution
The paper presents MatrixKAN, a novel method that parallelizes B-spline calculations in KANs using matrix operations, enhancing computational efficiency and scalability.
Findings
Achieves approximately 40x speedup over traditional KAN.
Demonstrates superior scaling with B-spline degree.
Potential for further speedup with larger datasets or higher degrees.
Abstract
Kolmogorov-Arnold Networks (KAN) are a new class of neural network architecture representing a promising alternative to the Multilayer Perceptron (MLP), demonstrating improved expressiveness and interpretability. However, KANs suffer from slow training and inference speeds relative to MLPs due in part to the recursive nature of the underlying B-spline calculations. This issue is particularly apparent with respect to KANs utilizing high-degree B-splines, as the number of required non-parallelizable recursions is proportional to B-spline degree. We solve this issue by proposing MatrixKAN, a novel optimization that parallelizes B-spline calculations with matrix representation and operations, thus significantly improving effective computation time for models utilizing high-degree B-splines. In this paper, we demonstrate the superior scaling of MatrixKAN's computation time relative to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
Methods+ ( 1 ) ⟷ 805 ⟷ ( 330 ) ⟷ 4056|How do I file a complaint with Expedia?
