Convolutional Kolmogorov-Arnold Networks
Alexander Dylan Bodner, Antonio Santiago Tepsich, Jack Natan Spolski,, Santiago Pourteau

TL;DR
Convolutional Kolmogorov-Arnold Networks introduce learnable spline-based activations into convolutional layers, achieving comparable accuracy to traditional CNNs with significantly fewer parameters, thus enhancing efficiency and expressive power.
Contribution
This paper presents a novel convolutional architecture integrating Kolmogorov-Arnold Networks' learnable activations, improving parameter efficiency and model expressiveness over standard CNNs.
Findings
Achieves similar accuracy to CNNs on Fashion-MNIST
Uses up to 50% fewer parameters
Effectively captures complex spatial relationships
Abstract
In this paper, we present Convolutional Kolmogorov-Arnold Networks, a novel architecture that integrates the learnable spline-based activation functions of Kolmogorov-Arnold Networks (KANs) into convolutional layers. By replacing traditional fixed-weight kernels with learnable non-linear functions, Convolutional KANs offer a significant improvement in parameter efficiency and expressive power over standard Convolutional Neural Networks (CNNs). We empirically evaluate Convolutional KANs on the Fashion-MNIST dataset, demonstrating competitive accuracy with up to 50% fewer parameters compared to baseline classic convolutions. This suggests that the KAN Convolution can effectively capture complex spatial relationships with fewer resources, offering a promising alternative for parameter-efficient deep learning models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsConvolution
