TL;DR
KAConvNet introduces a novel convolutional layer based on the Kolmogorov-Arnold theorem, enhancing interpretability and efficiency in vision recognition tasks, and outperforms existing KAN-based models.
Contribution
The paper proposes a new Kolmogorov-Arnold convolutional layer integrated with CNNs, improving interpretability and performance over previous KAN approaches.
Findings
KAConvNet outperforms existing KAN-based methods.
The model achieves competitive results with mainstream ViTs and CNNs.
The code is publicly available at https://github.com/UnicomAI/KAConvNet.
Abstract
The Convolutional Neural Networks (CNNs) have been the dominant and effective approach for general computer vision tasks. Recently, Kolmogorov-Arnold neural networks (KANs), based on the Kolmogorov-Arnold representation theorem, have shown potential to replace Multi-Layer Perceptrons (MLPs) in deep learning. KANs, which use learnable nonlinear activations on edges and simple summation on nodes, offer fewer parameters and greater explainability compared to MLPs. However, there has been limited exploration of integrating the Kolmogorov-Arnold representation theorem with convolutional methods for computer vision tasks. Existing attempts have merely replaced learnable activation functions with weights, undermining KANs' theoretical foundation and limiting their potential effectiveness. Additionally, the B-spline curves used in KANs suffer from computational inefficiency and a tendency to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
