A preliminary study on continual learning in computer vision using Kolmogorov-Arnold Networks
Alessandro Cacciatore, Valerio Morelli, Federica Paganica, Emanuele, Frontoni, Lucia Migliorelli, Daniele Berardini

TL;DR
This study evaluates Kolmogorov-Arnold Networks (KAN) for continual learning in computer vision, demonstrating their potential to outperform traditional MLPs and original KAN models in class-incremental tasks on MNIST datasets.
Contribution
The paper extends KAN evaluation to computer vision, introduces an efficient KAN variant, and analyzes hyperparameters and convolutional KAN performance in continual learning.
Findings
Efficient KAN outperforms traditional MLPs and original KAN.
Hyperparameters significantly influence model performance.
Preliminary convolutional KAN results compare favorably with CNNs.
Abstract
Deep learning has long been dominated by multi-layer perceptrons (MLPs), which have demonstrated superiority over other optimizable models in various domains. Recently, a new alternative to MLPs has emerged - Kolmogorov-Arnold Networks (KAN)- which are based on a fundamentally different mathematical framework. According to their authors, KANs address several major issues in MLPs, such as catastrophic forgetting in continual learning scenarios. However, this claim has only been supported by results from a regression task on a toy 1D dataset. In this paper, we extend the investigation by evaluating the performance of KANs in continual learning tasks within computer vision, specifically using the MNIST datasets. To this end, we conduct a structured analysis of the behavior of MLPs and two KAN-based models in a class-incremental learning scenario, ensuring that the architectures involved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks
Methods+ ( 1 ) ⟷ 805 ⟷ ( 330 ) ⟷ 4056|How do I file a complaint with Expedia?
