Improving Memory Efficiency for Training KANs via Meta Learning
Zhangchi Zhao, Jun Shu, Deyu Meng, Zongben Xu

TL;DR
This paper introduces MetaKANs, a meta-learning approach that generates weights for Kolmogorov-Arnold neural networks, significantly reducing memory usage and training costs while maintaining or improving performance across various tasks.
Contribution
The paper proposes MetaKANs, a novel meta-learning framework that enhances the memory efficiency and scalability of KANs without sacrificing interpretability or performance.
Findings
MetaKANs achieve comparable or better performance than standard KANs.
Significant reduction in trainable parameters and memory usage.
Effective across diverse tasks like symbolic regression, PDE solving, and image classification.
Abstract
Inspired by the Kolmogorov-Arnold representation theorem, KANs offer a novel framework for function approximation by replacing traditional neural network weights with learnable univariate functions. This design demonstrates significant potential as an efficient and interpretable alternative to traditional MLPs. However, KANs are characterized by a substantially larger number of trainable parameters, leading to challenges in memory efficiency and higher training costs compared to MLPs. To address this limitation, we propose to generate weights for KANs via a smaller meta-learner, called MetaKANs. By training KANs and MetaKANs in an end-to-end differentiable manner, MetaKANs achieve comparable or even superior performance while significantly reducing the number of trainable parameters and maintaining promising interpretability. Extensive experiments on diverse benchmark tasks, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)
