QuantKAN: A Unified Quantization Framework for Kolmogorov Arnold Networks
Kazi Ahmed Asif Fuad, Lizhong Chen

TL;DR
QuantKAN introduces a unified quantization framework for Kolmogorov Arnold Networks, enabling efficient low-bit deployment while maintaining accuracy across various datasets and network variants.
Contribution
This work extends modern quantization algorithms to spline-based KANs, providing the first systematic benchmarks and practical guidelines for their low-bit quantization.
Findings
KANs are compatible with low-bit quantization.
LSQ, LSQ+, and PACT maintain near full precision at 4-bit for shallow models.
GPTQ and Uniform outperform others in post-training quantization.
Abstract
Kolmogorov Arnold Networks (KANs) represent a new class of neural architectures that replace conventional linear transformations and node-based nonlinearities with spline-based function approximations distributed along network edges. Although KANs offer strong expressivity and interpretability, their heterogeneous spline and base branch parameters hinder efficient quantization, which remains unexamined compared to CNNs and Transformers. In this paper, we present QuantKAN, a unified framework for quantizing KANs across both quantization aware training (QAT) and post-training quantization (PTQ) regimes. QuantKAN extends modern quantization algorithms, such as LSQ, LSQ+, PACT, DoReFa, QIL, GPTQ, BRECQ, AdaRound, AWQ, and HAWQ-V2, to spline based layers with branch-specific quantizers for base, spline, and activation components. Through extensive experiments on MNIST, CIFAR 10, and CIFAR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques
