KANMixer: a minimal KAN-centered mixer for long-term time series forecasting
Lingyu Jiang, Dengzhe Hou, Yuping Wang, Yao Su, Shuo Xing, Wenjing Chen, Xin Zhang, Zhengzhong Tu, Ziming Zhang, Fangzhou Lin, Michael Zielewski, Kazunori D Yamada

TL;DR
This paper introduces KANMixer, a novel KAN-centered architecture for long-term time series forecasting, demonstrating improved accuracy and providing insights into the impact of design choices on KAN performance.
Contribution
The paper proposes KANMixer, a minimal KAN-based model for LTSF, and offers practical guidance on design choices affecting KAN effectiveness.
Findings
KANMixer achieves best MSE in 16 out of 28 settings
B-spline bases outperform Fourier and Wavelet bases
Prediction head and moderate depth are crucial for performance
Abstract
Long-term time series forecasting (LTSF) underpins critical applications from energy management to weather prediction, yet achieving reliable multi-step-ahead accuracy remains challenging. Existing LTSF approaches, dominated by MLP- and Transformer-based architectures, either rely on simple linear mappings or introduce increasingly complex hand-crafted inductive biases, raising the question of whether a more expressive and principled nonlinear core could offer a better alternative. Therefore, we investigate whether Kolmogorov-Arnold Networks (KANs), a recently proposed model featuring adaptive basis functions capable of granular modulation of nonlinearities, can improve LTSF performance, and under which design choices they are most effective. Specifically, we propose KANMixer, a minimal KAN-centered architecture consisting of a multi-scale pooling frontend, a KAN-based temporal mixing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
