Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE
Minjong Cheon

TL;DR
This paper introduces KAN-SAE, a nonlinear autoencoder that improves interpretability of deep learning weather models by discovering more detailed and distinct climate features without supervision.
Contribution
The paper proposes KAN-SAE, a novel nonlinear autoencoder with learnable B-spline activations, enhancing climate feature discovery over linear methods.
Findings
KAN-SAE discovers 72% more features than linear baselines.
It reduces inter-feature redundancy by 20%.
It identifies interpretable climate phenomena like heatwaves and typhoons.
Abstract
Deep learning weather prediction models achieve remarkable predictive skill yet remain largely opaque: we know little about how they represent physical climate phenomena internally. Mechanistic interpretability through Sparse Autoencoders (SAEs) offers a principled route to decomposing these representations, but existing SAEs assume strictly linear feature superposition - a constraint ill-suited for the highly nonlinear atmospheric dynamics encoded in modern transformers. We introduce KAN-SAE, a sparse autoencoder whose encoder replaces the standard ReLU with learnable per-feature B-spline activations drawn from Kolmogorov-Arnold Networks (KANs), allowing each latent dimension to develop its own nonlinear gating profile. Applied to Sonny, KAN-SAE discovers 975 alive features (vs. 566 for a linear baseline, a 72% improvement) with 20% lower inter-feature redundancy and comparable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
