Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE

Minjong Cheon

arXiv:2605.17493·cs.LG·May 19, 2026

Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE

Minjong Cheon

PDF

TL;DR

This paper introduces KAN-SAE, a nonlinear autoencoder that improves interpretability of deep learning weather models by discovering more detailed and distinct climate features without supervision.

Contribution

The paper proposes KAN-SAE, a novel nonlinear autoencoder with learnable B-spline activations, enhancing climate feature discovery over linear methods.

Findings

01

KAN-SAE discovers 72% more features than linear baselines.

02

It reduces inter-feature redundancy by 20%.

03

It identifies interpretable climate phenomena like heatwaves and typhoons.

Abstract

Deep learning weather prediction models achieve remarkable predictive skill yet remain largely opaque: we know little about how they represent physical climate phenomena internally. Mechanistic interpretability through Sparse Autoencoders (SAEs) offers a principled route to decomposing these representations, but existing SAEs assume strictly linear feature superposition - a constraint ill-suited for the highly nonlinear atmospheric dynamics encoded in modern transformers. We introduce KAN-SAE, a sparse autoencoder whose encoder replaces the standard ReLU with learnable per-feature B-spline activations drawn from Kolmogorov-Arnold Networks (KANs), allowing each latent dimension to develop its own nonlinear gating profile. Applied to Sonny, KAN-SAE discovers 975 alive features (vs. 566 for a linear baseline, a 72% improvement) with 20% lower inter-feature redundancy and comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.