The Persian Rug: solving toy models of superposition using large-scale symmetries
Aditya Cowsik, Kfir Dolev, Alex Infanger

TL;DR
This paper provides a detailed analysis of a sparse autoencoder's learned algorithm in high-dimensional, permutation-symmetric data, revealing large-scale symmetry effects and introducing a new interpretability technique, with implications for neural network understanding.
Contribution
It offers an explicit analytical description of the autoencoder's behavior under symmetry, demonstrating near-optimal performance and linking learned weights to fractal structures.
Findings
Large models learn symmetry-sensitive algorithms based on weight statistics.
The loss function becomes analytically tractable at high sparsity.
Artificial weights exhibit fractal structures resembling Persian rugs.
Abstract
We present a complete mechanistic description of the algorithm learned by a minimal non-linear sparse data autoencoder in the limit of large input dimension. The model, originally presented in arXiv:2209.10652, compresses sparse data vectors through a linear layer and decompresses using another linear layer followed by a ReLU activation. We notice that when the data is permutation symmetric (no input feature is privileged) large models reliably learn an algorithm that is sensitive to individual weights only through their large-scale statistics. For these models, the loss function becomes analytically tractable. Using this understanding, we give the explicit scalings of the loss at high sparsity, and show that the model is near-optimal among recently proposed architectures. In particular, changing or adding to the activation function any elementwise or filtering operation can at best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCellular Automata and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer
