White-Box Transformers via Sparse Rate Reduction
Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, and Shengbang Tong, Benjamin D. Haeffele, Yi Ma

TL;DR
This paper introduces a mathematically interpretable white-box transformer model based on sparse rate reduction, demonstrating its ability to learn effective representations on large-scale vision datasets with performance comparable to traditional transformers.
Contribution
It derives a transparent, interpretable transformer architecture from the sparse rate reduction objective, linking each component to an optimization step, and validates its effectiveness on real-world datasets.
Findings
The white-box transformer compresses and sparsifies data representations effectively.
It achieves near state-of-the-art performance on ImageNet.
The architecture is fully interpretable and grounded in optimization theory.
Abstract
In this paper, we contend that the objective of representation learning is to compress and transform the distribution of the data, say sets of tokens, towards a mixture of low-dimensional Gaussian distributions supported on incoherent subspaces. The quality of the final representation can be measured by a unified objective function called sparse rate reduction. From this perspective, popular deep networks such as transformers can be naturally viewed as realizing iterative schemes to optimize this objective incrementally. Particularly, we show that the standard transformer block can be derived from alternating optimization on complementary parts of this objective: the multi-head self-attention operator can be viewed as a gradient descent step to compress the token sets by minimizing their lossy coding rate, and the subsequent multi-layer perceptron can be viewed as attempting to sparsify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
