PRISM: Deriving a White-Box Transformer as a Signal-Noise Decomposition Operator via Maximum Coding Rate Reduction
Dongchen Huang

TL;DR
This paper introduces Prism, a white-box Transformer architecture based on signal-noise decomposition principles, which enhances interpretability and disentangles semantic and syntactic information through geometric inductive biases and spectral specialization.
Contribution
Prism is a novel, theoretically grounded Transformer design that employs maximum coding rate reduction and frequency separation to achieve interpretability and functional disentanglement.
Findings
Prism's attention heads specialize into spectral regimes capturing different information types.
Empirical evidence shows Prism induces unsupervised disentanglement of semantic and syntactic features.
Prism maintains information flow and isolates pathology in large language models.
Abstract
Deep learning models, particularly Transformers, are often criticized as "black boxes" and lack interpretability. We propose Prism, a white-box attention-based architecture derived from the principles of Maximizing Coding Rate Reduction (). By modeling the attention mechanism as a gradient ascent process on a distinct signal-noise manifold, we introduce a specific irrational frequency separation (-RoPE) to enforce incoherence between signal (semantic) and noise (syntactic) subspaces. We show empirical evidence that these geometric inductive biases can induce unsupervised functional disentanglement alone. Prism spontaneously specializes its attention heads into spectrally distinct regimes: low-frequency heads capturing long-range causal dependencies (signal) and high-frequency heads handling local syntactic constraints and structural artifacts. To provide a theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face Recognition and Perception · Neural Networks and Reservoir Computing
