PRISM: Deriving a White-Box Transformer as a Signal-Noise Decomposition Operator via Maximum Coding Rate Reduction

Dongchen Huang

arXiv:2601.15540·cs.LG·February 4, 2026

Open Access

TL;DR

This paper introduces Prism, a white-box Transformer architecture based on signal-noise decomposition principles, which enhances interpretability and disentangles semantic and syntactic information through geometric inductive biases and spectral specialization.

Contribution

Prism is a novel, theoretically grounded Transformer design that employs maximum coding rate reduction and frequency separation to achieve interpretability and functional disentanglement.

Findings

01. Prism's attention heads specialize into spectral regimes capturing different information types.

02. Empirical evidence shows Prism induces unsupervised disentanglement of semantic and syntactic features.

03. Prism maintains information flow and isolates pathology in large language models.

Abstract

Deep learning models, particularly Transformers, are often criticized as "black boxes" that lack interpretability. We propose Prism, a white-box attention-based architecture derived from the principles of Maximizing Coding Rate Reduction ($\mathrm{MCR}^{2}$). By modeling the attention mechanism as a gradient ascent process on a distinct signal-noise manifold, we introduce a specific irrational frequency separation ($\pi$-RoPE) to enforce incoherence between signal (semantic) and noise (syntactic) subspaces. We show empirical evidence that these geometric inductive biases alone can induce unsupervised functional disentanglement. Prism spontaneously specializes its attention heads into spectrally distinct regimes: low-frequency heads capture long-range causal dependencies (signal), while high-frequency heads handle local syntactic constraints and structural artifacts. To provide a theoretical…
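
The coding rate reduction objective named in the abstract can be illustrated with a minimal NumPy sketch. It uses the standard definition from the MCR² literature, R(Z, ε) = ½ log det(I + d/(n ε²) Z Zᵀ), and is not taken from the Prism paper itself. The function names, the value of `eps`, and the toy "signal" and "noise" groups are assumptions made for illustration only. The sketch shows why the objective favors incoherent subspaces: the reduction is positive when the two groups occupy orthogonal directions and vanishes when they share the same ones.

```python
import numpy as np

def coding_rate(Z, eps=0.5):
    """Standard coding rate R(Z, eps) = 1/2 logdet(I + d/(n eps^2) Z Z^T).

    Z has shape (d, n): n feature vectors of dimension d, one per column.
    """
    d, n = Z.shape
    scaled_cov = (d / (n * eps**2)) * (Z @ Z.T)
    _, logdet = np.linalg.slogdet(np.eye(d) + scaled_cov)
    return 0.5 * logdet

def coding_rate_reduction(Z, labels, eps=0.5):
    """Delta R = R(Z) - sum_k (n_k / n) R(Z_k) for hard group labels."""
    _, n = Z.shape
    within = 0.0
    for k in np.unique(labels):
        Zk = Z[:, labels == k]
        within += (Zk.shape[1] / n) * coding_rate(Zk, eps)
    return coding_rate(Z, eps) - within

# Hypothetical toy data: four "signal" and four "noise" vectors in R^4.
e = np.eye(4)
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])

# Incoherent case: signal spans (e1, e2), noise spans (e3, e4).
Z_orthogonal = np.stack([e[0], e[1], e[0], e[1], e[2], e[3], e[2], e[3]], axis=1)

# Coherent case: both groups live in the same span (e1, e2).
Z_overlapping = np.stack([e[0], e[1], e[0], e[1], e[0], e[1], e[0], e[1]], axis=1)

delta_orthogonal = coding_rate_reduction(Z_orthogonal, labels)
delta_overlapping = coding_rate_reduction(Z_overlapping, labels)

print(f"orthogonal groups:  {delta_orthogonal:.4f}")
print(f"overlapping groups: {delta_overlapping:.4f}")
```

In this toy setting, maximizing the reduction pushes the two groups toward mutually orthogonal subspaces, which is the geometric property the abstract attributes to the signal-noise separation. How Prism realizes this through attention and $\pi$-RoPE is not specified in the truncated abstract and is not modeled here.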

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception · Neural Networks and Reservoir Computing