PRISM: A Geometric Risk Bound that Decomposes Drift into Scale, Shape, and Head
Chieh-Yen Lin, Shao-Hua Sun

TL;DR
PRISM introduces a geometric risk bound that decomposes drift in large language models into scale, shape, and head divergence, enabling targeted diagnostics and regularization.
Contribution
It provides a novel, closed-form upper bound on risk gap that links representation drift to specific failure modes and guides remediation strategies.
Findings
PRISM achieves high correlation in ranking model variants across benchmarks.
The shape regularizer outperforms experience replay in mitigating downstream forgetting.
Decomposition into axes enables targeted model diagnostics and improvements.
Abstract
Comparing post-training LLM variants, such as quantized, LoRA-adapted, and distilled models, requires a diagnostic that identifies how a variant has drifted, not only whether it has degraded. Existing similarity scores such as CKA and SVCCA can flag degradation, but they do not directly link representation drift to risk or mechanism. We propose PRISM, Proxy Risk Inference via Structural Mapping, which exploits the linear output head of LLMs and the empirically near-isometric structure of their backbones to derive a closed-form upper bound on the cross-entropy risk gap between a target model and a post-training variant. The bound is calibrated for variant ranking and decomposes drift into three independently measurable axes: scale mismatch, shape mismatch, and head divergence. Each axis corresponds to a distinct failure mode, including shape distortion under low-bit quantization, scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
