Prism-$\Delta$: Differential Subspace Steering for Prompt Highlighting in Large Language Models
Yuyao Ge, Shenghua Liu, Yiwei Wang, Tianyu Liu, Baolong Bi, Lingrui Mei, Jiayu Yao, Jiafeng Guo, Xueqi Cheng

TL;DR
PRISM-Δ introduces a novel differential subspace steering method for large language models that improves prompt highlighting by effectively capturing discriminative signals, leading to better performance with lower fluency costs.
Contribution
It proposes PRISM-Δ, a new framework that decomposes cross-covariance matrices to enhance prompt steering, outperforming existing methods across multiple benchmarks and models.
Findings
Matches or exceeds existing methods on 19 of 20 configurations.
Achieves up to +10.6% relative gain in performance.
Halves the fluency cost of steering.
Abstract
Prompt highlighting steers a large language model to prioritize user-specified text spans during generation. A key challenge is extracting steering directions that capture the difference between relevant and irrelevant contexts, rather than shared structural patterns common to both. We propose PRISM- (Projection-based Relevance-Informed Steering Method), which decomposes the difference between positive and negative cross-covariance matrices to maximize discriminative energy while eliminating shared directions. Each attention head receives a continuous softplus importance weight, letting weak-but-useful heads contribute at reduced strength. The framework extends naturally to Value representations, capturing content-channel signal that Key-only methods leave unused. Across four benchmarks and five models, PRISM- matches or exceeds the best existing method on 19 of 20…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Authorship Attribution and Profiling
