Learning Evidence Highlighting for Frozen LLMs

Shaoang Li; Yanhang Shi; Yufei Li; Mingfu Liang; Xiaohan Wei; Yunchen Pu; Fei Tian; Chonglin Sun; Frank Shyu; Luke Simon; Sandeep Pandey; Xi Liu; Jian Li

arXiv:2604.22565·cs.CL·April 27, 2026

Learning Evidence Highlighting for Frozen LLMs

Shaoang Li, Yanhang Shi, Yufei Li, Mingfu Liang, Xiaohan Wei, Yunchen Pu, Fei Tian, Chonglin Sun, Frank Shyu, Luke Simon, Sandeep Pandey, Xi Liu, Jian Li

PDF

TL;DR

HiLight is a framework that highlights key evidence in long contexts for frozen LLMs, improving reasoning without altering the original input.

Contribution

It introduces a reinforcement learning-based emphasis actor that highlights evidence, enhancing reasoning performance across tasks without modifying the LLM.

Findings

01

Consistently improves performance over prompt-based baselines.

02

Transfers zero-shot to unseen Solver models.

03

Does not require evidence labels or input modification.

Abstract

Large Language Models (LLMs) can reason well, yet often miss decisive evidence when it is buried in long, noisy contexts. We introduce HiLight, an Evidence Emphasis framework that decouples evidence selection from reasoning for frozen LLM solvers. HiLight avoids compressing or rewriting the input, which can discard or distort evidence, by training a lightweight Emphasis Actor to insert minimal highlight tags around pivotal spans in the unaltered context. A frozen Solver then performs downstream reasoning on the emphasized input. We cast highlighting as a weakly supervised decision-making problem and optimize the Actor with reinforcement learning using only the Solver's task reward, requiring no evidence labels and no access to or modification of the Solver. Across sequential recommendation and long-context question answering, HiLight consistently improves performance over strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.