From Attribution to Abstention: Training-Free Attention-Based Auditing for Clinical Summarization

Qianqi Yan; Huy Nguyen; Sumana Srivatsa; Hari Bandi; Xin Eric Wang; Krishnaram Kenthapadi

arXiv:2601.16397·cs.CL·April 21, 2026

From Attribution to Abstention: Training-Free Attention-Based Auditing for Clinical Summarization

Qianqi Yan, Huy Nguyen, Sumana Srivatsa, Hari Bandi, Xin Eric Wang, Krishnaram Kenthapadi

PDF

TL;DR

ClinTrace is a training-free, attention-based auditing framework for clinical summarization that provides source attribution and hallucination detection without additional training or inference costs.

Contribution

It introduces a novel, training-free method leveraging decoder attention weights for source attribution and hallucination detection in clinical summarization tasks.

Findings

01

Achieves over 92% text F1 in source attribution on radiology reports.

02

Groundedness scores reach 0.77 AUROC for hallucination detection.

03

Abstention mechanism improves faithfulness from 61.7% to 72.6%.

Abstract

Deploying multimodal large language models (MLLMs) for clinical summarization demands not only fluent generation but also transparency about where each statement originates-and a mechanism to flag when statements lack evidential support. We present ClinTrace, a training-free framework that extracts two clinically useful signals from the decoder attention weights that every transformer-based MLLM already produces during generation: (i) fine-grained source attributions linking each output sentence to supporting text spans or images, and (ii) per-sentence groundedness scores that identify poorly supported claims as candidate hallucinations. Both signals are derived from the same attention tensors in a single pass, requiring no retraining, no auxiliary models, and no additional inference cost. We evaluate on two clinical summarization tasks: doctor-patient dialogue summarization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.