From Attribution to Abstention: Training-Free Attention-Based Auditing for Clinical Summarization
Qianqi Yan, Huy Nguyen, Sumana Srivatsa, Hari Bandi, Xin Eric Wang, Krishnaram Kenthapadi

TL;DR
ClinTrace is a training-free, attention-based auditing framework for clinical summarization that provides source attribution and hallucination detection without additional training or inference costs.
Contribution
It introduces a novel, training-free method leveraging decoder attention weights for source attribution and hallucination detection in clinical summarization tasks.
Findings
Achieves over 92% text F1 in source attribution on radiology reports.
Groundedness scores reach 0.77 AUROC for hallucination detection.
Abstention mechanism improves faithfulness from 61.7% to 72.6%.
Abstract
Deploying multimodal large language models (MLLMs) for clinical summarization demands not only fluent generation but also transparency about where each statement originates-and a mechanism to flag when statements lack evidential support. We present ClinTrace, a training-free framework that extracts two clinically useful signals from the decoder attention weights that every transformer-based MLLM already produces during generation: (i) fine-grained source attributions linking each output sentence to supporting text spans or images, and (ii) per-sentence groundedness scores that identify poorly supported claims as candidate hallucinations. Both signals are derived from the same attention tensors in a single pass, requiring no retraining, no auxiliary models, and no additional inference cost. We evaluate on two clinical summarization tasks: doctor-patient dialogue summarization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
