Semantic Consistency-Based Uncertainty Quantification for Factuality in   Radiology Report Generation

Chenyu Wang; Weichao Zhou; Shantanu Ghosh; Kayhan Batmanghelich,; Wenchao Li

arXiv:2412.04606·cs.AI·March 18, 2025

Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation

Chenyu Wang, Weichao Zhou, Shantanu Ghosh, Kayhan Batmanghelich,, Wenchao Li

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a semantic consistency-based method for quantifying uncertainty in radiology report generation, significantly improving factual accuracy and hallucination detection without altering existing models.

Contribution

The proposed plug-and-play framework effectively detects hallucinations and enhances report factuality without requiring model modifications or access to internal states.

Findings

01

Improves factuality scores by 10% by rejecting 20% of reports.

02

Achieves 82.9% success rate in flagging lowest-precision sentences.

03

Effectively detects hallucinations in radiology reports.

Abstract

Radiology report generation (RRG) has shown great potential in assisting radiologists by automating the labor-intensive task of report writing. While recent advancements have improved the quality and coherence of generated reports, ensuring their factual correctness remains a critical challenge. Although generative medical Vision Large Language Models (VLLMs) have been proposed to address this issue, these models are prone to hallucinations and can produce inaccurate diagnostic information. To address these concerns, we introduce a novel Semantic Consistency-Based Uncertainty Quantification framework that provides both report-level and sentence-level uncertainties. Unlike existing approaches, our method does not require modifications to the underlying model or access to its inner state, such as output token logits, thus serving as a plug-and-play module that can be seamlessly integrated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BU-DEPEND-Lab/SCUQ-RRG
noneOfficial

Videos

Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation· underline

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Biomedical Text Mining and Ontologies