Rep-GLS: Report-Guided Generalized Label Smoothing for Robust Disease Detection

Kunyu Zhang; Fukang Ge; Binyang Wang; Yingke Chen; Kazuma Kobayashi; Lin Gu; Jinhao Bi; Yingying Zhu

arXiv:2508.02495·cs.LG·November 21, 2025

Rep-GLS: Report-Guided Generalized Label Smoothing for Robust Disease Detection

Kunyu Zhang, Fukang Ge, Binyang Wang, Yingke Chen, Kazuma Kobayashi, Lin Gu, Jinhao Bi, Yingying Zhu

PDF

TL;DR

This paper introduces Rep-GLS, a framework that uses medical report-derived uncertainty expressions and large language models to improve disease detection accuracy by incorporating expert skepticism into training.

Contribution

It proposes a novel report-guided generalized label smoothing method leveraging LLMs to utilize uncertainty expressions for better medical image classification.

Findings

01

Outperforms state-of-the-art disease detection methods

02

Establishes a new uncertainty-aware benchmark for medical diagnosis

03

Demonstrates the effectiveness of uncertainty-based supervision signals

Abstract

Unlike nature image classification where groundtruth label is explicit and of no doubt, physicians commonly interpret medical image conditioned on certainty like using phrase "probable" or "likely". Existing medical image datasets either simply overlooked the nuance and polarise into binary label. Here, we propose a novel framework that leverages a Large Language Model (LLM) to directly mine medical reports to utilise the uncertainty relevant expression for supervision signal. At first, we collect uncertainty keywords from medical reports. Then, we use Qwen-3 4B to identify the textual uncertainty and map them into an adaptive Generalized Label Smoothing (GLS) rate. This rate allows our model to treat uncertain labels not as errors, but as informative signals, effectively incorporating expert skepticism into the training process. We establish a new clinical expert uncertainty-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.