High-Fidelity Pseudo-label Generation by Large Language Models for   Training Robust Radiology Report Classifiers

Brian Wong; Kaito Tanaka

arXiv:2505.01693·cs.CL·May 6, 2025

High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers

Brian Wong, Kaito Tanaka

PDF

Open Access

TL;DR

This paper presents DeBERTa-RAD, a two-stage framework that uses large language models for high-quality pseudo-labeling and knowledge distillation to train fast, accurate radiology report classifiers, significantly improving over existing methods.

Contribution

The paper introduces a novel two-stage approach combining LLM pseudo-labeling with DeBERTa-based knowledge distillation for efficient and accurate chest X-ray report classification.

Findings

01

Achieves a state-of-the-art Macro F1 score of 0.9120 on MIMIC-500.

02

Outperforms rule-based systems, fine-tuned transformers, and direct LLM inference.

03

Demonstrates strong handling of uncertain findings in reports.

Abstract

Automated labeling of chest X-ray reports is essential for enabling downstream tasks such as training image-based diagnostic models, population health studies, and clinical decision support. However, the high variability, complexity, and prevalence of negation and uncertainty in these free-text reports pose significant challenges for traditional Natural Language Processing methods. While large language models (LLMs) demonstrate strong text understanding, their direct application for large-scale, efficient labeling is limited by computational cost and speed. This paper introduces DeBERTa-RAD, a novel two-stage framework that combines the power of state-of-the-art LLM pseudo-labeling with efficient DeBERTa-based knowledge distillation for accurate and fast chest X-ray report labeling. We leverage an advanced LLM to generate high-quality pseudo-labels, including certainty statuses, for a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging

MethodsKnowledge Distillation · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings