High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers
Brian Wong, Kaito Tanaka

TL;DR
This paper presents DeBERTa-RAD, a two-stage framework that uses large language models for high-quality pseudo-labeling and knowledge distillation to train fast, accurate radiology report classifiers, significantly improving over existing methods.
Contribution
The paper introduces a novel two-stage approach combining LLM pseudo-labeling with DeBERTa-based knowledge distillation for efficient and accurate chest X-ray report classification.
Findings
Achieves a state-of-the-art Macro F1 score of 0.9120 on MIMIC-500.
Outperforms rule-based systems, fine-tuned transformers, and direct LLM inference.
Demonstrates strong handling of uncertain findings in reports.
Abstract
Automated labeling of chest X-ray reports is essential for enabling downstream tasks such as training image-based diagnostic models, population health studies, and clinical decision support. However, the high variability, complexity, and prevalence of negation and uncertainty in these free-text reports pose significant challenges for traditional Natural Language Processing methods. While large language models (LLMs) demonstrate strong text understanding, their direct application for large-scale, efficient labeling is limited by computational cost and speed. This paper introduces DeBERTa-RAD, a novel two-stage framework that combines the power of state-of-the-art LLM pseudo-labeling with efficient DeBERTa-based knowledge distillation for accurate and fast chest X-ray report labeling. We leverage an advanced LLM to generate high-quality pseudo-labels, including certainty statuses, for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging
MethodsKnowledge Distillation · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
