VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels
Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen,, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P., Lungren, Pranav Rajpurkar

TL;DR
This paper introduces VisualCheXbert, a model that improves the alignment of radiology report labels with actual image labels, addressing discrepancies caused by radiologist labeling inconsistencies, to enhance medical image interpretation.
Contribution
We propose VisualCheXbert, a novel method using a biomedically-pretrained BERT to better align report-derived labels with image labels, outperforming existing report labelers.
Findings
VisualCheXbert achieves higher F1 scores than existing labelers.
It better aligns with radiologists' image labels than report labels.
Improves supervision quality for medical image analysis models.
Abstract
Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods to produce labels from radiology reports that have better agreement with radiologists labeling images. Our best performing method, called VisualCheXbert, uses a biomedically-pretrained BERT model to directly map from a radiology report to the image labels, with a supervisory signal determined by a computer vision model trained to detect medical conditions from chest X-ray images. We find that VisualCheXbert outperforms an approach using an existing radiology report labeler by an average F1…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Artificial Intelligence in Healthcare and Education · AI in cancer detection
MethodsLinear Layer · Linear Warmup With Linear Decay · Softmax · Adam · Multi-Head Attention · Residual Connection · Dropout · WordPiece · Attention Dropout · Weight Decay
