VisualCheXbert: Addressing the Discrepancy Between Radiology Report   Labels and Image Labels

Saahil Jain; Akshay Smit; Steven QH Truong; Chanh DT Nguyen,; Minh-Thanh Huynh; Mudit Jain; Victoria A. Young; Andrew Y. Ng; Matthew P.; Lungren; Pranav Rajpurkar

arXiv:2102.11467·eess.IV·March 16, 2021·5 cites

VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels

Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen,, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P., Lungren, Pranav Rajpurkar

PDF

Open Access 1 Repo

TL;DR

This paper introduces VisualCheXbert, a model that improves the alignment of radiology report labels with actual image labels, addressing discrepancies caused by radiologist labeling inconsistencies, to enhance medical image interpretation.

Contribution

We propose VisualCheXbert, a novel method using a biomedically-pretrained BERT to better align report-derived labels with image labels, outperforming existing report labelers.

Findings

01

VisualCheXbert achieves higher F1 scores than existing labelers.

02

It better aligns with radiologists' image labels than report labels.

03

Improves supervision quality for medical image analysis models.

Abstract

Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods to produce labels from radiology reports that have better agreement with radiologists labeling images. Our best performing method, called VisualCheXbert, uses a biomedically-pretrained BERT model to directly map from a radiology report to the image labels, with a supervisory signal determined by a computer vision model trained to detect medical conditions from chest X-ray images. We find that VisualCheXbert outperforms an approach using an existing radiology report labeler by an average F1…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stanfordmlgroup/VisualCheXbert
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Artificial Intelligence in Healthcare and Education · AI in cancer detection

MethodsLinear Layer · Linear Warmup With Linear Decay · Softmax · Adam · Multi-Head Attention · Residual Connection · Dropout · WordPiece · Attention Dropout · Weight Decay