Generalized Radiograph Representation Learning via Cross-supervision   between Images and Free-text Radiology Reports

Hong-Yu Zhou; Xiaoyu Chen; Yinghao Zhang; Ruibang Luo; Liansheng Wang,; Yizhou Yu

arXiv:2111.03452·eess.IV·January 28, 2022

Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports

Hong-Yu Zhou, Xiaoyu Chen, Yinghao Zhang, Ruibang Luo, Liansheng Wang,, Yizhou Yu

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces REFERS, a cross-supervised learning method that leverages free-text radiology reports to pre-train vision transformers, outperforming traditional supervised and self-supervised approaches in radiograph analysis.

Contribution

REFERS is a novel cross-supervised pre-training approach that uses radiology reports as supervision signals, reducing reliance on labor-intensive annotations and surpassing existing methods.

Findings

01

Outperforms transfer and self-supervised methods on 4 X-ray datasets

02

Surpasses methods with structured labels and source domain supervision

03

Effective with extremely limited supervision

Abstract

Pre-training lays the foundation for recent successes in radiograph analysis supported by deep learning. It learns transferable image representations by conducting large-scale fully-supervised or self-supervised learning on a source domain. However, supervised pre-training requires a complex and labor intensive two-stage human-assisted annotation process while self-supervised learning cannot compete with the supervised paradigm. To tackle these issues, we propose a cross-supervised methodology named REviewing FreE-text Reports for Supervision (REFERS), which acquires free supervision signals from original radiology reports accompanying the radiographs. The proposed approach employs a vision transformer and is designed to learn joint representations from multiple views within every patient study. REFERS outperforms its transfer learning and self-supervised learning counterparts on 4…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

funnyzhou/refers
pytorchOfficial

Models

🤗
youngzhou12/REFERS
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Radiology practices and education · Domain Adaptation and Few-Shot Learning

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dense Connections · Softmax · Vision Transformer