Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports
Hong-Yu Zhou, Xiaoyu Chen, Yinghao Zhang, Ruibang Luo, Liansheng Wang,, Yizhou Yu

TL;DR
This paper introduces REFERS, a cross-supervised learning method that leverages free-text radiology reports to pre-train vision transformers, outperforming traditional supervised and self-supervised approaches in radiograph analysis.
Contribution
REFERS is a novel cross-supervised pre-training approach that uses radiology reports as supervision signals, reducing reliance on labor-intensive annotations and surpassing existing methods.
Findings
Outperforms transfer and self-supervised methods on 4 X-ray datasets
Surpasses methods with structured labels and source domain supervision
Effective with extremely limited supervision
Abstract
Pre-training lays the foundation for recent successes in radiograph analysis supported by deep learning. It learns transferable image representations by conducting large-scale fully-supervised or self-supervised learning on a source domain. However, supervised pre-training requires a complex and labor intensive two-stage human-assisted annotation process while self-supervised learning cannot compete with the supervised paradigm. To tackle these issues, we propose a cross-supervised methodology named REviewing FreE-text Reports for Supervision (REFERS), which acquires free supervision signals from original radiology reports accompanying the radiographs. The proposed approach employs a vision transformer and is designed to learn joint representations from multiple views within every patient study. REFERS outperforms its transfer learning and self-supervised learning counterparts on 4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiology practices and education · Domain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dense Connections · Softmax · Vision Transformer
