TL;DR
This study demonstrates that self-supervised learning pre-training on large-scale natural images significantly improves medical image analysis performance, surpassing traditional supervised methods and enhancing diagnostic accuracy in chest radiograph AI models.
Contribution
The paper shows that SSL pre-training on unlabeled natural images can outperform supervised pre-training on both natural and medical images for chest radiograph analysis.
Findings
SSL pre-training outperforms ImageNet-based pre-training (P<0.001) across datasets.
SSL pre-training sometimes exceeds supervised pre-training on medical images.
Pre-training strategy critically impacts AI diagnostic accuracy in medical imaging.
Abstract
Pre-training datasets, like ImageNet, have become the gold standard in medical image analysis. However, the emergence of self-supervised learning (SSL), which leverages unlabeled data to learn robust features, presents an opportunity to bypass the intensive labeling process. In this study, we explored if SSL for pre-training on non-medical images can be applied to chest radiographs and how it compares to supervised pre-training on non-medical images and on medical images. We utilized a vision transformer and initialized its weights based on (i) SSL pre-training on natural images (DINOv2), (ii) SL pre-training on natural images (ImageNet dataset), and (iii) SL pre-training on chest radiographs from the MIMIC-CXR database. We tested our approach on over 800,000 chest radiographs from six large global datasets, diagnosing more than 20 different imaging findings. Our SSL pre-training on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · Residual Connection · Dense Connections · Vision Transformer
