Enhancing Network Initialization for Medical AI Models Using   Large-Scale, Unlabeled Natural Images

Soroosh Tayebi Arasteh; Leo Misera; Jakob Nikolas Kather; Daniel; Truhn; Sven Nebelung

arXiv:2308.07688·eess.IV·February 9, 2024

Enhancing Network Initialization for Medical AI Models Using Large-Scale, Unlabeled Natural Images

Soroosh Tayebi Arasteh, Leo Misera, Jakob Nikolas Kather, Daniel, Truhn, Sven Nebelung

PDF

2 Repos

TL;DR

This study demonstrates that self-supervised learning pre-training on large-scale natural images significantly improves medical image analysis performance, surpassing traditional supervised methods and enhancing diagnostic accuracy in chest radiograph AI models.

Contribution

The paper shows that SSL pre-training on unlabeled natural images can outperform supervised pre-training on both natural and medical images for chest radiograph analysis.

Findings

01

SSL pre-training outperforms ImageNet-based pre-training (P<0.001) across datasets.

02

SSL pre-training sometimes exceeds supervised pre-training on medical images.

03

Pre-training strategy critically impacts AI diagnostic accuracy in medical imaging.

Abstract

Pre-training datasets, like ImageNet, have become the gold standard in medical image analysis. However, the emergence of self-supervised learning (SSL), which leverages unlabeled data to learn robust features, presents an opportunity to bypass the intensive labeling process. In this study, we explored if SSL for pre-training on non-medical images can be applied to chest radiographs and how it compares to supervised pre-training on non-medical images and on medical images. We utilized a vision transformer and initialized its weights based on (i) SSL pre-training on natural images (DINOv2), (ii) SL pre-training on natural images (ImageNet dataset), and (iii) SL pre-training on chest radiographs from the MIMIC-CXR database. We tested our approach on over 800,000 chest radiographs from six large global datasets, diagnosing more than 20 different imaging findings. Our SSL pre-training on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · Residual Connection · Dense Connections · Vision Transformer