Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images
Mehdi Cherti, Jenia Jitsev

TL;DR
This study investigates how increasing pre-training scale affects transfer learning performance across in-domain and out-of-domain natural and medical X-ray image tasks, highlighting the importance of large-scale natural image pre-training for medical applications.
Contribution
It is the first to combine large, openly available medical X-ray datasets with natural image datasets to analyze transfer learning at scale across domains.
Findings
Larger pre-training scale improves intra-domain transfer for natural and medical images.
Pre-training on large natural datasets can match or outperform medical datasets for X-ray transfer.
Scaling pre-training reduces dependency on domain-specific data for high-quality out-of-domain transfer.
Abstract
Increasing model, data and compute budget scale in the pre-training has been shown to strongly improve model generalization and transfer learning in vast line of work done in language modeling and natural image recognition. However, most studies on the positive effect of larger scale were done in scope of in-domain setting, with source and target data being in close proximity. To study effect of larger scale for both in-domain and out-of-domain setting when performing full and few-shot transfer, we combine here for the first time large, openly available medical X-Ray chest imaging datasets to reach a scale for medical imaging domain comparable to ImageNet-1k, routinely used for pre-training in natural image domain. We then conduct supervised pre-training, while varying network size and source data scale and domain, being either large natural (ImageNet-1k/21k) or large medical chest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Radiology practices and education
MethodsWeight Standardization · Group Normalization · Bitcoin Customer Service Number +1-833-534-1729
