Self-supervised Learning from 100 Million Medical Images
Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo,, Dominik Neumann, Pragneshkumar Patel, R.S. Vishwanath, James M. Balter, Yue, Cao, Sasa Grbic, Dorin Comaniciu

TL;DR
This paper introduces a self-supervised learning approach using contrastive learning and feature clustering on over 100 million medical images, significantly improving accuracy, training speed, and robustness for medical image assessment tasks.
Contribution
It presents a novel self-supervised learning method leveraging large-scale medical image datasets to enhance model performance without extensive annotations.
Findings
Achieved 3-7% AUC improvement in abnormality detection.
Reduced training convergence time by up to 85%.
Enhanced robustness to image augmentations.
Abstract
Building accurate and robust artificial intelligence systems for medical image assessment requires not only the research and design of advanced deep learning models but also the creation of large and curated sets of annotated training examples. Constructing such datasets, however, is often very costly -- due to the complex nature of annotation tasks and the high level of expertise required for the interpretation of medical images (e.g., expert radiologists). To counter this limitation, we propose a method for self-supervised learning of rich image features based on contrastive learning and online feature clustering. For this purpose we leverage large training datasets of over 100,000,000 medical images of various modalities, including radiography, computed tomography (CT), magnetic resonance (MR) imaging and ultrasonography. We propose to use these features to guide model training in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment · COVID-19 diagnosis using AI
MethodsContrastive Learning
