Self-Supervision Closes the Gap Between Weak and Strong Supervision in Histology
Olivier Dehaene, Axel Camara, Olivier Moindrot, Axel de Lavergne,, Pierre Courtiol

TL;DR
This paper demonstrates that self-supervised learning with MoCo v2 on histology images significantly improves weakly-supervised models, narrowing the performance gap with strongly-supervised models and revealing meaningful tissue structure embeddings.
Contribution
It introduces in-domain self-supervised feature extraction for histology, outperforming ImageNet features and enhancing weakly-supervised model accuracy.
Findings
Improved Camelyon16 AUC from 91.4% to 98.7%.
Close to strongly-supervised model performance at 99.3% AUC.
Biologically meaningful tissue structure separation in embedding space.
Abstract
One of the biggest challenges for applying machine learning to histopathology is weak supervision: whole-slide images have billions of pixels yet often only one global label. The state of the art therefore relies on strongly-supervised model training using additional local annotations from domain experts. However, in the absence of detailed annotations, most weakly-supervised approaches depend on a frozen feature extractor pre-trained on ImageNet. We identify this as a key weakness and propose to train an in-domain feature extractor on histology images using MoCo v2, a recent self-supervised learning algorithm. Experimental results on Camelyon16 and TCGA show that the proposed extractor greatly outperforms its ImageNet counterpart. In particular, our results improve the weakly-supervised state of the art on Camelyon16 from 91.4% to 98.7% AUC, thereby closing the gap with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvances in Oncology and Radiotherapy · Innovations in Medical Education · Health and Medical Research Impacts
MethodsDense Connections · Feedforward Network · Random Gaussian Blur · Batch Normalization · MoCo v2 · InfoNCE · Momentum Contrast
