TL;DR
DeepSMILE introduces a contrastive self-supervised pre-training approach combined with heterogeneity-aware multiple instance learning to improve genomic label classification from whole-slide images in colorectal and breast cancer, reducing the need for manual annotations.
Contribution
The paper presents a novel combination of contrastive self-supervised learning and heterogeneity-aware multiple instance learning for histopathology image analysis, enhancing classification accuracy with fewer labels.
Findings
Contrastive self-supervised pre-training improves MSI AUROC from 0.77 to 0.87.
DeepSMILE achieves HRD AUROC of 0.81 without manual annotations.
Methods reach baseline performance using only 40% of labeled data.
Abstract
We propose a Deep learning-based weak label learning method for analyzing whole slide images (WSIs) of Hematoxylin and Eosin (H&E) stained tumor tissue not requiring pixel-level or tile-level annotations using Self-supervised pre-training and heterogeneity-aware deep Multiple Instance LEarning (DeepSMILE). We apply DeepSMILE to the task of Homologous recombination deficiency (HRD) and microsatellite instability (MSI) prediction. We utilize contrastive self-supervised learning to pre-train a feature extractor on histopathology tiles of cancer tissue. Additionally, we use variability-aware deep multiple instance learning to learn the tile feature aggregation function while modeling tumor heterogeneity. For MSI prediction in a tumor-annotated and color normalized subset of TCGA-CRC (n=360 patients), contrastive self-supervised learning improves the tile supervision baseline from 0.77 to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
