DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD   classification directly from H&E whole-slide images in colorectal and breast   cancer

Yoni Schirris; Efstratios Gavves; Iris Nederlof; Hugo Mark Horlings,; Jonas Teuwen

arXiv:2107.09405·eess.IV·June 29, 2023

DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD classification directly from H&E whole-slide images in colorectal and breast cancer

Yoni Schirris, Efstratios Gavves, Iris Nederlof, Hugo Mark Horlings,, Jonas Teuwen

PDF

1 Repo

TL;DR

DeepSMILE introduces a contrastive self-supervised pre-training approach combined with heterogeneity-aware multiple instance learning to improve genomic label classification from whole-slide images in colorectal and breast cancer, reducing the need for manual annotations.

Contribution

The paper presents a novel combination of contrastive self-supervised learning and heterogeneity-aware multiple instance learning for histopathology image analysis, enhancing classification accuracy with fewer labels.

Findings

01

Contrastive self-supervised pre-training improves MSI AUROC from 0.77 to 0.87.

02

DeepSMILE achieves HRD AUROC of 0.81 without manual annotations.

03

Methods reach baseline performance using only 40% of labeled data.

Abstract

We propose a Deep learning-based weak label learning method for analyzing whole slide images (WSIs) of Hematoxylin and Eosin (H&E) stained tumor tissue not requiring pixel-level or tile-level annotations using Self-supervised pre-training and heterogeneity-aware deep Multiple Instance LEarning (DeepSMILE). We apply DeepSMILE to the task of Homologous recombination deficiency (HRD) and microsatellite instability (MSI) prediction. We utilize contrastive self-supervised learning to pre-train a feature extractor on histopathology tiles of cancer tissue. Additionally, we use variability-aware deep multiple instance learning to learn the tile feature aggregation function while modeling tumor heterogeneity. For MSI prediction in a tumor-annotated and color normalized subset of TCGA-CRC (n=360 patients), contrastive self-supervised learning improves the tile supervision baseline from 0.77 to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nki-ai/dlup-lightning-mil
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.