Semi-Supervised Histology Classification using Deep Multiple Instance Learning and Contrastive Predictive Coding
Ming Y. Lu, Richard J. Chen, Jingwen Wang, Debora Dillon, Faisal, Mahmood

TL;DR
This paper introduces a semi-supervised method combining contrastive predictive coding and attention-based multiple instance learning to improve histology slide classification, achieving state-of-the-art accuracy with limited labeled data.
Contribution
The paper presents a novel two-stage semi-supervised pipeline that enhances histology classification by integrating self-supervised feature learning with MIL, addressing data scarcity issues.
Findings
Achieved 95% mean validation accuracy on breast cancer histology classification.
State-of-the-art performance with an AUC of 0.968 across multiple splits.
CPC features outperform simple transfer learning in MIL context.
Abstract
Convolutional neural networks can be trained to perform histology slide classification using weak annotations with multiple instance learning (MIL). However, given the paucity of labeled histology data, direct application of MIL can easily suffer from overfitting and the network is unable to learn rich feature representations due to the weak supervisory signal. We propose to overcome such limitations with a two-stage semi-supervised approach that combines the power of data-efficient self-supervised feature learning via contrastive predictive coding (CPC) and the interpretability and flexibility of regularized attention-based MIL. We apply our two-stage CPC + MIL semi-supervised pipeline to the binary classification of breast cancer histology images. Across five random splits, we report state-of-the-art performance with a mean validation accuracy of 95% and an area under the ROC curve of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Digital Imaging for Blood Diseases · Colorectal Cancer Screening and Detection
MethodsInterpretability · InfoNCE · Contrastive Predictive Coding
