Histopathological Image Classification based on Self-Supervised Vision   Transformer and Weak Labels

Ahmet Gokberk Gul; Oezdemir Cetin; Christoph Reich; Tim Prangemeier,; Nadine Flinner; Heinz Koeppl

arXiv:2210.09021·cs.CV·April 19, 2023

Histopathological Image Classification based on Self-Supervised Vision Transformer and Weak Labels

Ahmet Gokberk Gul, Oezdemir Cetin, Christoph Reich, Tim Prangemeier,, Nadine Flinner, Heinz Koeppl

PDF

1 Repo

TL;DR

This paper introduces Self-ViT-MIL, a novel self-supervised Vision Transformer-based method for classifying and localizing cancer in whole slide images using only slide-level labels, improving accuracy over existing methods.

Contribution

It is the first to integrate self-supervised ViTs into MIL for WSI analysis, eliminating the need for pixel-level annotations and enhancing performance.

Findings

01

Outperforms existing MIL-based methods in accuracy

02

Achieves higher AUC on Camelyon16 dataset

03

Demonstrates effective localization of cancerous regions

Abstract

Whole Slide Image (WSI) analysis is a powerful method to facilitate the diagnosis of cancer in tissue samples. Automating this diagnosis poses various issues, most notably caused by the immense image resolution and limited annotations. WSIs commonly exhibit resolutions of 100Kx100K pixels. Annotating cancerous areas in WSIs on the pixel level is prohibitively labor-intensive and requires a high level of expert knowledge. Multiple instance learning (MIL) alleviates the need for expensive pixel-level annotations. In MIL, learning is performed on slide-level labels, in which a pathologist provides information about whether a slide includes cancerous tissue. Here, we propose Self-ViT-MIL, a novel approach for classifying and localizing cancerous areas based on slide-level annotations, eliminating the need for pixel-wise annotated training data. Self-ViT- MIL is pre-trained in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gokberkgul/self-learning-transformer-mil
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Dense Connections · Linear Layer · Multi-Head Attention · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Layer Normalization · Residual Connection · Dropout