Virchow: A Million-Slide Digital Pathology Foundation Model

Eugene Vorontsov; Alican Bozkurt; Adam Casson; George Shaikovski,; Michal Zelechowski; Siqi Liu; Kristen Severson; Eric Zimmermann; James Hall,; Neil Tenenholtz; Nicolo Fusi; Philippe Mathieu; Alexander van Eck; Donghun; Lee; Julian Viret; Eric Robert; Yi Kan Wang; Jeremy D. Kunz; Matthew C. H.; Lee; Jan Bernhard; Ran A. Godrich; Gerard Oakley; Ewan Millar; Matthew Hanna,; Juan Retamero; William A. Moye; Razik Yousfi; Christopher Kanan; David; Klimstra; Brandon Rothrock; Thomas J. Fuchs

arXiv:2309.07778·eess.IV·January 19, 2024·56 cites

Virchow: A Million-Slide Digital Pathology Foundation Model

Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski,, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall,, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun, Lee, Julian Viret, Eric Robert, Yi Kan Wang

PDF

Open Access 1 Repo

TL;DR

Virchow is a large-scale foundation model for computational pathology trained on 1.5 million slides, achieving state-of-the-art accuracy in cancer detection and biomarker prediction, demonstrating the benefits of massive data and model scale.

Contribution

This work introduces Virchow, a vision transformer foundation model trained on unprecedentedly large pathology datasets using self-supervised learning.

Findings

01

Achieves 0.949 specimen-level AUC across 17 cancer types

02

Sets new state-of-the-art on multiple benchmarks

03

Highlights the benefits of large-scale training for pathology models

Abstract

The use of artificial intelligence to enable precision medicine and decision support systems through the analysis of pathology images has the potential to revolutionize the diagnosis and treatment of cancer. Such applications will depend on models' abilities to capture the diverse patterns observed in pathology images. To address this challenge, we present Virchow, a foundation model for computational pathology. Using self-supervised learning empowered by the DINOv2 algorithm, Virchow is a vision transformer model with 632 million parameters trained on 1.5 million hematoxylin and eosin stained whole slide images from diverse tissue and specimen types, which is orders of magnitude more data than previous works. The Virchow model enables the development of a pan-cancer detection system with 0.949 overall specimen-level AUC across 17 different cancer types, while also achieving 0.937 AUC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Paige-AI/paige-ml-sdk
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Digital Imaging for Blood Diseases

MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Linear Layer · Softmax · Residual Connection · Dense Connections · Vision Transformer