Virchow: A Million-Slide Digital Pathology Foundation Model
Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski,, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall,, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun, Lee, Julian Viret, Eric Robert, Yi Kan Wang

TL;DR
Virchow is a large-scale foundation model for computational pathology trained on 1.5 million slides, achieving state-of-the-art accuracy in cancer detection and biomarker prediction, demonstrating the benefits of massive data and model scale.
Contribution
This work introduces Virchow, a vision transformer foundation model trained on unprecedentedly large pathology datasets using self-supervised learning.
Findings
Achieves 0.949 specimen-level AUC across 17 cancer types
Sets new state-of-the-art on multiple benchmarks
Highlights the benefits of large-scale training for pathology models
Abstract
The use of artificial intelligence to enable precision medicine and decision support systems through the analysis of pathology images has the potential to revolutionize the diagnosis and treatment of cancer. Such applications will depend on models' abilities to capture the diverse patterns observed in pathology images. To address this challenge, we present Virchow, a foundation model for computational pathology. Using self-supervised learning empowered by the DINOv2 algorithm, Virchow is a vision transformer model with 632 million parameters trained on 1.5 million hematoxylin and eosin stained whole slide images from diverse tissue and specimen types, which is orders of magnitude more data than previous works. The Virchow model enables the development of a pan-cancer detection system with 0.949 overall specimen-level AUC across 17 different cancer types, while also achieving 0.937 AUC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Digital Imaging for Blood Diseases
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Linear Layer · Softmax · Residual Connection · Dense Connections · Vision Transformer
