PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue
Eugene Vorontsov, George Shaikovski, Adam Casson, Julian Viret, Eric Zimmermann, Neil Tenenholtz, Yi Kan Wang, Jan H. Bernhard, Ran A. Godrich, Juan A. Retamero, Jinru Shia, Mithat Gonen, Martin R. Weiser, David S. Klimstra, Razik Yousfi, Nicolo Fusi, Thomas J. Fuchs

TL;DR
PRISM2 is a large multimodal foundation model for pathology that leverages clinical dialogue supervision to achieve high diagnostic accuracy and generalize across tasks, bridging human reasoning and AI capabilities.
Contribution
It introduces PRISM2, the largest histopathology dataset and a multimodal model trained on clinical dialogue, enhancing pathology AI's generalization and diagnostic performance.
Findings
PRISM2 matches or exceeds clinical-grade cancer detection performance.
PRISM2 achieves top results on various pathology tasks without additional training.
Task-specific fine-tuning with large datasets further improves performance.
Abstract
Recent rapid progress in the field of computational pathology has been enabled by foundation models. These models are beginning to move beyond encoding image patches towards whole-slide understanding but their clinical utility remains limited. In this work, we present PRISM2, a multimodal slide-level foundation model trained on data from 700,000 diagnostic specimen-report pairs, the largest vision (2.3 million whole slide images) and language (14M question-answer pairs) histopathology dataset to date. By learning through clinical-dialogue supervision, PRISM2 aligns histomorphologic features with the language of diagnostic reasoning, producing slide-level representations that support both direct diagnostic question-answering and transferable embeddings for downstream tasks. Without additional training, PRISM2 matches or exceeds the cancer-detection performance of clinical-grade products.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Machine Learning in Healthcare
MethodsALIGN
