Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective
Shengjia Chen, Gabriele Campanella, Abdulkadir Elmas, Aryeh Stock,, Jennifer Zeng, Alexandros D. Polydorides, Adam J. Schoenfeld, Kuan-lin Huang,, Jane Houldsworth, Chad Vanderbilt, Thomas J. Fuchs

TL;DR
This paper benchmarks ten slide-level aggregation methods for digital pathology images across nine clinical tasks, revealing the superiority of domain-specific embeddings and highlighting the need for adaptable aggregation techniques.
Contribution
It provides a comprehensive comparison of aggregation methods in computational pathology, emphasizing the importance of domain-specific models and spatial-aware techniques.
Findings
Domain-specific FMs outperform ImageNet models.
Spatial-aware aggregators improve ImageNet-based models.
No single model is best for all tasks.
Abstract
Recent advances in artificial intelligence (AI), in particular self-supervised learning of foundation models (FMs), are revolutionizing medical imaging and computational pathology (CPath). A constant challenge in the analysis of digital Whole Slide Images (WSIs) is the problem of aggregating tens of thousands of tile-level image embeddings to a slide-level representation. Due to the prevalent use of datasets created for genomic research, such as TCGA, for method development, the performance of these techniques on diagnostic slides from clinical practice has been inadequately explored. This study conducts a thorough benchmarking analysis of ten slide-level aggregation techniques across nine clinically relevant tasks, including diagnostic assessment, biomarker classification, and outcome prediction. The results yield following key insights: (1) Embeddings derived from domain-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Statistical Methods in Clinical Trials
