GOLDMARK: Governed Outcome-Linked Diagnostic Model Assessment Reference Kit
Chad Vanderbilt, Gabriele Campanella, Siddharth Singi, Swaraj Nanda, Jie-Fu Chen, Ali Kamali, Amir Momeni Boroujeni, David Kim, Mohamed Yakoub, Jamal Benhamida, Meera Hameed, Neeraj Kumar, and Gregory Goldgof

TL;DR
GOLDMARK provides a standardized benchmarking framework for computational pathology, enabling reproducible evaluation of AI models on histopathology data with structured intermediate representations and cross-cohort testing.
Contribution
It introduces GOLDMARK, a comprehensive, standardized platform for benchmarking AI models in computational pathology with curated datasets and reproducible evaluation protocols.
Findings
Mean AUROC of 0.689 on TCGA and 0.630 on MSKCC datasets.
Highest-performing tasks achieved AUROCs of 0.831 and 0.801, respectively.
Stable cross-site performance on key morphologic-genomic associations.
Abstract
Computational biomarkers (CBs) are histopathology-derived patterns extracted from hematoxylin-eosin (H&E) whole-slide images (WSIs) using artificial intelligence (AI) to predict therapeutic response or prognosis. Recently, slide-level multiple-instance learning (MIL) with pathology foundation models (PFMs) has become the standard baseline for CB development. While these methods have improved predictive performance, computational pathology lacks standardized intermediate data formats, provenance tracking, checkpointing conventions, and reproducible evaluation metrics required for clinical-grade deployment. We introduce GOLDMARK (https://artificialintelligencepathology.org), a standardized benchmarking framework built on a curated TCGA cohort with clinically actionable OncoKB level 1-3 biomarker labels. GOLDMARK releases structured intermediate representations, including tile coordinate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Cell Image Analysis Techniques
