Biological Spatial Priors Regularize Foundation Model Representations for Cross-Site MSI Generalization in Colorectal Cancer
Dasari Naga Raju

TL;DR
This study introduces biologically motivated spatial priors to enhance foundation model generalization for MSI prediction in colorectal cancer across different sites, improving robustness and site-invariance.
Contribution
It proposes a novel spatial prior based on peripheral distance encoding to regularize foundation model representations for better cross-site MSI classification.
Findings
Peripheral distance encoding achieves high AUC on external data.
Local immune neighborhood encoding shows lower cross-site specificity.
Spatial priors act as regularizers reducing site-specific pattern reliance.
Abstract
Predicting microsatellite instability (MSI) status from routine hematoxylin and eosin (H&E) whole slide images (WSIs) offers a practical alternative to molecular testing, but models trained at one institution tend to generalize poorly to slides acquired at a different site. Foundation model representations, despite their generality, still encode site-specific texture alongside the conserved biological morphology underlying MSI. We investigate whether tile-level spatial priors derived from known MSI histology can guide these representations toward more site-invariant features. We introduce a biologically motivated spatial prior based on peripheral distance encoding, reflecting the Crohn's-like peripheral lymphocytic reaction at the tumor invasive margin, and evaluate a secondary local immune neighborhood encoding reflecting the lymphocyte-to-tumor ratio in each tile's immediate spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
