Do Pathology Foundation Models Encode Disease Progression? A Pseudotime Analysis of Visual Representations
Pritika Vig (1, 2), Ren-Chin Wu (3), William Lotter (2, 4, 5) ((1) Massachusetts Institute of Technology, (2) Department of Data Science, Dana-Farber Cancer Institute, (3) Department of Pathology, Dana-Farber Cancer Institute, (4) Brigham, Women's Hospital

TL;DR
This study investigates whether vision foundation models trained on pathology images encode disease progression by analyzing their representations with pseudotime methods, revealing they do implicitly capture continuous disease trajectories.
Contribution
The paper introduces a novel application of diffusion pseudotime to evaluate if foundation models encode disease progression, demonstrating their ability to organize disease states along coherent trajectories.
Findings
Pathology-specific models recover disease trajectories significantly better than null baselines.
Trajectory fidelity strongly predicts few-shot classification performance on new diseases.
Cell-type composition varies smoothly along inferred disease trajectories, consistent with biological knowledge.
Abstract
Vision foundation models trained on discretely sampled images achieve strong performance on classification benchmarks, yet whether their representations encode the continuous processes underlying their training data remains unclear. This question is especially pertinent in computational pathology, where we posit that models whose latent representations implicitly capture continuous disease progression may better reflect underlying biology, support more robust generalization, and enable quantitative analyses of features associated with disease transitions. Using diffusion pseudotime, a method developed to infer developmental trajectories from single-cell transcriptomics, we probe whether foundation models organize disease states along coherent progression directions in representation space. Across four cancer progressions and six models, we find that all pathology-specific models recover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · AI in cancer detection · Cell Image Analysis Techniques
