# A foundation model-based multi-instance learning framework for accurate prediction of lymph node metastasis in prostate cancer from whole slide images

**Authors:** Guang Zeng, Weiwei Li, Haonan Mei, Ran Du

PMC · DOI: 10.3389/fonc.2026.1775750 · Frontiers in Oncology · 2026-03-04

## TL;DR

This study presents an AI framework using foundation models and multi-instance learning to accurately predict lymph node metastasis in prostate cancer from whole slide images.

## Contribution

A novel weakly supervised deep learning framework integrating MIL with domain-adapted foundation models for interpretable N stage prediction in prostate cancer.

## Key findings

- The UNI-v2-based model achieved the highest AUC of 0.879 in RHWU and 0.850 in TCGA for N stage prediction.
- Attention heatmaps highlighted tumor-stromal interfaces and poorly differentiated tumor clusters as key features.
- Transcriptomic analysis identified 94 differentially expressed genes linked to cell cycle, immune pathways, and metabolism.

## Abstract

Nodal involvement (N stage) is a key prognostic factor in prostate cancer (PCa). Conventional imaging and histopathology often have limited sensitivity and inter-observer variability. AI-based computational pathology, using multi-instance learning (MIL) and foundation models, offers a promising approach for accurate and interpretable N stage prediction from H&E-stained whole slide images (WSIs).

In this multicenter retrospective study, we developed a weakly supervised deep learning framework integrating MIL with domain-adapted foundation model encoders (UNI-v2, CONCH, ResNet-50) to predict N stage. WSIs from 280 RHWU patients were used for training and 306 TCGA patients for external validation. Attention heatmaps enabled interpretability, while transcriptomic analyses explored molecular correlates via differential expression and bioinformation analysis.

The UNI-v2-based model achieved the highest performance (AUC 0.879 in RHWU, 0.850 in TCGA), surpassing CONCH and ResNet-50. Attention heatmaps highlighted tumor-stromal interfaces and poorly differentiated tumor clusters. Transcriptomic analysis identified 94 differentially expressed genes; upregulated genes were enriched in cell cycle, and immune pathways, while downregulated genes involved ion transport and metabolism.

This AI-MIL framework accurately predicts nodal involvement in PCa and provides biologically interpretable insights, supporting its potential as a precision oncology tool for risk stratification and treatment planning.

## Linked entities

- **Diseases:** prostate cancer (MONDO:0005159)

## Full-text entities

- **Diseases:** tumor (MESH:D009369), lymph node metastasis (MESH:D008207), PCa (MESH:D011471), N (MESH:C536108)
- **Chemicals:** H&amp;E (MESH:D006371)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12995785/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12995785/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12995785/full.md

---
Source: https://tomesphere.com/paper/PMC12995785