# A cancer-type-aware framework for robust multimodal survival prediction under missing modalities

**Authors:** Yiran Song, Zaifu Zhan, Feng Xie, Nian Wang, Yifan Peng, Rui Zhang, Mingquan Lin

PMC · DOI: 10.1093/bib/bbag124 · Briefings in Bioinformatics · 2026-03-23

## TL;DR

This paper introduces a new framework for predicting cancer survival that works well even when some data is missing and adapts to different cancer types.

## Contribution

The novel framework combines adaptive fusion of missing data with cancer-specific modeling and achieves robust cross-institutional performance.

## Key findings

- The framework achieved C-indices of 0.578–0.778 across 10 cancer types.
- It maintained performance with missing RNA or clinical text data.
- Cross-institutional validation showed stability with standard deviations <0.040 in eight cancer types.

## Abstract

Despite advances in multimodal cancer prognosis, robust performance in practical settings remains hindered by three critical barriers: ubiquitous data incompleteness, failure to model cancer-specific biology, and cross-institutional instability. We address these practical challenges through a cancer-type-aware framework that uniquely combines adaptive gated fusion for missing modalities, hybrid architecture for cancer heterogeneity, and demonstrated cross-institutional robustness. By establishing histopathology as the universally available anchor modality while adaptively incorporating RNA expression and clinical text through gated fusion, our framework maintains robust performance under realistic data constraints. Evaluation across 10 The Cancer Genome Atlas cancer types demonstrated superior performance (C-indices 0.578–0.778; mean 0.670 \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
$\pm $\end{document} 0.066), with state-of-the-art results in six cancer types. The framework maintained predictive performance under missing data scenarios, with C-indices ranging from 0.621 to 0.627 for missing RNA and from 0.568 to 0.606 for missing clinical text. Cross-institutional validation across 12–38 centers demonstrated robust cross-institutional performance (standard deviations <0.040 in eight of 10 cancer types). This methodological framework addresses key technical prerequisites—handling missing data, modeling cancer heterogeneity, and ensuring cross-institutional stability—for multimodal survival prediction, providing computational foundations necessary for future prospective clinical validation.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Genes:** TENM1 (teneurin transmembrane protein 1) [NCBI Gene 10178] {aka ODZ1, ODZ3, TEN-M1, TEN1, TNM, TNM1}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}
- **Diseases:** LUAD (MESH:D000077192), endometrial carcinoma (MESH:D016889), LGG (MESH:D005910), GBM (MESH:D005909), Cancer (MESH:D009369), death (MESH:D003643), COAD (MESH:D003110), breast invasive carcinoma (MESH:D001943), READ (MESH:D012004), BLCA (MESH:D001749), KIRC (MESH:D002292), PAAD (MESH:D010190)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13006980/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13006980/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC13006980/full.md

---
Source: https://tomesphere.com/paper/PMC13006980