Generating crossmodal gene expression from cancer histopathology improves multimodal AI predictions
Samiran Dey, Christopher R.S. Banerji, Partha Basuchowdhuri, Sanjoy K. Saha, Deepak Parashar, Tapabrata Chakraborti

TL;DR
This paper introduces PathGen, a diffusion-based AI model that synthesizes gene expression data from histopathology images, enhancing cancer diagnosis and prognosis predictions without needing actual transcriptomic tests.
Contribution
The study presents a novel diffusion-based generative model, PathGen, that synthesizes transcriptomic data from histopathology images, enabling improved multimodal cancer prediction in clinical settings.
Findings
Significant performance improvements in cancer grading and risk estimation with synthesized data.
Synthesized gene expression features are statistically comparable to real data.
PathGen achieves state-of-the-art accuracy and interpretability in multimodal predictions.
Abstract
Emerging research has highlighted that artificial intelligence-based multimodal fusion of digital pathology and transcriptomic features can improve cancer diagnosis (grading/subtyping) and prognosis (survival risk) prediction. However, such direct fusion is impractical in clinical settings, where histopathology remains the gold standard and transcriptomic tests are rarely requested in public healthcare. We experiment on two publicly available multimodal datasets, The Cancer Genomic Atlas and the Clinical Proteomic Tumor Analysis Consortium, spanning four independent cohorts: glioma-glioblastoma, renal, uterine, and breast, and observe significant performance gains in gradation and risk estimation (p-value<0.05) when incorporating synthesized transcriptomic data with WSIs. Also, predictions using synthesized features were statistically close to those obtained with real transcriptomic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
MethodsSoftmax · Attention Is All You Need · Diffusion
