CHRep: Cross-modal Histology Representation and Post-hoc Calibration for Spatial Gene Expression Prediction
Changfan Wang, Xinran Wang, Donghai Liu, Fei Su, Lulu Sun, Zhicheng Zhao, and Zhu Meng

TL;DR
CHRep is a novel two-phase framework that enhances spatial gene expression prediction from histology images by learning structure-aware representations and applying post-hoc calibration to improve robustness across slides.
Contribution
It introduces a joint optimization approach for representation learning and a lightweight calibration module for cross-slide robustness without backbone fine-tuning.
Findings
Consistently improves gene-wise correlation in leave-one-slide-out evaluation.
Increases Pearson correlation coefficient by 4.0% on cSCC and 9.8% on HER2+ cohorts.
Further improves PCC by 39.5% on Alex+10x compared to mclSTExp.
Abstract
Spatial transcriptomics (ST) enables spatially resolved gene profiling but remains expensive and low-throughput, limiting large-cohort studies and routine clinical use. Predicting spatial gene expression from routine hematoxylin and eosin (H&E) slides is a promising alternative, yet under realistic leave-one-slide-out evaluation, existing models often suffer from slide-level appearance shifts and regression-driven over-smoothing that suppress biologically meaningful variation. CHRep is a two-phase framework for robust histology-to-expression prediction. In the training phase, CHRep learns a structure-aware representation by jointly optimizing correlation-aware regression, symmetric image-expression alignment, and coordinate-induced spatial topology regularization. In the inference phase, cross-slide robustness is improved without backbone fine-tuning through a lightweight calibration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
