CHRep: Cross-modal Histology Representation and Post-hoc Calibration for Spatial Gene Expression Prediction

Changfan Wang, Xinran Wang, Donghai Liu, Fei Su, Lulu Sun, Zhicheng Zhao, and Zhu Meng

arXiv:2604.21573·cs.CV·April 24, 2026

TL;DR

CHRep is a two-phase framework for predicting spatial gene expression from H&E histology images. It learns a structure-aware representation during training, then applies post-hoc calibration at inference to improve robustness across slides.

Contribution

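CHRep introduces a joint optimization scheme for representation learning (correlation-aware regression, symmetric image-expression alignment, and coordinate-induced spatial topology regularization) and a lightweight calibration module that improves cross-slide robustness without backbone fine-tuning.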

Findings

01 Consistently improves gene-wise correlation in leave-one-slide-out evaluation.

02 Increases the Pearson correlation coefficient (PCC) by 4.0% on the cSCC cohort and 9.8% on the HER2+ cohort.

03 Further improves PCC by 39.5% on Alex+10x compared to mclSTExp.
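
The metric behind these findings can be illustrated with a minimal sketch, assuming that "gene-wise correlation" means the PCC computed per gene across the spots of a held-out slide and then averaged over genes. The function names (`pearson`, `gene_wise_pcc`, `leave_one_slide_out`) and the list-of-lists data layout are hypothetical and are not taken from the paper.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Returns 0.0 when either sequence is constant (undefined correlation).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0.0 or var_y == 0.0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def gene_wise_pcc(pred, true):
    """Mean per-gene PCC.

    pred, true: one row per spot, one column per gene (assumed layout).
    """
    n_genes = len(true[0])
    scores = [
        pearson([row[g] for row in pred], [row[g] for row in true])
        for g in range(n_genes)
    ]
    return sum(scores) / n_genes


def leave_one_slide_out(slide_ids):
    """Yield (training slides, held-out slide) for every slide in turn."""
    for held_out in slide_ids:
        yield [s for s in slide_ids if s != held_out], held_out
```

Under this protocol each slide is held out once, a model is fit on the remaining slides, and `gene_wise_pcc` is evaluated on the held-out slide only.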

Abstract

Spatial transcriptomics (ST) enables spatially resolved gene profiling but remains expensive and low-throughput, limiting large-cohort studies and routine clinical use. Predicting spatial gene expression from routine hematoxylin and eosin (H&E) slides is a promising alternative, yet under realistic leave-one-slide-out evaluation, existing models often suffer from slide-level appearance shifts and regression-driven over-smoothing that suppress biologically meaningful variation. CHRep is a two-phase framework for robust histology-to-expression prediction. In the training phase, CHRep learns a structure-aware representation by jointly optimizing correlation-aware regression, symmetric image-expression alignment, and coordinate-induced spatial topology regularization. In the inference phase, cross-slide robustness is improved without backbone fine-tuning through a lightweight calibration…
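
The three training-phase terms named in the abstract can be sketched as a single weighted objective. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the loss forms, so the sketch assumes a `1 - PCC` regression term, a symmetric InfoNCE-style contrastive term for image-expression alignment, and a neighbor-smoothness penalty for the spatial topology term. All function names, the `temperature` and `radius` values, and the weights `w_align` and `w_topo` are hypothetical.

```python
import math


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _logsumexp(values):
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def correlation_loss(pred, target):
    """Assumed correlation-aware term: 1 - PCC for one gene across spots."""
    n = len(pred)
    mp, mt = sum(pred) / n, sum(target) / n
    cov = sum((p - mp) * (t - mt) for p, t in zip(pred, target))
    var = sum((p - mp) ** 2 for p in pred) * sum((t - mt) ** 2 for t in target)
    return 1.0 if var == 0.0 else 1.0 - cov / math.sqrt(var)


def symmetric_alignment_loss(img_emb, expr_emb, temperature=0.1):
    """Assumed alignment term: InfoNCE averaged over both directions.

    Row i of img_emb and row i of expr_emb describe the same spot.
    """
    n = len(img_emb)
    logits = [
        [_dot(img_emb[i], expr_emb[j]) / temperature for j in range(n)]
        for i in range(n)
    ]

    def cross_entropy(rows):
        # The matching pair sits on the diagonal.
        return -sum(rows[i][i] - _logsumexp(rows[i]) for i in range(n)) / n

    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy(logits) + cross_entropy(transposed))


def topology_loss(emb, coords, radius=1.5):
    """Assumed topology term: mean squared embedding distance between
    spots whose slide coordinates lie within `radius` of each other."""
    total, pairs = 0.0, 0
    for i in range(len(emb)):
        for j in range(i + 1, len(emb)):
            if math.dist(coords[i], coords[j]) <= radius:
                total += sum((a - b) ** 2 for a, b in zip(emb[i], emb[j]))
                pairs += 1
    return total / pairs if pairs else 0.0


def joint_loss(pred, target, img_emb, expr_emb, coords,
               w_align=1.0, w_topo=0.1):
    """Weighted sum of the three terms (weights are illustrative only)."""
    return (correlation_loss(pred, target)
            + w_align * symmetric_alignment_loss(img_emb, expr_emb)
            + w_topo * topology_loss(img_emb, coords))
```

Optimizing a correlation term rather than a pure squared error is one common way to counter the over-smoothing described above, since it rewards preserving variation across spots rather than regressing toward the mean.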
