Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling
Mengran Li, Zelin Zang, Wenbin Xing, Junzhou Chen, Ronghui Zhang, Jiebo Luo, Stan Z. Li

TL;DR
This paper introduces CHMR, a hierarchical multi-modal framework that integrates molecular and cellular data to improve biological property prediction, addressing data incompleteness and hierarchical modeling challenges.
Contribution
It presents a novel tree-structured vector quantization module and a joint modeling approach for hierarchical dependencies, advancing cell-aware molecular representations.
Findings
Outperforms state-of-the-art on nine benchmarks
Achieves 3.6% improvement on classification tasks
Achieves 17.2% improvement on regression tasks
Abstract
Understanding how chemical perturbations propagate through biological systems is essential for robust molecular property prediction. While most existing methods focus on chemical structures alone, recent advances highlight the crucial role of cellular responses such as morphology and gene expression in shaping drug effects. However, current cell-aware approaches face two key limitations: (1) modality incompleteness in external biological data, and (2) insufficient modeling of hierarchical dependencies across molecular, cellular, and genomic levels. We propose CHMR (Cell-aware Hierarchical Multi-modal Representations), a robust framework that jointly models local-global dependencies between molecules and cellular responses and captures latent biological hierarchies via a novel tree-structured vector quantization module. Evaluated on nine public benchmarks spanning 728 tasks, CHMR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsCell Image Analysis Techniques · Computational Drug Discovery Methods · Machine Learning in Bioinformatics
