TL;DR
AROMA is a multimodal architecture that integrates textual, topological, and protein features to improve virtual cell genetic perturbation predictions, emphasizing interpretability and robustness.
Contribution
It introduces a novel multimodal model with a two-stage training strategy and constructs extensive resources for virtual cell modeling.
Findings
AROMA outperforms existing methods across multiple cell lines.
AROMA remains robust in zero-shot and knowledge-sparse scenarios.
The model provides interpretable predictions aligned with biological topology.
Abstract
Virtual cell modeling predicts molecular state changes under genetic perturbations in silico, which is essential for biological mechanism studies. However, existing approaches suffer from unconstrained reasoning, uninterpretable predictions, and retrieval signals that are weakly aligned with regulatory topology. To address these limitations, we propose AROMA, an Augmented Reasoning Over a Multimodal Architecture for virtual cell genetic perturbation modeling. AROMA integrates textual evidence, graph-topology information, and protein sequence features to model perturbation-target dependencies, and is trained with a two-stage optimization strategy to yield predictions that are both accurate and interpretable. We also construct two knowledge graphs and a perturbation reasoning dataset, PerturbReason, containing more than 498k samples, as reusable resources for the virtual cell domain.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
