AROMA: Augmented Reasoning Over a Multimodal Architecture for Virtual Cell Genetic Perturbation Modeling

Zhenyu Wang; Geyan Ye; Wei Liu; Man Tat Alexander Ng

arXiv:2604.20263·q-bio.QM·April 23, 2026

TL;DR

AROMA is a multimodal architecture that integrates textual evidence, graph-topology information, and protein sequence features to predict the effects of genetic perturbations in virtual cells, with an emphasis on interpretability and robustness.

Contribution

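The paper introduces a multimodal model trained with a two-stage optimization strategy, and constructs two knowledge graphs plus PerturbReason, a perturbation reasoning dataset of more than 498k samples, as reusable resources for virtual cell modeling.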

Findings

01 AROMA outperforms existing methods across multiple cell lines.

02 AROMA remains robust in zero-shot and knowledge-sparse scenarios.

03 The model provides interpretable predictions aligned with biological topology.

Abstract

Virtual cell modeling predicts molecular state changes under genetic perturbations in silico, which is essential for biological mechanism studies. However, existing approaches suffer from unconstrained reasoning, uninterpretable predictions, and retrieval signals that are weakly aligned with regulatory topology. To address these limitations, we propose AROMA, an Augmented Reasoning Over a Multimodal Architecture for virtual cell genetic perturbation modeling. AROMA integrates textual evidence, graph-topology information, and protein sequence features to model perturbation-target dependencies, and is trained with a two-stage optimization strategy to yield predictions that are both accurate and interpretable. We also construct two knowledge graphs and a perturbation reasoning dataset, PerturbReason, containing more than 498k samples, as reusable resources for the virtual cell domain.…
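
The abstract names three input modalities (textual evidence, graph topology, protein sequence features) but this page does not describe how they are combined. As a rough illustration only, the sketch below shows one common pattern, late fusion by projecting each modality into a shared space and concatenating the results. It is not the authors' architecture: the function names, the embedding sizes, the random weights, and the three-way output labels are all hypothetical placeholders.

```python
import math
import random


def random_matrix(rows, cols, rng):
    """Small random weight matrix; stands in for learned parameters."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def project(vec, weights):
    """Linear projection: one output value per row of `weights`."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_and_predict(text_vec, graph_vec, protein_vec, params):
    """Project each modality to a shared size, concatenate, then classify."""
    fused = []
    for name, vec in (("text", text_vec), ("graph", graph_vec), ("protein", protein_vec)):
        fused.extend(project(vec, params[name]))
    return softmax(project(fused, params["head"]))


rng = random.Random(0)
SHARED_DIM = 4                        # assumed shared embedding size
LABELS = ["down", "unchanged", "up"]  # hypothetical output classes

# Hypothetical per-modality input sizes: text=8, graph=6, protein=10.
params = {
    "text": random_matrix(SHARED_DIM, 8, rng),
    "graph": random_matrix(SHARED_DIM, 6, rng),
    "protein": random_matrix(SHARED_DIM, 10, rng),
    "head": random_matrix(len(LABELS), 3 * SHARED_DIM, rng),
}

# Placeholder feature vectors; a real system would obtain these from encoders.
text_vec = [rng.random() for _ in range(8)]
graph_vec = [rng.random() for _ in range(6)]
protein_vec = [rng.random() for _ in range(10)]

probs = fuse_and_predict(text_vec, graph_vec, protein_vec, params)
for label, p in zip(LABELS, probs):
    print(f"{label}: {p:.3f}")
```

Concatenation is used here only because it is the simplest way to combine the three inputs; the paper itself may use a different fusion mechanism, and the linked repository is the authoritative reference.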

Code & Models

Repositories

blazerye/AROMA (GitHub)