TL;DR
SAVE is a Transformer-based framework that models multi-condition single-cell gene expression by grouping genes into blocks, enabling better generalization and simulation of unseen biological scenarios.
Contribution
It introduces gene block attention and a flow matching mechanism to improve multi-condition single-cell modeling and generalization to new conditions.
Findings
Outperforms state-of-the-art methods in generation fidelity.
Enables extrapolative generalization to unseen conditions.
Effective in low-resource and combinatorially held-out settings.
Abstract
Modeling single-cell gene expression across diverse biological and technical conditions is crucial for characterizing cellular states and simulating unseen scenarios. Existing methods often treat genes as independent tokens, overlooking their high-level biological relationships and leading to poor performance. We introduce SAVE, a unified generative framework based on conditional Transformers for multi-condition single-cell modeling. SAVE leverages a coarse-grained representation by grouping semantically related genes into blocks, capturing higher-order dependencies among gene modules. A Flow Matching mechanism and condition-masking strategy further enhance flexible simulation and enable generalization to unseen condition combinations. We evaluate SAVE on a range of benchmarks, including conditional generation, batch effect correction, and perturbation prediction. SAVE consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
