STRAND: Sequence-Conditioned Transport for Single-Cell Perturbations
Boyang Fu, George Dasoulas, Sameer Gabbita, Xiang Lin, Shanghua Gao, Xiaorui Su, Soumya Ghosh, Marinka Zitnik

TL;DR
STRAND is a novel generative model that predicts single-cell transcriptional responses to genetic perturbations by conditioning on regulatory DNA sequences, enabling zero-shot inference and improved accuracy across cell types.
Contribution
It introduces sequence-based perturbation representation and a conditional transport process, significantly expanding inference coverage and accuracy over gene-level models.
Findings
Improves discrimination scores by up to 33% in low-sample regimes.
Achieves best average rank on unseen gene perturbation benchmarks.
Enhances transfer to new cell lines with up to 0.14 increase in Pearson correlation.
Abstract
Predicting how genetic perturbations change cellular state is a core problem for building controllable models of gene regulation. Perturbations targeting the same gene can produce different transcriptional responses depending on their genomic locus, including different transcription start sites and regulatory elements. Gene-level perturbation models collapse these distinct interventions into the same representation. We introduce STRAND, a generative model that predicts single-cell transcriptional responses by conditioning on regulatory DNA sequence. STRAND represents a perturbation by encoding the sequence at its genomic locus and uses this representation to parameterize a conditional transport process from control to perturbed cell states. Representing perturbations by sequence, rather than by a fixed set of gene identifiers, supports zero-shot inference at loci not seen during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Genomics and Chromatin Dynamics · RNA Research and Splicing
