Diversity by Design: Addressing Mode Collapse Improves scRNA-seq Perturbation Modeling on Well-Calibrated Metrics
Gabriel M. Mejia, Henry E. Miller, Francis J. A. Leblanc, Bo Wang, Brendan Swain, Lucas Paulo de Lima Camillo

TL;DR
This paper identifies that current evaluation metrics for scRNA-seq perturbation models are biased by control-referenced shifts, and proposes new metrics and loss functions to better measure true biological signals and reduce mode collapse.
Contribution
The authors introduce DEG-aware metrics and baseline calibrations for more accurate evaluation, and demonstrate that using WMSE as a loss reduces mode collapse and enhances model performance.
Findings
Control-referenced shifts inflate performance metrics.
DEG-aware metrics improve sensitivity to genuine signals.
Using WMSE as a loss reduces mode collapse.
Abstract
Recent benchmarks reveal that models for single-cell perturbation response are often outperformed by simply predicting the dataset mean. We trace this anomaly to a metric artifact: control-referenced deltas and unweighted error metrics reward mode collapse whenever the control is biased or the biological signal is sparse. Large-scale \textit{in silico} simulations and analysis of two real-world perturbation datasets confirm that shared reference shifts, not genuine biological change, drives high performance in these evaluations. We introduce differentially expressed gene (DEG)-aware metrics, weighted mean-squared error (WMSE) and weighted delta () with respect to all perturbations, that measure error in niche signals with high sensitivity. We further introduce negative and positive performance baselines to calibrate these metrics. With these improvements, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Gene Regulatory Network Analysis · RNA Research and Splicing
