Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells
Han Zhang, Guo-Hua Yuan, Chaohao Yuan, Tingyang Xu, Tian Bian, Hong Cheng, Wenbing Huang, Deli Zhao, Yu Rong

TL;DR
Lingshu-Cell is a novel discrete diffusion model that accurately simulates cellular transcriptomes and responses to perturbations, advancing virtual cell development.
Contribution
It introduces a discrete token-based generative model for transcriptomic data that captures complex dependencies without prior gene filtering.
Findings
Accurately reproduces transcriptomic distributions across tissues and species.
Predicts gene expression changes for novel cell type and perturbation combinations.
Achieves top performance on Virtual Cell Challenge and cytokine response prediction.
Abstract
Modeling cellular states and predicting their responses to perturbations are central challenges in computational biology and the development of virtual cells. Existing foundation models for single-cell transcriptomics provide powerful static representations, but they do not explicitly model the distribution of cellular states for generative simulation. Here, we introduce Lingshu-Cell, a masked discrete diffusion model that learns transcriptomic state distributions and supports conditional simulation under perturbation. By operating directly in a discrete token space that is compatible with the sparse, non-sequential nature of single-cell transcriptomic data, Lingshu-Cell captures complex transcriptome-wide expression dependencies across approximately 18,000 genes without relying on prior gene selection, such as filtering by high variability or ranking by expression level. Across diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
