# Precision and Accuracy in Quantitative Measurement of Gene Expression from Single-cell/nucleus RNA Sequencing Data

**Authors:** Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu

PMC · DOI: 10.1093/gpbjnl/qzaf077 · Genomics, Proteomics & Bioinformatics · 2025-08-26

## TL;DR

This study evaluates the reliability of gene expression measurements in single-cell RNA sequencing and provides guidelines to improve study design and data quality.

## Contribution

The study introduces data-driven thresholds and a tool called VICE to assess and enhance the reproducibility of sc/snRNA-seq results.

## Key findings

- Precision and accuracy in single-cell RNA sequencing are generally low and strongly influenced by cell count and RNA quality.
- A minimum of 500 cells per cell type per individual is recommended for reliable quantification.
- Signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes.

## Abstract

Single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expression (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.

## Full-text entities

- **Genes:** VIP (vasoactive intestinal peptide) [NCBI Gene 7432] {aka PHM27}, RIT2 (Ras like without CAAX 2) [NCBI Gene 6014] {aka RIBA, RIN, ROC2}, SST (somatostatin) [NCBI Gene 6750] {aka SMST, SST1}
- **Diseases:** schizophrenia (MESH:D012559), neuropsychiatric disorders (MESH:D001523), Alzheimer (MESH:D000544), MCL (MESH:C535516), brain disorders (MESH:D001927), DE (MESH:D001039), autism (MESH:D001321)
- **Chemicals:** LPS (MESH:D008070), poly-I:C (MESH:D011070)
- **Species:** Rattus norvegicus (brown rat, species) [taxon 10116], Sus scrofa (pig, species) [taxon 9823], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606], Oryctolagus cuniculus (domestic rabbit, species) [taxon 9986]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12603356/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12603356/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/PMC12603356/full.md

---
Source: https://tomesphere.com/paper/PMC12603356