# Optimal sequencing budget allocation for trajectory reconstruction of single cells

**Authors:** Noa Moriel, Edvin Memet, Mor Nitzan

PMC · DOI: 10.1093/bioinformatics/btae258 · 2024-06-28

## TL;DR

This paper explores how to best use a limited sequencing budget to accurately reconstruct cellular development paths from single-cell RNA-sequencing data.

## Contribution

The study introduces a framework for optimally allocating sequencing budget between cell sampling breadth and sequencing depth to improve trajectory reconstruction.

## Key findings

- Trajectory reconstruction accuracy scales logarithmically with sequencing breadth or depth.
- Optimal cell sampling follows a power law relationship with sequencing budget.
- Non-monotonic behavior in trajectory reconstruction can affect downstream analysis like expression pattern detection.

## Abstract

Charting cellular trajectories over gene expression is key to understanding dynamic cellular processes and their underlying mechanisms. While advances in single-cell RNA-sequencing technologies and computational methods have pushed forward the recovery of such trajectories, trajectory inference remains a challenge due to the noisy, sparse, and high-dimensional nature of single-cell data. This challenge can be alleviated by increasing either the number of cells sampled along the trajectory (breadth) or the sequencing depth, i.e. the number of reads captured per cell (depth). Generally, these two factors are coupled due to an inherent breadth-depth tradeoff that arises when the sequencing budget is constrained due to financial or technical limitations.

Here we study the optimal allocation of a fixed sequencing budget to optimize the recovery of trajectory attributes. Empirical results reveal that reconstruction accuracy of internal cell structure in expression space scales with the logarithm of either the breadth or depth of sequencing. We additionally observe a power law relationship between the optimal number of sampled cells and the corresponding sequencing budget. For linear trajectories, non-monotonicity in trajectory reconstruction across the breadth-depth tradeoff can impact downstream inference, such as expression pattern analysis along the trajectory. We demonstrate these results for five single-cell RNA-sequencing datasets encompassing differentiation of embryonic stem cells, pancreatic beta cells, hepatoblast and multipotent hematopoietic cells, as well as induced reprogramming of embryonic fibroblasts into neurons. By addressing the challenges of single-cell data, our study offers insights into maximizing the efficiency of cellular trajectory analysis through strategic allocation of sequencing resources.

## Full-text entities

- **Species:** Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** mESC — Mus musculus (Mouse), Embryonic stem cell (CVCL_4378)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11211845/full.md

---
Source: https://tomesphere.com/paper/PMC11211845