# Robustness and resilience of computational deconvolution methods for bulk RNA sequencing data

**Authors:** Su Xu, Duan Chen, Xue Wang, Shaoyu Li

PMC · DOI: 10.1093/bib/bbaf264 · Briefings in Bioinformatics · 2025-06-12

## TL;DR

This study compares how well different methods estimate cell-type proportions in bulk RNA sequencing data, focusing on their reliability and adaptability.

## Contribution

The study introduces a systematic benchmarking framework for evaluating deconvolution methods under varying conditions.

## Key findings

- Reference-based methods perform better when high-quality reference data is available.
- Reference-free methods are more effective in the absence of suitable reference data.
- Cell-level transcriptomic variations and composition significantly impact deconvolution accuracy.

## Abstract

This study benchmarks the robustness and resilience of computational deconvolution methods for estimating cell-type proportions in bulk tissues, with a focus on comparing reference-based and reference-free methods. Robustness is evaluated by generating in silico pseudo-bulk tissue RNA sequencing data from cell-level gene expression profiles derived from four different tissue types, with simulated cellular composition at varying levels of heterogeneity. To assess resilience, we intentionally alter single-cell RNA profiles to create pseudo-bulk tissue RNA-seq data. Deconvolution estimates are compared with ground truth using Pearson’s correlation coefficient, root mean squared deviation, and mean absolute deviation. The results show that reference-based methods are more robust when reliable reference data are available, whereas reference-free methods excel in scenarios lacking suitable reference data. Furthermore, variations in cell-level transcriptomic profiles and cell composition have emerged as critical factors influencing the performance of deconvolution methods. This study provides significant insights into the factors affecting bulk tissue deconvolution performance, which are essential for guiding users and advancing the development of more powerful and reliable algorithms in the future.

## Full-text entities

- **Genes:** CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}
- **Diseases:** AD (MESH:D000544), T2D (MESH:D003924), neuronal loss (MESH:D009410), neurodegenerative disease (MESH:D019636), MAlignant Tumours (MESH:D009369)
- **Chemicals:** E-MTAB-5061 (-)
- **Species:** Rattus norvegicus (brown rat, species) [taxon 10116], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** LM22 — Homo sapiens (Human), Astrocytoma, Cancer cell line (CVCL_A1IU)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12159287/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12159287/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12159287/full.md

---
Source: https://tomesphere.com/paper/PMC12159287