# Reference-free deconvolution of complex samples based on cross-cell-type differential analysis: Systematic evaluations with various feature selection options

**Authors:** Weiwei Zhang, Zhonghe Tian, Ling Peng

PMC · DOI: 10.3389/fgene.2025.1570781 · Frontiers in Genetics · 2025-05-30

## TL;DR

This paper introduces a new reference-free method for estimating cell composition in complex genomic samples by using cross-cell-type differential analysis.

## Contribution

The novel method, RFdecd, improves reference-free deconvolution through iterative feature selection based on cross-cell-type differential analysis.

## Key findings

- RFdecd outperformed existing methods in both simulated and real datasets.
- The method effectively identifies cell-type-specific features without requiring reference data.
- Iterative feature selection significantly enhances estimation accuracy.

## Abstract

Genomic and epigenomic data from complex samples reflect the average level of multiple cell types. However, differences in cell compositions can introduce bias into many relevant analyses. Consequently, the accurate estimation of cell compositions has been regarded as an important initial step in the analysis of complex samples. A large number of computational methods have been developed for estimating cell compositions; however, their applications are limited due to the absence of reference or prior information. As a result, reference-free deconvolution has the potential to be widely applied due to its flexibility. A previous study emphasized the importance of feature selection for improving estimation accuracy in reference-free deconvolution.

In this paper, we systematically evaluated five feature selection options and developed an optimal feature-selection-based reference-free deconvolution method. Our proposal iteratively searches for cell-type-specific (CTS) features by integrating cross-cell-type differential analysis between one cell type and the other cell types, as well as between two cell types and the other cell types, and performs composition estimation.

Comprehensive simulation studies and analyses of seven real datasets show the excellent performance of the proposed method. The proposed method, that is, reference-free deconvolution based on cross-cell-type differential (RFdecd), is implemented as an R package at https://github.com/wwzhang-study/RFdecd.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12162504/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12162504/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/PMC12162504/full.md

---
Source: https://tomesphere.com/paper/PMC12162504