Evaluation of tools for differential gene expression analysis by RNA-seq on a 48 biological replicate experiment
Nicholas J. Schurch, Pieta Schofield, Marek Gierli\'nski, Christian, Cole, Alexander Sherstnev, Vijender Singh, Nicola Wrobel, Karim Gharbi,, Gordon G. Simpson, Tom Owen-Hughes, Mark Blaxter, Geoffrey J. Barton

TL;DR
This study evaluates RNA-seq differential gene expression tools using a 48-replicate experiment, revealing how replicate number affects detection accuracy and identifying the most effective tools for various experimental designs.
Contribution
It provides empirical data on the performance of DGE analysis tools across different replicate numbers, guiding optimal experimental design and tool selection.
Findings
Higher replicates improve true positive rates for all genes.
edgeR performs best with 6-12 replicates, while DESeq excels with more than 12.
Achieving high TPR across all fold-changes requires more than 20 replicates.
Abstract
An RNA-seq experiment with 48 biological replicates in each of 2 conditions was performed to determine the number of biological replicates () required, and to identify the most effective statistical analysis tools for identifying differential gene expression (DGE). When , seven of the nine tools evaluated give true positive rates (TPR) of only 20 to 40 percent. For high fold-change genes () the TPR is percent. Two tools performed poorly; over- or under-predicting the number of differentially expressed genes. Increasing replication gives a large increase in TPR when considering all DE genes but only a small increase for high fold-change genes. Achieving a TPR % across all fold-changes requires . For future RNA-seq experiments these results suggest , rising to when identifying DGE irrespective of fold-change is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Molecular Biology Techniques and Applications · Genomics and Phylogenetic Studies
