Biases in differential expression analysis of RNA-seq data: A matter of replicate type
Sora Yoon, Dougu Nam

TL;DR
This paper investigates biases in RNA-seq differential expression analysis, revealing that certain biases are specific to technical replicates and can lead to false positives in gene set enrichment, emphasizing careful analysis of technical replicate data.
Contribution
The study clarifies that biases in DE analysis are confined to technical replicates, providing guidance on when current methods are reliable or require caution.
Findings
Biases are specific to technical replicate data.
Current DE methods are reliable for biological replicates.
Technical replicate analysis can produce false positives.
Abstract
In differential expression (DE) analysis of RNA-seq count data, it is known that genes with a larger read number are more likely to be differentially expressed. This bias has a profound effect on the subsequent Gene Ontology (GO) analysis by perturbing the ranks of gene-sets. Another known bias is that the commonly used parametric DE analysis methods (e.g., edgeR, DESeq and baySeq) tend to yield more DE genes as the sequencing depth is increased. We nevertheless show that these biases are in fact confined to data of the technical replicate type. We also show the GO or gene-set enrichment analysis methods applied to technical replicate data result in considerable number of false positives. In conclusion, the current DE and enrichment analysis methods can be confidently used for biological replicate count data, while caution should be exercised when analysing technical replicate data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genomics and Phylogenetic Studies
