Count-based differential expression analysis of RNA sequencing data using R and Bioconductor
Simon Anders, Davis J. McCarthy, Yunshen Chen, Michal Okoniewski,, Gordon K. Smyth, Wolfgang Huber, Mark D. Robinson

TL;DR
This paper provides a comprehensive workflow for differential expression analysis of RNA-seq data using R and Bioconductor, emphasizing best practices and practical implementation with DESeq and edgeR.
Contribution
It offers a detailed, practical protocol for RNA-seq differential expression analysis, integrating current best practices with open-source tools.
Findings
Efficient analysis workflow for small RNA-seq experiments
Guidance on data quality control and statistical modeling
Implementation using DESeq and edgeR tools
Abstract
RNA sequencing (RNA-seq) has been rapidly adopted for the profiling of transcriptomes in many areas of biology, including studies into gene regulation, development and disease. Of particular interest is the discovery of differentially expressed genes across different conditions (e.g., tissues, perturbations), while optionally adjusting for other systematic factors that affect the data collection process. There are a number of subtle yet critical aspects of these analyses, such as read counting, appropriate treatment of biological variability, quality control checks and appropriate setup of statistical modeling. Several variations have been presented in the literature, and there is a need for guidance on current best practices. This protocol presents a "state-of-the-art" computational and statistical RNA-seq differential expression analysis workflow largely based on the free open-source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
