Modeling and analysis of RNA-seq data: a review from a statistical perspective
Wei Vivian Li, Jingyi Jessica Li

TL;DR
This review discusses statistical models and methods for analyzing RNA-seq data across different biological levels, highlighting their assumptions, performance, and practical considerations to guide users and developers.
Contribution
It provides a comprehensive comparison of statistical models used in RNA-seq analysis, aiding method selection and future development.
Findings
Significant advances in statistical methods over the past decade.
Diverse models exhibit different performances depending on scenarios.
Guidance provided for selecting appropriate analysis methods.
Abstract
Background: Since the invention of next-generation RNA sequencing (RNA-seq) technologies, they have become a powerful tool to study the presence and quantity of RNA molecules in biological samples and have revolutionized transcriptomic studies. The analysis of RNA-seq data at four different levels (samples, genes, transcripts, and exons) involve multiple statistical and computational questions, some of which remain challenging up to date. Results: We review RNA-seq analysis tools at the sample, gene, transcript, and exon levels from a statistical perspective. We also highlight the biological and statistical questions of most practical considerations. Conclusion: The development of statistical and computational methods for analyzing RNA- seq data has made significant advances in the past decade. However, methods developed to answer the same biological question often rely on diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
