Robust estimation of isoform expression with RNA-Seq data
Jun Li, Hui Jiang

TL;DR
This paper introduces a robust negative binomial GLM approach with quasi-likelihood equations for estimating isoform expression from RNA-Seq data, effectively handling overdispersion and outliers, and outperforming existing methods.
Contribution
It proposes a novel robust negative binomial GLM with quasi-likelihood equations for improved isoform expression estimation in RNA-Seq analysis.
Findings
Outperforms existing methods in simulations
Effective in handling overdispersion and outliers
Validated on real RNA-Seq data
Abstract
Qualifying gene and isoform expression is one of the primary tasks for RNA-Seq experiments. Given a sequence of counts representing numbers of reads mapped to different positions (exons and junctions) of isoforms, methods based on Poisson generalized linear models (GLM) with the identity link function have been proposed to estimate isoform expression levels from these counts. These Poisson based models have very limited ability in handling the overdispersion in the counts brought by various sources, and some of them are not robust to outliers. We propose a negative binomial based GLM with identity link, and use a set of robustified quasi-likelihood equations to make it resistant to outliers. An efficient and reliable numeric algorithm has been identified to solve these equations. In simulations, we find that our approach seems to outperform existing approaches. We also find evidence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA Research and Splicing · RNA and protein synthesis mechanisms · Molecular Biology Techniques and Applications
