Bayesian Estimation of Negative Binomial Parameters with Applications to RNA-Seq Data
Luis Leon-Novelo, Claudio Fuentes, Sarah Emerson

TL;DR
This paper introduces a Bayesian approach to estimate negative binomial parameters for RNA-Seq data, improving stability and reliability over traditional methods, and applies it to differential gene expression analysis.
Contribution
It develops a conjugate Bayesian hierarchical model for negative binomial overdispersion estimation and demonstrates its effectiveness in RNA-Seq data analysis.
Findings
Bayesian estimator outperforms MLE in controlling variance.
Proposed method is competitive with existing negative binomial procedures.
Application to real data illustrates the method's flexibility.
Abstract
RNA-Seq data characteristically exhibits large variances, which need to be appropriately accounted for in the model. We first explore the effects of this variability on the maximum likelihood estimator (MLE) of the overdispersion parameter of the negative binomial distribution, and propose instead the use an estimator obtained via maximization of the marginal likelihood in a conjugate Bayesian framework. We show, via simulation studies, that the marginal MLE can better control this variation and produce a more stable and reliable estimator. We then formulate a conjugate Bayesian hierarchical model, in which the estimate of overdispersion is a marginalized estimate and use this estimator to propose a Bayesian test to detect differentially expressed genes with RNA-Seq data. We use numerical studies to show that our much simpler approach is competitive with other negative binomial based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Biology Techniques and Applications · Bayesian Methods and Mixture Models · Gene expression and cancer classification
