NBLDA: Negative Binomial Linear Discriminant Analysis for RNA-Seq Data
Kai Dong, Hongyu Zhao, Xiang Wan, Tiejun Tong

TL;DR
This paper introduces NBLDA, a novel classification method for RNA-Seq data that models overdispersion using negative binomial distribution, outperforming previous Poisson-based approaches in simulations and real data.
Contribution
The paper develops a negative binomial linear discriminant analysis method specifically for RNA-Seq data, addressing overdispersion and improving classification accuracy.
Findings
NBLDA outperforms Poisson-based classifiers in simulations.
NBLDA achieves higher accuracy on real RNA-Seq datasets.
The method effectively models overdispersion in count data.
Abstract
RNA-sequencing (RNA-Seq) has become a powerful technology to characterize gene expression profiles because it is more accurate and comprehensive than microarrays. Although statistical methods that have been developed for microarray data can be applied to RNA-Seq data, they are not ideal due to the discrete nature of RNA-Seq data. The Poisson distribution and negative binomial distribution are commonly used to model count data. Recently, Witten (2011) proposed a Poisson linear discriminant analysis for RNA-Seq data. The Poisson assumption may not be as appropriate as negative binomial distribution when biological replicates are available and in the presence of overdispersion (i.e., when the variance is larger than the mean). However, it is more complicated to model negative binomial variables because they involve a dispersion parameter that needs to be estimated. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Phylogenetic Studies · Molecular Biology Techniques and Applications
