Incorporating Scale Uncertainty into Differential Expression Analyses Using ALDEx2
Scott J. Dos Santos, Gregory B. Gloor

TL;DR
This paper introduces a method to improve differential expression analysis by accounting for scale uncertainty in RNA-seq and metatranscriptomic data using the ALDEx2 package.
Contribution
The paper introduces scale models in ALDEx2 to correct normalization biases and reduce false discoveries in differential expression analyses.
Findings
Using scale models in ALDEx2 reduces false-discovery rates in differential expression analyses.
Failure to account for scale uncertainty leads to high false-discovery rates due to incorrect normalization assumptions.
ALDEx2 outputs can be used for high-level data visualization through principal component analysis.
Abstract
Differential abundance or expression analyses are routinely performed on metagenomic, metatranscriptomic, and amplicon sequencing data. In such datasets, analysts usually have no information regarding the true scale (i.e., size) of the microbial community or sample under study, with inter‐sample differences in sequencing depth instead being driven by technical variation rather than biological factors. Recent work has demonstrated that normalizations used in all analysis tools make incorrect assumptions about the biological scale of the system in question, leading to unacceptably high false‐discovery rates in the output. To mitigate this, analysts can acknowledge and account for the uncertainty of the overall system scale during normalization by building scale models of the data—a feature that has been integrated into the ALDEx2 R package. Here, we provide reproducible examples that…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGut microbiota and health · Microbial Community Ecology and Physiology · Microbial Metabolic Engineering and Bioproduction
