Metagenomic Analysis using Phylogenetic Placement -- A Review of the First Decade
Lucas Czech, Alexandros Stamatakis, Micah Dunthorn, Pierre Barbera

TL;DR
This review summarizes a decade of phylogenetic placement methods for metagenomic data, highlighting workflows, tools, best practices, and applications to improve accuracy and interpretation of microbial community analyses.
Contribution
It provides a comprehensive overview of phylogenetic placement techniques, workflows, and best practices developed over the first ten years, aiding researchers in metagenomic analysis.
Findings
Enhanced accuracy in metagenomic surveys using phylogenetic context
Identification of common pitfalls and misconceptions in placement methods
Showcase of typical analyses and visualization techniques
Abstract
Phylogenetic placement refers to a family of tools and methods to analyze, visualize, and interpret the tsunami of metagenomic sequencing data generated by high-throughput sequencing. Compared to alternative (e. g., similarity-based) methods, it puts metabarcoding sequences into a phylogenetic context using a set of known reference sequences and taking evolutionary history into account. Thereby, one can increase the accuracy of metagenomic surveys and eliminate the requirement for having exact or close matches with existing sequence databases. Phylogenetic placement constitutes a valuable analysis tool per se, but also entails a plethora of downstream tools to interpret its results. A common use case is to analyze species communities obtained from metagenomic sequencing, for example via taxonomic assignment, diversity quantification, sample comparison, and identification of correlations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
