tRNA signatures reveal polyphyletic origins of streamlined SAR11 genomes among the alphaproteobacteria
Katherine C.H. Amrine, Wesley D. Swingley, David H. Ardell

TL;DR
This study introduces a novel tRNA signature-based method for phylogenetic classification that overcomes biases from compositional convergence and horizontal gene transfer, clarifying the evolutionary origins of streamlined SAR11 genomes.
Contribution
The paper develops a new tRNA CIF-based phyloclassification approach that provides more accurate phylogenetic placement of SAR11 genomes compared to traditional methods.
Findings
tRNA CIF-based classification rejects monophyly of SAR11
Most SAR11 strains are affiliated with Rhizobiales
tRNA CIF method is more robust to base composition convergence
Abstract
Phylogenomic analyses are subject to bias from compositional convergence and noise from horizontal gene transfer (HGT). Compositional convergence is a likely cause of controversy regarding phylogeny of the SAR11 group of Alphaproteobacteria that have extremely streamlined, A+T-biased genomes. While careful modeling can reduce artifacts caused by convergence, the most consistent and robust phylogenetic signal in genomes may lie distributed among encoded functional features that govern macromolecular interactions. Here we develop a novel phyloclassification method based on signatures derived from bioinformatically defined tRNA Class-Informative Features (CIFs). tRNA CIFs are enriched for features that underlie tRNA-protein interactions. Using a simple tRNA-CIF-based phyloclassifier, we obtained results consistent with those of bias-corrected whole proteome phylogenomic studies, rejecting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
