CVTree for 16S rRNA: Constructing Taxonomy-Compatible All-Species Living Tree Effectively and Efficiently
Yi-Fei Lu, Xiao-Yang Zhi, Guang-Hong Zuo

TL;DR
This paper demonstrates that the CVTree method efficiently constructs accurate phylogenetic trees from 16S rRNA sequences, outperforming traditional alignment methods in speed while maintaining high consistency with taxonomy.
Contribution
It adapts the alignment-free CVTree approach for 16S rRNA, enabling fast and reliable phylogenetic analysis aligned with existing taxonomy.
Findings
CVTree achieves 10-1000x faster computation than alignment-based methods.
High consistency of CVTree with established prokaryotic taxonomy.
Outperforms some multiple sequence alignment approaches in accuracy.
Abstract
The Composition Vector Tree (CVTree) method, developed under the leadership of Professor Hao Bailin, is an alignment-free algorithm for constructing phylogenetic trees. Although initially designed for studying prokaryotic evolution based on whole-genome, it has demonstrated broad applicability across diverse biological systems and gene sequences. In this study, we employed two methods, InterList and Hao, of CVTree to investigate the phylogeny and taxonomy of prokaryote based on the 16S rRNA sequences from All-Species Living Tree Project. We have established a comprehensive phylogenetic tree that incorporates the majority of species documented in human scientific knowledge and compared it with the taxonomy of prokaryotes. And the performance of CVTree were also compared with multiple sequence alignment-based approaches. Our results revealed that CVTree methods achieve computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
