TL;DR
This paper presents a polynomial-time algorithm for completing phylogenetic tree sets with overlapping taxa, preserving branch lengths and topology, and outperforming existing methods in accuracy.
Contribution
Introduces a novel polynomial-time algorithm for set-wide phylogenetic tree completion that maintains distances and is order-independent.
Findings
Algorithm runs in polynomial time.
Achieves lowest distance to reference trees in experiments.
Preserves topology and branch lengths effectively.
Abstract
Comparative analyses of phylogenetic trees typically require identical taxon sets, however, in practice, trees often include distinct but overlapping taxa. Pruning non-shared leaves discards phylogenetic signal, whereas tree completion can preserve both taxa and branch-length information. This work introduces a polynomial-time algorithm for set-wide completion of phylogenetic trees with partial taxon overlap. The proposed method identifies and extracts maximal completion subtrees that frequently appear across the source trees and constructs a weighted majority-rule consensus. Branch lengths are scaled using rates derived from common leaves. Each consensus subtree is inserted at the position that minimizes the quadratic distance error measured against information from the source trees, with candidate positions restricted to the original branches of the target tree. We demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
