SDSR: A Spectral Divide-and-Conquer Approach for Species Tree Reconstruction
Ortal Reshef (1), Ofer Glassman (3), Or Zuk (1), Yariv Aizenbud (2), Boaz Nadler (3), Ariel Jaffe (1) ((1) Hebrew University of Jerusalem, (2) Tel Aviv University, (3) Weizmann Institute of Science)

TL;DR
SDSR is a scalable spectral divide-and-conquer method for species tree reconstruction that offers significant runtime improvements while maintaining accuracy, effectively handling large datasets with complex evolutionary histories.
Contribution
The paper introduces SDSR, a novel spectral graph theory-based divide-and-conquer algorithm for efficient and accurate species tree reconstruction from large genetic datasets.
Findings
SDSR achieves up to 10-fold faster runtimes compared to full-data methods.
SDSR maintains comparable accuracy to traditional methods on synthetic datasets.
Theoretical guarantees support SDSR's recovery performance under the MSC model.
Abstract
Recovering a tree that represents the evolutionary history of a group of species is a key task in phylogenetics. Performing this task using sequence data from multiple genetic markers poses two key challenges. The first is the discordance between the evolutionary history of individual genes and that of the species. The second challenge is computational, as contemporary studies involve thousands of species. Here we present SDSR, a scalable divide-and-conquer approach for species tree reconstruction based on spectral graph theory. The algorithm recursively partitions the species into subsets until their sizes are below a given threshold. The trees of these subsets are reconstructed by a user-chosen species tree algorithm. Finally, these subtrees are merged to form the full tree. On the theoretical front, we derive recovery guarantees for SDSR, under the multispecies coalescent (MSC)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genome Rearrangement Algorithms · Cancer Genomics and Diagnostics
