Computational Performance and Statistical Accuracy of *BEAST and Comparisons with Other Methods
Huw A. Ogilvie, Joseph Heled, Dong Xie, Alexei J. Drummond

TL;DR
This study evaluates the computational efficiency and accuracy of *BEAST, a Bayesian multispecies coalescent method, demonstrating its advantages over concatenation methods with increasing loci and varying branch lengths through simulation analyses.
Contribution
It provides a detailed analysis of *BEAST's scaling behavior and compares its performance with other methods, guiding practical application in phylogenetics.
Findings
*BEAST's statistical performance improves with more loci and shorter branches.
Using *BEAST with tens of loci can outperform concatenation with thousands.
Simulation results inform on the number of loci needed for desired accuracy.
Abstract
Under the multispecies coalescent model of molecular evolution, gene trees have independent evolutionary histories within a shared species tree. In comparison, supermatrix concatenation methods assume that gene trees share a single common genealogical history, thereby equating gene coalescence with species divergence. The multispecies coalescent is supported by previous studies which found that its predicted distributions fit empirical data, and that concatenation is not a consistent estimator of the species tree. *BEAST, a fully Bayesian implementation of the multispecies coalescent, is popular but computationally intensive, so the increasing size of phylogenetic data sets is both a computational challenge and an opportunity for better systematics. Using simulation studies, we characterize the scaling behaviour of *BEAST, and enable quantitative prediction of the impact increasing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
