Gradients do grow on trees: a linear-time ${\cal O}\hspace{-0.2em}\left( N \right)$-dimensional gradient for statistical phylogenetics
Xiang Ji, Zhenyu Zhang, Andrew Holbrook, Akihiko Nishimura, Guy Baele,, Andrew Rambaut, Philippe Lemey, Marc A. Suchard

TL;DR
This paper introduces a linear-time algorithm for gradient computation in phylogenetics, enabling efficient analysis of large-scale sequence data without assuming stationarity or reversibility.
Contribution
The authors develop a novel linear-time gradient calculation method for phylogenetic models, significantly improving computational efficiency over traditional quadratic-time algorithms.
Findings
Achieved 126- to 234-fold speedup in maximum-likelihood optimization.
Realized 16- to 33-fold performance increase in Bayesian inference.
Applied method to infer evolutionary rates of multiple pathogenic viruses.
Abstract
Calculation of the log-likelihood stands as the computational bottleneck for many statistical phylogenetic algorithms. Even worse is its gradient evaluation, often used to target regions of high probability. Order -dimensional gradient calculations based on the standard pruning algorithm require operations where N is the number of sampled molecular sequences. With the advent of high-throughput sequencing, recent phylogenetic studies have analyzed hundreds to thousands of sequences, with an apparent trend towards even larger data sets as a result of advancing technology. Such large-scale analyses challenge phylogenetic reconstruction by requiring inference on larger sets of process parameters to model the increasing data heterogeneity. To make this tractable, we present a linear-time algorithm for ${\cal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Evolution and Paleontology Studies · Bioinformatics and Genomic Networks
