Calculating higher-order moments of phylogenetic stochastic mapping summaries in linear time
Amrit Dhar, Vladimir N. Minin

TL;DR
This paper introduces a linear-time, simulation-free dynamic programming algorithm to compute higher-order moments of stochastic mapping summaries in phylogenetics, enhancing analysis of evolutionary processes.
Contribution
The authors develop the first efficient algorithm for calculating higher-order moments of stochastic mapping summaries, extending existing methods beyond mean and variance.
Findings
Algorithm scales linearly with phylogeny size
Enables computation of prior and posterior variances
Improves statistical tests for rate variation and conserved regions
Abstract
Stochastic mapping is a simulation-based method for probabilistically mapping substitution histories onto phylogenies according to continuous-time Markov models of evolution. This technique can be used to infer properties of the evolutionary process on the phylogeny and, unlike parsimony-based mapping, conditions on the observed data to randomly draw substitution mappings that do not necessarily require the minimum number of events on a tree. Most stochastic mapping applications simulate substitution mappings only to estimate the mean and/or variance of two commonly used mapping summaries: the number of particular types of substitutions (labeled substitution counts) and the time spent in a particular group of states (labeled dwelling times) on the tree. Fast, simulation-free algorithms for calculating the mean of stochastic mapping summaries exist. Importantly, these algorithms scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic diversity and population structure · Algorithms and Data Compression
