On the lumpability of tree-valued Markov chains
Rodrigo B. Alves, Yuri F. Saporito, Luiz M. Carvalho

TL;DR
This paper explores conditions under which Markov processes on phylogenetic trees can be simplified through lumping, enabling more efficient statistical analysis of complex tree spaces.
Contribution
It introduces criteria for exact and approximate lumpability of tree-valued Markov chains, leveraging combinatorial structures to improve Monte Carlo methods.
Findings
Bounds on lumping error for SPR graph processes
Conditions for exact lumpability of tree shapes
Enhanced Monte Carlo estimation using lumpability
Abstract
Phylogenetic trees constitute an interesting class of objects for stochastic processes due to the non-standard nature of the space they inhabit. In particular, many statistical applications require the construction of Markov processes on the space of trees, whose cardinality grows superexponentially with the number of leaves considered. We investigate whether certain lower-dimensional projections of tree space preserve the Markov property in tree-valued Markov processes. We study exact lumpability of tree shapes and -lumpability of clades, exploiting the combinatorial structure of the SPR graph to obtain bounds on the lumping error under the random walk and Metropolis-Hastings processes. Finally, we show how to use these results in empirical investigation, leveraging exact and -lumpability to improve Monte Carlo estimation of tree-related quantities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods
