Neighbourhoods of phylogenetic trees: exact and asymptotic counts
Jamie V. de Jong, Jeanette C McLeod, Mike Steel

TL;DR
This paper investigates the number of phylogenetic trees at various distances from a given tree under different rearrangement operations, providing exact and asymptotic counts to aid in understanding tree space structure.
Contribution
It offers new exact and asymptotic formulas for the counts of trees at specific distances, enhancing understanding of phylogenetic tree neighborhoods under key operations.
Findings
Derived exact counts for trees at distance two and three.
Provided asymptotic estimates for large trees.
Identified shape features influencing neighborhood sizes.
Abstract
A central theme in phylogenetics is the reconstruction and analysis of evolutionary trees from a given set of data. To determine the optimal search methods for reconstructing trees, it is crucial to understand the size and structure of the neighbourhoods of trees under tree rearrangement operations. The diameter and size of the immediate neighbourhood of a tree has been well-studied, however little is known about the number of trees at distance two, three or (more generally) from a given tree. In this paper we provide a number of exact and asymptotic results concerning these quantities, and identify some key aspects of tree shape that play a role in determining these quantities. We obtain several new results for two of the main tree rearrangement operations - Nearest Neighbour Interchange and Subtree Prune and Regraft -- as well as for the Robinson-Foulds metric on trees.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Genomics and Phylogenetic Studies · Genetic diversity and population structure
