Distributions of topological tree metrics between a species tree and a gene tree
Jing Xi, Jin Xie, Ruriko Yoshida

TL;DR
This paper investigates the statistical distributions of three tree distance metrics between species and gene trees, providing theoretical insights and simulations to aid in phylogenetic discordance analysis.
Contribution
It offers a detailed analysis of the distributions of three key tree distance metrics, including theoretical results and simulations, for random and species-gene tree comparisons.
Findings
Distributions of three tree metrics are characterized for random trees.
The study provides theoretical and simulation-based insights into these distributions.
Results facilitate statistical analysis of phylogenetic discordance.
Abstract
In order to conduct a statistical analysis on a given set of phylogenetic gene trees, we often use a distance measure between two trees. In a statistical distance-based method to analyze discordance between gene trees, it is a key to decide "biological meaningful" and "statistically well-distributed" distance between trees. Thus, in this paper, we study the distributions of the three tree distance metrics: the edge difference, the path difference, and the precise interval cospeciation distance, between two trees: first, we focus on distributions of the three tree distances between two random unrooted trees with leaves (); and then we focus on the distributions the three tree distances between a fixed rooted species tree with leaves and a random gene tree with leaves generated under the coalescent process with given the species tree. We show some theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Bioinformatics and Genomic Networks · Gene expression and cancer classification
