Expected number of induced subtrees shared by two independent copies of a random tree
Boris Pittel

TL;DR
This paper analyzes the expected size of the largest common induced subtree shared by two independent random trees, extending known bounds from binary trees to a broader class of Galton–Watson trees with mean one offspring.
Contribution
It generalizes the known $O(n^{1/2})$ bound for maximum agreement subtree size from binary trees to a wider class of Galton–Watson trees conditioned on leaf count.
Findings
The likely magnitude of MAST is $O(n^{1/2})$ for the considered class of trees.
The bound holds for trees generated by a Galton–Watson process with mean 1 offspring.
The result extends previous bounds from binary trees to more general random tree models.
Abstract
Consider a rooted tree with leaf-set , and with all non-leaf vertices having out-degree , at least. A rooted tree with leaf-set is induced by in if is the lowest common ancestor subtree for , with all its degree-2 vertices suppressed. A "maximum agreement subtree" (MAST) for a pair of two trees and is a tree with a largest leaf-set such that is induced by both in and . Bryant et al. \cite{BryMcKSte} and Bernstein et al. \cite{Ber} proved, among other results, that for and being two independent copies of a random binary (uniform or Yule-Harding distributed) tree , the likely magnitude order of is . We prove this bound for a wide class of random rooted trees : is a terminal tree of a branching, Galton--Watson,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Privacy-Preserving Technologies in Data
