Recovering a tree from the lengths of subtrees spanned by a randomly chosen sequence of leaves
Steven N. Evans, Daniel Lanoue

TL;DR
This paper investigates whether a tree's structure can be uniquely identified from the distribution of subtree lengths spanned by randomly chosen leaves, showing positive results for specific tree families.
Contribution
It demonstrates that under certain conditions and for specific tree families, the joint distribution of subtree lengths uniquely determines the tree structure.
Findings
Unique identification for trees with edges in general position
Identification for ultrametric trees
Identification for trees with equal edge weights in certain families
Abstract
Given an edge-weighted tree with leaves, sample the leaves uniformly at random without replacement and let , , be the length of the subtree spanned by the first leaves. We consider the question, "Can be identified (up to isomorphism) by the joint probability distribution of the random vector ?" We show that if is known {\em a priori} to belong to one of various families of edge-weighted trees, then the answer is, "Yes." These families include the edge-weighted trees with edge-weights in general position, the ultrametric edge-weighted trees, and certain families with equal weights on all edges such as -valent and rooted -ary trees for and caterpillars.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
