From Trees to Barcodes and Back Again II: Combinatorial and Probabilistic Aspects of a Topological Inverse Problem
Justin Curry, Jordan DeSha, Ad\'elie Garin, Kathryn Hess, Lida Kanari,, Brendan Mallery

TL;DR
This paper explores the combinatorial and probabilistic aspects of constructing merge trees from barcodes, revealing new structural distinctions and distribution formulas that enhance understanding of topological inverse problems.
Contribution
It provides a clear combinatorial distinction between phylogenetic and merge trees and derives formulas for the distribution of tree realization numbers under uniform barcode sampling.
Findings
Distinct counts of strata for BHV and merge trees.
Formulas for the distribution of realization numbers.
Higher moments characterized via Dirichlet convolution.
Abstract
In this paper we consider two aspects of the inverse problem of how to construct merge trees realizing a given barcode. Much of our investigation exploits a recently discovered connection between the symmetric group and barcodes in general position, based on the simple observation that death order is a permutation of birth order. The first important outcome of our study is a clear combinatorial distinction between the space of phylogenetic trees (as defined by Billera, Holmes and Vogtmann) and the space of merge trees. Generic BHV trees on leaf nodes fall into distinct strata, but the analogous number for merge trees is equal to the number of maximal chains in the lattice of partitions, i.e., . The second aspect of our study is the derivation of precise formulas for the distribution of tree realization numbers (the number of merge trees realizing a given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Fractal and DNA sequence analysis · Image Retrieval and Classification Techniques
