Spectral Top-Down Recovery of Latent Tree Models
Yariv Aizenbud, Ariel Jaffe, Meng Wang, Amber Hu, Noah Amsel, Boaz, Nadler, Joseph T. Chang, Yuval Kluger

TL;DR
This paper introduces Spectral Top-Down Recovery (STDR), a deterministic divide-and-conquer method for efficiently inferring large latent tree models from high-dimensional data, with proven consistency and improved runtime.
Contribution
STDR is a novel spectral, non-random partitioning approach that simplifies tree recovery and merging, outperforming previous methods in speed while maintaining accuracy.
Findings
STDR is statistically consistent under certain conditions.
STDR significantly reduces runtime compared to existing algorithms.
STDR achieves comparable or better accuracy in simulated phylogenetic data.
Abstract
Modeling the distribution of high dimensional data by a latent tree graphical model is a prevalent approach in multiple scientific domains. A common task is to infer the underlying tree structure, given only observations of its terminal nodes. Many algorithms for tree recovery are computationally intensive, which limits their applicability to trees of moderate size. For large trees, a common approach, termed divide-and-conquer, is to recover the tree structure in two steps. First, recover the structure separately of multiple, possibly random subsets of the terminal nodes. Second, merge the resulting subtrees to form a full tree. Here, we develop Spectral Top-Down Recovery (STDR), a deterministic divide-and-conquer approach to infer large latent tree models. Unlike previous methods, STDR partitions the terminal nodes in a non random way, based on the Fiedler vector of a suitable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Species Distribution and Climate Change · Data Analysis with R
MethodsNetwork On Network
