Likelihood landscape of binary latent model on a tree
David Clancy Jr., Hanbaek Lyu, Sebastien Roch

TL;DR
This paper provides a theoretical analysis of the likelihood landscape for binary latent tree models, showing conditions under which coordinate maximization efficiently finds the MLE.
Contribution
It proves strong concavity of the population log-likelihood in the reconstruction regime and explains the success of simple algorithms in latent tree inference.
Findings
Population log-likelihood is strongly concave near the true parameter.
Coordinate maximization converges exponentially fast to an accurate MLE.
Decay property of the population Hessian explains optimization success.
Abstract
We investigate the optimization landscape of maximum likelihood estimation (MLE) for the Cavender-Farris-Neyman (CFN) model, a two-state latent tree model fundamental to statistical phylogenetics and the ferromagnetic Ising model. Although the log-likelihood function is non-concave and may admit many critical points, simple coordinate maximization algorithms are remarkably effective in practice. We provide the first theoretical justification for this success. We prove that sufficiently deep inside the reconstruction regime, the population log-likelihood is strongly concave and smooth within a box around the true parameter, whose size is independent of tree topology and number of leaves. This fundamental result implies that the empirical landscape shares these regularity properties with high probability given polynomial sample complexity and also that coordinate maximization converges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Computer Graphics and Visualization Techniques · Statistical and Computational Modeling
