Maximum Likelihood Estimation for Linear Gaussian Covariance Models
Piotr Zwiernik, Caroline Uhler, Donald Richards

TL;DR
This paper analyzes the behavior of maximum likelihood estimation in linear Gaussian covariance models, showing that it often behaves like a convex problem for sample sizes roughly 14 times the dimension, despite the non-convex nature.
Contribution
The paper provides new theoretical conditions ensuring convergence to the global maximum in non-convex MLE problems for Gaussian covariance models, supported by numerical evidence.
Findings
MLE behaves as convex for n ≈ 14p
Conditions guarantee convergence to global maximum
Results hold even for small p, like p=2
Abstract
We study parameter estimation in linear Gaussian covariance models, which are -dimensional Gaussian models with linear constraints on the covariance matrix. Maximum likelihood estimation for this class of models leads to a non-convex optimization problem which typically has many local maxima. Using recent results on the asymptotic distribution of extreme eigenvalues of the Wishart distribution, we provide sufficient conditions for any hill-climbing method to converge to the global maximum. Although we are primarily interested in the case in which , the proofs of our results utilize large-sample asymptotic theory under the scheme . Remarkably, our numerical simulations indicate that our results remain valid for as small as . An important consequence of this analysis is that for sample sizes , maximum likelihood estimation for linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
