TL;DR
This paper demonstrates that the maximum likelihood estimator for matrix and tensor normal models achieves nearly optimal sample complexity and error rates by leveraging geodesic convexity in the Fisher information geometry, without restrictive assumptions.
Contribution
It establishes nearly optimal sample complexity and error bounds for MLE in matrix and tensor normal models using geodesic convexity, independent of well-conditioned or sparse factors.
Findings
MLE achieves nearly optimal sample complexity.
Flip-flop algorithm converges linearly with high probability.
Strong geodesic convexity underpins the analysis.
Abstract
The matrix normal model, i.e., the family of Gaussian matrix-variate distributions whose covariance matrices are the Kronecker product of two lower dimensional factors, is frequently used to model matrix-variate data. The tensor normal model generalizes this family to Kronecker products of three or more factors. We study the estimation of the Kronecker factors of the covariance matrix in the matrix and tensor normal models. For the above models, we show that the maximum likelihood estimator (MLE) achieves nearly optimal nonasymptotic sample complexity and nearly tight error rates in the Fisher-Rao and Thompson metrics. In contrast to prior work, our results do not rely on the factors being well-conditioned or sparse, nor do we need to assume an accurate enough initial guess. For the matrix normal model, all our bounds are minimax optimal up to logarithmic factors, and for the tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Near Optimal Sample Complexity For Matrix And Tensor Normal Models Via Geodesic Convexity· youtube
