Convex programming approach to robust estimation of a multivariate Gaussian model
Samuel Balmand, Arnak Dalalyan

TL;DR
This paper introduces a convex programming method for robustly estimating the mean and covariance of a multivariate Gaussian distribution in high-dimensional settings with outliers, achieving optimal rates under various norms.
Contribution
It develops a convex optimization-based estimator that is computationally efficient and provably optimal for robust Gaussian parameter estimation, even with high-dimensional data and outliers.
Findings
Estimator is rate optimal under entry-wise , Frobenius, and mixed / norms.
Method extends to high-dimensional cases where the inverse covariance is sparse.
Achieves robustness against a significant proportion of outliers.
Abstract
Multivariate Gaussian is often used as a first approximation to the distribution of high-dimensional data. Determining the parameters of this distribution under various constraints is a widely studied problem in statistics, and is often considered as a prototype for testing new algorithms or theoretical frameworks. In this paper, we develop a nonasymptotic approach to the problem of estimating the parameters of a multivariate Gaussian distribution when data are corrupted by outliers. We propose an estimator---efficiently computable by solving a convex program---that robustly estimates the population mean and the population covariance matrix even when the sample contains a significant proportion of outliers. Our estimator of the corruption matrix is provably rate optimal simultaneously for the entry-wise -norm, the Frobenius norm and the mixed norm. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Advanced Statistical Process Monitoring
