Robust Estimation of Covariance Matrices: Adversarial Contamination and Beyond
Stanislav Minsker, Lang Wang

TL;DR
This paper develops a robust covariance matrix estimator effective in high-dimensional, contaminated, or heavy-tailed data scenarios, with theoretical guarantees and efficient algorithms.
Contribution
It introduces an adaptive estimator for low-rank covariance matrices robust to adversarial contamination and heavy tails, with tight deviation bounds and practical algorithms.
Findings
Estimator achieves tight deviation guarantees under weak assumptions.
Method adapts to low-rank structure and contamination proportion.
Algorithms enable efficient approximation of the estimator.
Abstract
We consider the problem of estimating the covariance structure of a random vector from a sample . We are interested in the situation when is large compared to but the covariance matrix of interest has (exactly or approximately) low rank. We assume that the given sample is (a) -adversarially corrupted, meaning that fraction of the observations could have been replaced by arbitrary vectors, or that (b) the sample is i.i.d. but the underlying distribution is heavy-tailed, meaning that the norm of possesses only finite fourth moments. We propose an estimator that is adaptive to the potential low-rank structure of the covariance matrix as well as to the proportion of contaminated data, and admits tight deviation guarantees despite rather weak assumptions on the underlying distribution. Finally, we discuss the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Advanced Statistical Methods and Models
