A fast and slightly robust covariance estimator
John Duchi, Saminul Haque, Rohith Kuditipudi

TL;DR
This paper introduces a fast, robust covariance estimator for subgaussian and heavy-tailed data that operates efficiently under low contamination levels, improving sample complexity and runtime over previous methods.
Contribution
The authors develop a near-linear sample complexity and efficient algorithm for covariance estimation under low contamination, applicable to both subgaussian and heavy-tailed distributions.
Findings
Achieves near-linear sample complexity $\widetilde{\Omega}(d)$ for subgaussian data.
Provides an algorithm with runtime $O((n+d)^{\omega + 1/2})$, faster than previous exponential-time methods.
Works effectively for heavy-tailed data within a smaller contamination regime.
Abstract
Let from a distribution with mean zero and covariance . Given a dataset such that , we are interested in finding an efficient estimator that achieves . We focus on the low contamination regime ). In this regime, prior work required either samples or runtime that is exponential in . We present an algorithm that, for subgaussian data, has near-linear sample complexity and runtime , where is the matrix multiplication exponent. We also show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
