Robust and efficient estimation of high dimensional scatter and location
Ricardo A. Maronna, Victor J. Yohai

TL;DR
This paper introduces and evaluates robust high-dimensional estimators for scatter and location that balance efficiency and resistance to outliers, outperforming traditional methods in large p settings.
Contribution
It compares three families of estimators—non-monotonic S-estimators, MM-estimators, and tau-estimators—using novel starting points, demonstrating their effectiveness in high-dimensional robust estimation.
Findings
Rocke and MM estimators achieve high efficiency and robustness with proper tuning.
Starting estimators significantly influence the performance of robust estimators.
Simulation results confirm the advantages of proposed estimators in high-dimensional settings.
Abstract
We deal with the equivariant estimation of scatter and location for p-dimensional data, giving emphasis to scatter. It it important that the estimators possess both a high efficiency for normal data and a high resistance to outliers, that is, a low bias under contamination. The most frequently employed estimators are not quite satisfactory in this respect. The Minimum Volume Ellipsoid (MVE) and Minimum Covariance Determinant (MCD) estimators are known to have a very low efficiency. S-Estimators (Davies 1987) with a monotonic weight function like the bisquare behave satisfactorily for "small" p, say p not larger than 10. Rocke (1996) showed that their efficiency tends to one with increasing p. Unfortunately, this advantage is paid with a serious loss of robustness for large p. We consider three families of estimators with controllable efficiencies: non-monotonic S-estimators (Rocke…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Advanced Statistical Process Monitoring
