Fast Estimation of the Median Covariation Matrix with Application to Online Robust Principal Components Analysis
Herv\'e Cardot, Antoine Godichon-Baggioni

TL;DR
This paper introduces recursive algorithms for fast, online estimation of the median covariation matrix, enabling robust principal components analysis in high-dimensional, large-scale data with applications to outlier detection.
Contribution
It proposes new recursive estimators for the median covariation matrix with proven asymptotic properties, suitable for online robust PCA in high-dimensional settings.
Findings
Recursive algorithms are computationally efficient for large data.
Robust PCA performs well compared to traditional methods.
Algorithms are implemented in the R package Gmedian.
Abstract
The geometric median covariation matrix is a robust multivariate indicator of dispersion which can be extended without any difficulty to functional data. We define estimators, based on recursive algorithms, that can be simply updated at each new observation and are able to deal rapidly with large samples of high dimensional data without being obliged to store all the data in memory. Asymptotic convergence properties of the recursive algorithms are studied under weak conditions. The computation of the principal components can also be performed online and this approach can be useful for online outlier detection. A simulation study clearly shows that this robust indicator is a competitive alternative to minimum covariance determinant when the dimension of the data is small and robust principal components analysis based on projection pursuit and spherical projections for high dimension…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
