Robust Principal Components by Casewise and Cellwise Weighting
Fabio Centofanti, Mia Hubert, Peter J. Rousseeuw

TL;DR
This paper introduces cellPCA, a robust PCA method that effectively handles casewise and cellwise outliers, as well as missing data, using a combined loss function and IRLS algorithm, with demonstrated superior performance.
Contribution
The paper proposes a novel robust PCA approach, cellPCA, capable of simultaneously addressing casewise outliers, cellwise outliers, and missing data within a unified framework.
Findings
CellPCA effectively detects outliers using residual cellmaps.
It outperforms traditional PCA in contaminated data scenarios.
The method is validated through simulations and real data examples.
Abstract
Principal component analysis (PCA) is a fundamental tool for analyzing multivariate data. Here the focus is on dimension reduction to the principal subspace, characterized by its projection matrix. The classical principal subspace can be strongly affected by the presence of outliers. Traditional robust approaches consider casewise outliers, that is, cases generated by an unspecified outlier distribution that differs from that of the clean cases. But there may also be cellwise outliers, which are suspicious entries that can occur anywhere in the data matrix. Another common issue is that some cells may be missing. This paper proposes a new robust PCA method, called cellPCA, that can simultaneously deal with casewise outliers, cellwise outliers, and missing cells. Its single objective function combines two robust loss functions, that together mitigate the effect of casewise and cellwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Multi-Criteria Decision Making
