
TL;DR
CRTB is a novel robust dimension reduction method that effectively handles scattered cellwise outliers in multivariate data, maintaining interpretability and high fidelity in outlier detection.
Contribution
It introduces the first cellwise robust two-block dimension reduction technique with efficient algorithms and demonstrates superior performance over existing methods.
Findings
Resists contamination in over 50% of rows.
Recovers cellwise outlier patterns with high fidelity.
Produces interpretable results in practical examples.
Abstract
Cellwise Robust Twoblock (CRTB) is introduced, the first cellwise robust method for simultaneous dimension reduction of multivariate predictor and response blocks, in both a dense and a sparse variable-selecting variant. Classical robust methods protect against casewise outliers by downweighting or removing entire observations, a strategy that becomes inefficient -- and eventually breaks down -- when contamination is scattered across individual cells rather than concentrated in whole rows. CRTB combines a column-wise pre-filter for cellwise outlier detection with model-based imputation of flagged cells inside an iteratively reweighted M-estimation loop, retaining the clean cells of partially contaminated rows instead of discarding the observation. An efficient algorithm is provided that uses the classical twoblock SVD as a warm start and converges in a handful of IRLS iterations at a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
