Least trimmed squares regression with missing values and cellwise outliers
Jakob Raymaekers, Peter J. Rousseeuw

TL;DR
This paper introduces a novel robust regression method capable of handling both casewise and cellwise outliers, as well as missing data, with applicability to skewed distributions and out-of-sample predictions.
Contribution
It presents the first robust regression approach that addresses cellwise outliers, missing data, and skewed distributions simultaneously, also supporting out-of-sample prediction.
Findings
Method performs well in simulations
Effective on real dataset
Obeys first breakdown point for cellwise outliers
Abstract
Regression is the workhorse of statistics, and is often faced with real data that contain outliers. When these are casewise outliers, that is, cases that are entirely wrong or belong to a different population, the issue can be remedied by existing casewise robust regression methods. It is another matter when cellwise outliers occur, that is, suspicious individual entries in the data matrix containing the regressors and the response. We propose a new regression method that is robust to both casewise and cellwise outliers, and handles missing values as well. Its construction allows for skewed distributions. We show that it obeys the first breakdown result for cellwise robust regression. It is also the first such method that is geared to making robust out-of-sample predictions. Its performance is studied by simulation, and it is illustrated on a substantial real dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
