Efficient and Robust Estimation of Linear Regression with Normal Errors
Alain Desgagn\'e

TL;DR
This paper introduces a novel estimation method for linear regression with normal errors that maintains high efficiency both with and without outliers by using a mixture model and an EM algorithm.
Contribution
It proposes an original robust estimation approach combining a mixture of normal and FLP distributions, improving efficiency and outlier detection in linear regression.
Findings
N-FLP estimators outperform existing methods in simulations.
The method provides reliable confidence intervals and hypothesis tests.
Effective outlier detection with estimated outlier proportions.
Abstract
Linear regression with normally distributed errors - including particular cases such as ANOVA, Student's t-test or location-scale inference - is a widely used statistical procedure. In this case the ordinary least squares estimator possesses remarkable properties but is very sensitive to outliers. Several robust alternatives have been proposed, but there is still significant room for improvement. This paper thus proposes an original method of estimation that offers the best efficiency simultaneously in the absence and the presence of outliers, both for the estimation of the regression coefficients and the scale parameter. The approach first consists in broadening the normal assumption of the errors to a mixture of the normal and the filtered-log-Pareto (FLP), an original distribution designed to represent the outliers. The expectation-maximization (EM) algorithm is then adapted and we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Advanced Statistical Process Monitoring · Statistical Methods and Inference
