High-dimensional inference robust to outliers with l1-norm penalization

Jad Beyhum

arXiv:2012.14118·math.ST·February 8, 2021

Open Access

TL;DR

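This paper introduces an inference method for high-dimensional linear regression that is robust to outliers. It combines square-root lasso l1-norm penalization with a second-step ordinary least squares regression, attains the semiparametric efficiency bound of the outlier-free model under homoscedasticity, and requires solving only a finite number of convex optimization programs.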

Contribution

It proposes a two-step inference procedure, several square-root lasso estimators followed by OLS on a well-chosen regression, that is robust to outliers and attains the semiparametric efficiency bound of the model without outliers under homoscedasticity.

Findings

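01. The two-step estimator is shown to be asymptotically normal.

02. The method attains the semiparametric efficiency bound when applied to the model without outliers under homoscedasticity.

03. The procedure is computationally simple: it amounts to solving a finite number of convex optimization programs.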

Abstract

This paper studies inference in the high-dimensional linear regression model with outliers. Sparsity constraints are imposed on the vector of coefficients of the covariates. The number of outliers can grow with the sample size while their proportion goes to 0. We propose a two-step procedure for inference on the coefficients of a fixed subset of regressors. The first step is based on several square-root lasso l1-norm penalized estimators, while the second step is the ordinary least squares estimator applied to a well-chosen regression. We establish asymptotic normality of the two-step estimator. The proposed procedure is efficient in the sense that it attains the semiparametric efficiency bound when applied to the model without outliers under homoscedasticity. This approach is also computationally advantageous: it amounts to solving a finite number of convex optimization programs.
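The square-root lasso named in the abstract can be illustrated with a minimal sketch. The code below is not the paper's procedure: it shows only one generic square-root lasso fit, min_b ||y - Xb||_2 / sqrt(n) + lam * ||b||_1, computed by the scaled-lasso alternation (update the noise scale, then run a lasso with penalty lam * sigma). The function names, the synthetic data, and the choice of lam are all assumptions made for illustration. The paper's first step combines several such estimators, and its second-step regression is not specified in the abstract, so it is not reproduced here.

```python
import numpy as np


def soft_threshold(z, t):
    """Soft-thresholding operator used in lasso coordinate updates."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def sqrt_lasso(X, y, lam, n_outer=50, n_inner=100, tol=1e-8):
    """Illustrative square-root lasso via scaled-lasso iterations.

    Alternates between (a) a coordinate-descent lasso with penalty
    lam * sigma and (b) the update sigma = ||y - X b||_2 / sqrt(n).
    Returns the coefficient vector and the final noise-scale estimate.
    """
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n          # x_j' x_j / n for each column
    sigma = np.linalg.norm(y) / np.sqrt(n)     # initial scale with b = 0
    for _ in range(n_outer):
        r = y - X @ b                          # current residual
        for _ in range(n_inner):
            b_old = b.copy()
            for j in range(p):
                if col_sq[j] == 0.0:
                    continue
                r += X[:, j] * b[j]            # remove coordinate j from the fit
                rho = X[:, j] @ r / n
                b[j] = soft_threshold(rho, lam * sigma) / col_sq[j]
                r -= X[:, j] * b[j]            # add the updated coordinate back
            if np.max(np.abs(b - b_old)) < tol:
                break
        sigma_new = max(np.linalg.norm(r) / np.sqrt(n), 1e-12)
        converged = abs(sigma_new - sigma) < tol
        sigma = sigma_new
        if converged:
            break
    return b, sigma


# Synthetic sparse regression (hypothetical data, no outliers).
rng = np.random.default_rng(0)
n, p = 200, 50
X = rng.standard_normal((n, p))
beta = np.zeros(p)
beta[:2] = [3.0, -2.0]
y = X @ beta + 0.5 * rng.standard_normal(n)

# A common rate-type penalty level; the constant 1.1 is an assumption.
lam = 1.1 * np.sqrt(2.0 * np.log(p) / n)
b_hat, sigma_hat = sqrt_lasso(X, y, lam)
print(np.round(b_hat[:4], 2), round(float(sigma_hat), 2))
```

A practical appeal of the square-root lasso, visible in the sketch, is that the penalty level `lam` does not need to be scaled by the unknown noise standard deviation, since the scale is estimated jointly with the coefficients.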

Taxonomy

Topics: Advanced Statistical Methods and Models · Statistical Methods and Inference · Advanced Statistical Process Monitoring