# Selection consistency of Lasso-based procedures for misspecified   high-dimensional binary model and random regressors

**Authors:** Mariusz Kubkowski, Jan Mielniczuk

arXiv: 1906.04175 · 2020-02-19

## TL;DR

This paper studies the consistency of Lasso-based methods for selecting predictors in high-dimensional binary regression models, including misspecified and semiparametric cases, demonstrating their effectiveness under certain conditions.

## Contribution

It introduces a two-step Lasso-based selection procedure with proven consistency in high-dimensional, potentially misspecified binary models, extending existing theoretical guarantees.

## Key findings

- The proposed method is consistent for large predictor sets.
- Support of true parameters can be accurately recovered.
- Method applies to semiparametric models with linear predictor conditions.

## Abstract

We consider selection of random predictors for high-dimensional regression problem with binary response for a general loss function. Important special case is when the binary model is semiparametric and the response function is misspecified under parametric model fit. Selection for such a scenario aims at recovering the support of the minimizer of the associated risk with large probability. We propose a two-step selection procedure which consists of screening and ordering predictors by Lasso method and then selecting a subset of predictors which minimizes Generalized Information Criterion on the corresponding nested family of models. We prove consistency of the selection method under conditions which allow for much larger number of predictors than number of observations. For the semiparametric case when distribution of random predictors satisfies linear regression conditions the true and the estimated parameters are collinear and their common support can be consistently identified.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.04175/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1906.04175/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1906.04175/full.md

---
Source: https://tomesphere.com/paper/1906.04175