# Improving Lasso for model selection and prediction

**Authors:** Piotr Pokarowski, Wojciech Rejchel, Agnieszka Soltys, Michal Frej and, Jan Mielniczuk

arXiv: 1907.03025 · 2021-01-26

## TL;DR

This paper introduces a new, computationally efficient method to improve Lasso for model selection across various convex loss functions, demonstrating superior accuracy and theoretical consistency compared to existing approaches.

## Contribution

The paper proposes a novel model selection algorithm based on ordering Lasso coefficients and using the Generalized Information Criterion, applicable to diverse models and free from unknown parameter reliance.

## Key findings

- The method achieves consistent model selection under weak assumptions.
- Numerical experiments show improved accuracy over concave regularizations.
- The algorithm is implemented in the R package "DMRnet" and outperforms existing methods.

## Abstract

It is known that the Thresholded Lasso (TL), SCAD or MCP correct intrinsic estimation bias of the Lasso. In this paper we propose an alternative method of improving the Lasso for predictive models with general convex loss functions which encompass normal linear models, logistic regression, quantile regression or support vector machines. For a given penalty we order the absolute values of the Lasso non-zero coefficients and then select the final model from a small nested family by the Generalized Information Criterion. We derive exponential upper bounds on the selection error of the method. These results confirm that, at least for normal linear models, our algorithm seems to be the benchmark for the theory of model selection as it is constructive, computationally efficient and leads to consistent model selection under weak assumptions. Constructivity of the algorithm means that, in contrast to the TL, SCAD or MCP, consistent selection does not rely on the unknown parameters as the cone invertibility factor. Instead, our algorithm only needs the sample size, the number of predictors and an upper bound on the noise parameter. We show in numerical experiments on synthetic and real-world data sets that an implementation of our algorithm is more accurate than implementations of studied concave regularizations. Our procedure is contained in the R package "DMRnet" and available on the CRAN repository.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.03025/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1907.03025/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1907.03025/full.md

---
Source: https://tomesphere.com/paper/1907.03025