Selection and Estimation Optimality in High Dimensions with the TWIN Penalty

Xiaowu Dai; Jared D. Huling

arXiv:1806.01936 · stat.ME · June 7, 2018

TL;DR

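The paper introduces the TWIN class of penalties for variable selection in high-dimensional linear models. Under a linear sparsity regime and random Gaussian designs, TWIN penalties select the correct model with high probability and yield minimax optimal estimators, and two example penalties admit simple coordinate descent algorithms.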

Contribution

The paper proposes the TWIN penalty class, which penalizes in a data-adaptive way, and pairs it with theoretical guarantees on selection and estimation and with coordinate descent algorithms suited to large data settings.

Findings

01

Under a linear sparsity regime and random Gaussian designs, TWIN penalties select the correct model with high probability.

02

TWIN penalties result in minimax optimal estimators.

03

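In simulations with high correlations between active and inactive variables, TWIN shows high power in variable selection and improved empirical performance over existing penalties.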

Abstract

We introduce a novel class of variable selection penalties called TWIN, which provides sensible data-adaptive penalization. Under a linear sparsity regime and random Gaussian designs we show that penalties in the TWIN class have a high probability of selecting the correct model and furthermore result in minimax optimal estimators. The general shape of penalty functions in the TWIN class is the key ingredient to its desirable properties and results in improved theoretical and empirical performance over existing penalties. In this work we introduce two examples of TWIN penalties that admit simple and efficient coordinate descent algorithms, making TWIN practical in large data settings. We demonstrate in challenging and realistic simulation settings with high correlations between active and inactive variables that TWIN has high power in variable selection while controlling the number of…
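The abstract notes that the two example TWIN penalties admit simple coordinate descent algorithms, but this page does not give the penalty functions or their update rules. As a rough illustration of the general technique only, the sketch below runs cyclic coordinate descent for a penalized least-squares problem with a pluggable thresholding rule. The default rule is the lasso's soft-thresholding operator, used here as a stand-in; it is not a TWIN penalty. The names `soft_threshold`, `coordinate_descent`, `threshold`, and `n_sweeps` are hypothetical and do not come from the paper.

```python
def soft_threshold(z, lam):
    """Lasso thresholding rule, used as a stand-in.

    Assumption: a TWIN penalty would plug in its own rule here;
    that rule is not given on this page.
    """
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def coordinate_descent(X, y, lam, threshold=soft_threshold,
                       n_sweeps=200, tol=1e-10):
    """Cyclic coordinate descent for (1/2n)||y - X b||^2 + penalty(b).

    X is a list of n rows, each a list of p floats; y is a list of n floats.
    With the default threshold this solves the lasso.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X beta, with beta starting at zero
    # Mean squared norm of each column, (1/n) * x_j^T x_j.
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]

    for _ in range(n_sweeps):
        max_change = 0.0
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # an all-zero column carries no information
            # Correlation of column j with the partial residual
            # (the residual with coordinate j's contribution added back).
            rho = sum(X[i][j] * resid[i] for i in range(n)) / n
            rho += col_sq[j] * beta[j]
            new_bj = threshold(rho, lam) / col_sq[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                # Keep the residual in sync with the updated coefficient.
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_bj
                max_change = max(max_change, abs(delta))
        if max_change < tol:
            break  # no coordinate moved appreciably in this sweep
    return beta


# Small orthogonal design: the first variable is active, the second is weak.
X_demo = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
y_demo = [2.0, 0.1, 2.0, 0.1]
beta_demo = coordinate_descent(X_demo, y_demo, lam=0.1)
print(beta_demo)
```

Passing the thresholding rule as an argument keeps the sweep logic independent of the penalty, so a different one-dimensional update could be swapped in without touching the loop. Each coordinate update costs O(n) because the residual is maintained incrementally, which is the usual reason coordinate descent stays practical on large data.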

Taxonomy

Topics: Statistical Methods and Inference · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms