Variable selection with Hamming loss

Cristina Butucea; Mohamed Ndaoud; Natalia A. Stepanova; Alexandre; B. Tsybakov

arXiv:1512.01832·math.ST·October 15, 2018

Variable selection with Hamming loss

Cristina Butucea, Mohamed Ndaoud, Natalia A. Stepanova, Alexandre, B. Tsybakov

PDF

TL;DR

This paper establishes non-asymptotic bounds and explicit minimax selectors for variable selection under Hamming loss in Gaussian models, extending results to dependent, non-Gaussian data, and crowdsourcing, with adaptive procedures for recovery.

Contribution

It provides the first explicit minimax risk bounds and selectors for variable selection under Hamming loss, including extensions to dependent and non-Gaussian data, with adaptive methods for recovery.

Findings

01

Derived non-asymptotic minimax risk bounds

02

Explicit minimax selectors for variable selection

03

Adaptive procedures for near-perfect recovery

Abstract

We derive non-asymptotic bounds for the minimax risk of variable selection under expected Hamming loss in the Gaussian mean model in $R^{d}$ for classes of $s$ -sparse vectors separated from 0 by a constant $a > 0$ . In some cases, we get exact expressions for the nonasymptotic minimax risk as a function of $d, s, a$ and find explicitly the minimax selectors. These results are extended to dependent or non-Gaussian observations and to the problem of crowdsourcing. Analogous conclusions are obtained for the probability of wrong recovery of the sparsity pattern. As corollaries, we derive necessary and sufficient conditions for such asymptotic properties as almost full recovery and exact recovery. Moreover, we propose data-driven selectors that provide almost full and exact recovery adaptively to the parameters of the classes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.