Grouping Strategies and Thresholding for High Dimensional Linear Models

Mathilde Mougeot; Dominique Picard; Karine Tribouley

arXiv:1207.2067·math.ST·July 10, 2012

Grouping Strategies and Thresholding for High Dimensional Linear Models

Mathilde Mougeot, Dominique Picard, Karine Tribouley

PDF

TL;DR

This paper introduces the GR-LOL algorithm for high-dimensional linear models with structured sparsity, utilizing grouping and thresholding to improve estimation accuracy, with theoretical convergence guarantees and practical advantages over existing methods.

Contribution

The paper proposes a novel two-step block thresholding algorithm, GR-LOL, with data-driven grouping strategies and proven convergence rates, enhancing high-dimensional regression estimation.

Findings

01

GR-LOL outperforms standard LOL in practical tests

02

Grouping can significantly improve estimation accuracy

03

GR-LOL compares favorably with group-Lasso methods

Abstract

The estimation problem in a high regression model with structured sparsity is investigated. An algorithm using a two steps block thresholding procedure called GR-LOL is provided. Convergence rates are produced: they depend on simple coherence-type indices of the Gram matrix -easily checkable on the data- as well as sparsity assumptions of the model parameters measured by a combination of $l_{1}$ within-blocks with $l_{q}, q < 1$ between-blocks norms. The simplicity of the coherence indicator suggests ways to optimize the rates of convergence when the group structure is not naturally given by the problem and is unknown. In such a case, an auto-driven procedure is provided to determine the regressors groups (number and contents). An intensive practical study compares our grouping methods with the standard LOL algorithm. We prove that the grouping rarely deteriorates the results but can improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.