Safe Screening for the Generalized Conditional Gradient Method

Yifan Sun; Francis Bach

arXiv:2002.09718·cs.LG·February 25, 2020·5 cites

Safe Screening for the Generalized Conditional Gradient Method

Yifan Sun, Francis Bach

PDF

Open Access

TL;DR

This paper introduces a generalized conditional gradient method with safe screening rules that efficiently promotes sparsity in structured regularizers, achieving convergence and support recovery guarantees.

Contribution

It extends the CGM framework to a gauge penalty setting, providing convergence analysis and a safe screening rule for support identification.

Findings

01

Supports sparse feature selection with stability over hyperparameters

02

Achieves $O(1/t)$ convergence without bounded iterates

03

Supports support recovery at rate $O(1/(t ext{delta}^2))$

Abstract

The conditional gradient method (CGM) has been widely used for fast sparse approximation, having a low per iteration computational cost for structured sparse regularizers. We explore the sparsity acquiring properties of a generalized CGM (gCGM), where the constraint is replaced by a penalty function based on a gauge penalty; this can be done without significantly increasing the per-iteration computation, and applies to general notions of sparsity. Without assuming bounded iterates, we show $O (1/ t)$ convergence of the function values and gap of gCGM. We couple this with a safe screening rule, and show that at a rate $O (1/ (t δ^{2}))$ , the screened support matches the support at the solution, where $δ \geq 0$ measures how close the problem is to being degenerate. In our experiments, we show that the gCGM for these modified penalties have similar feature selection properties as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research

MethodsFeature Selection