Safe Screening for the Generalized Conditional Gradient Method
Yifan Sun, Francis Bach

TL;DR
This paper introduces a generalized conditional gradient method with safe screening rules that efficiently promotes sparsity in structured regularizers, achieving convergence and support recovery guarantees.
Contribution
It extends the CGM framework to a gauge penalty setting, providing convergence analysis and a safe screening rule for support identification.
Findings
Supports sparse feature selection with stability over hyperparameters
Achieves $O(1/t)$ convergence without bounded iterates
Supports support recovery at rate $O(1/(t ext{delta}^2))$
Abstract
The conditional gradient method (CGM) has been widely used for fast sparse approximation, having a low per iteration computational cost for structured sparse regularizers. We explore the sparsity acquiring properties of a generalized CGM (gCGM), where the constraint is replaced by a penalty function based on a gauge penalty; this can be done without significantly increasing the per-iteration computation, and applies to general notions of sparsity. Without assuming bounded iterates, we show convergence of the function values and gap of gCGM. We couple this with a safe screening rule, and show that at a rate , the screened support matches the support at the solution, where measures how close the problem is to being degenerate. In our experiments, we show that the gCGM for these modified penalties have similar feature selection properties as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
MethodsFeature Selection
