Improved Complexities for Stochastic Conditional Gradient Methods under   Interpolation-like Conditions

Tesi Xiao; Krishnakumar Balasubramanian; Saeed Ghadimi

arXiv:2006.08167·math.OC·January 28, 2022

Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions

Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

PDF

Open Access

TL;DR

This paper improves the theoretical understanding of stochastic conditional gradient methods in over-parametrized machine learning, demonstrating better oracle complexities under interpolation-like conditions for convex objectives.

Contribution

It introduces improved complexity bounds for stochastic conditional gradient methods leveraging interpolation-like conditions, including a gradient sliding technique for further efficiency.

Findings

01

Convex case requires O(ε^{-2}) stochastic gradient calls

02

Gradient sliding reduces calls to O(ε^{-1.5})

03

Leverages interpolation-like conditions for complexity improvements

Abstract

We analyze stochastic conditional gradient methods for constrained optimization problems arising in over-parametrized machine learning. We show that one could leverage the interpolation-like conditions satisfied by such models to obtain improved oracle complexities. Specifically, when the objective function is convex, we show that the conditional gradient method requires $O (ϵ^{- 2})$ calls to the stochastic gradient oracle to find an $ϵ$ -optimal solution. Furthermore, by including a gradient sliding step, we show that the number of calls reduces to $O (ϵ^{- 1.5})$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research