Limitations on Variance-Reduction and Acceleration Schemes for Finite Sum Optimization

Yossi Arjevani

arXiv:1706.01686 · math.OC · December 8, 2017 · 1 citation

TL;DR

This paper investigates the limits of variance-reduction and acceleration techniques in finite sum optimization. It shows that the known fast complexity bounds require extra information: which individual function the oracle refers to at each iteration and, for accelerated rates, an explicit strong convexity parameter.

Contribution

It establishes fundamental limitations on applying variance reduction and acceleration to finite sum problems when the algorithm does not know which individual function the oracle is referring to, or is not given the strong convexity parameter explicitly.

Findings

1. Finite sum structure alone does not guarantee optimal complexity bounds.
2. Acceleration is not achievable without explicit strong convexity information.
3. Optimal complexity bounds depend on the uniformity of update rules across iterations.

Abstract

We study the conditions under which one is able to efficiently apply variance-reduction and acceleration schemes on finite sum optimization problems. First, we show that, perhaps surprisingly, the finite sum structure by itself is not sufficient for obtaining a complexity bound of $\tilde{\mathcal{O}}((n + L/\mu) \ln(1/\epsilon))$ for $L$-smooth and $\mu$-strongly convex individual functions - one must also know which individual function is being referred to by the oracle at each iteration. Next, we show that for a broad class of first-order and coordinate-descent finite sum algorithms (including, e.g., SDCA, SVRG, SAG), it is not possible to get an 'accelerated' complexity bound of $\tilde{\mathcal{O}}((n + \sqrt{n L/\mu}) \ln(1/\epsilon))$, unless the strong convexity parameter is given explicitly. Lastly, we show that when this class of algorithms is used for minimizing $L$-smooth and convex finite sums,…
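To get a feel for the gap between the two strongly convex bounds quoted in the abstract, the following is a minimal sketch that evaluates them numerically. It drops all constants and the logarithmic factors hidden by the tilde notation, so the numbers only indicate scaling. The function names and the sample values of `n`, `L`, `mu`, and `eps` are illustrative assumptions, not taken from the paper.

```python
import math


def variance_reduced_bound(n, L, mu, eps):
    """(n + L/mu) * ln(1/eps), with constants and hidden log factors dropped."""
    return (n + L / mu) * math.log(1.0 / eps)


def accelerated_bound(n, L, mu, eps):
    """(n + sqrt(n * L/mu)) * ln(1/eps), with constants and hidden log factors dropped."""
    return (n + math.sqrt(n * L / mu)) * math.log(1.0 / eps)


if __name__ == "__main__":
    # Hypothetical problem sizes: n individual functions, condition number L/mu.
    n, L, eps = 10_000, 1.0, 1e-6
    for mu in (1e-2, 1e-4, 1e-6):
        plain = variance_reduced_bound(n, L, mu, eps)
        accel = accelerated_bound(n, L, mu, eps)
        print(f"L/mu = {L / mu:.0e}  plain = {plain:.3e}  accelerated = {accel:.3e}")
```

Under these simplifications the accelerated expression is smaller only when the condition number $L/\mu$ exceeds $n$; for well-conditioned problems both are dominated by the $n$ term.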

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms