A Lower Bound for the Optimization of Finite Sums

Alekh Agarwal; Leon Bottou

arXiv:1410.0723·stat.ML·October 6, 2015·ICML·46 cites

A Lower Bound for the Optimization of Finite Sums

Alekh Agarwal, Leon Bottou

PDF

Open Access

TL;DR

This paper establishes a fundamental lower bound on the number of iterations needed for algorithms to optimize finite sums of smooth, strongly convex functions, and compares it with existing methods to identify scenarios where new algorithms are beneficial.

Contribution

It introduces a new lower bound for finite sum optimization and analyzes how recent methods compare to this theoretical limit in various data settings.

Findings

01

Lower bound of (n + \u221a{n(-1)} \, lg(1/\u03b5)) iterations.

02

Comparison of lower bounds with upper bounds of recent algorithms.

03

Identification of machine learning scenarios where new methods are advantageous.

Abstract

This paper presents a lower bound for optimizing a finite sum of $n$ functions, where each function is $L$ -smooth and the sum is $μ$ -strongly convex. We show that no algorithm can reach an error $ϵ$ in minimizing all functions from this class in fewer than $Ω (n + n (κ - 1) lo g (1/ ϵ))$ iterations, where $κ = L / μ$ is a surrogate condition number. We then compare this lower bound to upper bounds for recently developed methods specializing to this setting. When the functions involved in this sum are not arbitrary, but based on i.i.d. random data, then we further contrast these complexity results with those for optimal first-order methods to directly optimize the sum. The conclusion we draw is that a lot of caution is necessary for an accurate comparison, and identify machine learning scenarios where the new methods help computationally.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms