Optimization with First-Order Surrogate Functions

Julien Mairal (INRIA Grenoble Rhône-Alpes / LJK Laboratoire Jean Kuntzmann)

arXiv:1305.3120·stat.ML·May 15, 2013·117 cites

Open Access

TL;DR

This paper presents a unified framework for first-order optimization methods based on iteratively minimizing surrogate functions, introduces a new incremental scheme, and reports experiments in which that scheme matches or outperforms state-of-the-art solvers on large-scale machine learning problems.

Contribution

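It unifies several first-order optimization algorithms (accelerated proximal gradient, block coordinate descent, and Frank-Wolfe) under a common surrogate-based perspective, and proposes a new incremental scheme that experimentally matches or outperforms state-of-the-art solvers on large-scale problems.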
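To make the surrogate-based perspective concrete, here is a minimal sketch of one well-known instance: minimizing an l1-regularized least-squares objective by repeatedly minimizing a quadratic upper bound of its smooth part, which gives the proximal gradient update. The problem choice, the function names, and the crude curvature bound `L` (the squared Frobenius norm of `A`) are assumptions made for illustration and are not taken from the paper.

```python
def soft_threshold(x, t):
    """Minimizer of 0.5 * (u - x)**2 + t * |u| over u."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def residual(A, b, theta):
    """Compute A @ theta - b for a list-of-rows matrix A."""
    return [sum(a * t for a, t in zip(row, theta)) - bi for row, bi in zip(A, b)]


def objective(A, b, theta, lam):
    """0.5 * ||A theta - b||^2 + lam * ||theta||_1."""
    r = residual(A, b, theta)
    return 0.5 * sum(x * x for x in r) + lam * sum(abs(t) for t in theta)


def minimize_with_surrogates(A, b, lam, n_iters=100):
    """Iteratively minimize a majorizing surrogate built at the current point.

    At the current point kappa, the surrogate is
        smooth(kappa) + grad^T (theta - kappa)
        + (L / 2) * ||theta - kappa||^2 + lam * ||theta||_1,
    which upper-bounds the objective whenever L is at least the largest
    eigenvalue of A^T A. Its exact minimizer is a soft-thresholded
    gradient step.
    """
    d = len(A[0])
    # Assumption: the squared Frobenius norm is a safe (loose) upper bound
    # on the largest eigenvalue of A^T A.
    L = sum(a * a for row in A for a in row)
    theta = [0.0] * d
    history = [objective(A, b, theta, lam)]
    for _ in range(n_iters):
        r = residual(A, b, theta)
        grad = [sum(A[i][j] * r[i] for i in range(len(A))) for j in range(d)]
        # Exact minimizer of the surrogate anchored at the current theta.
        theta = [soft_threshold(t - g / L, lam / L) for t, g in zip(theta, grad)]
        history.append(objective(A, b, theta, lam))
    return theta, history
```

Because each surrogate upper-bounds the objective and touches it at the current point, the recorded objective values cannot increase from one iteration to the next; a tighter `L` would give larger steps.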

Findings

01. Unified view of first-order methods like proximal gradient and Frank-Wolfe.

02. New incremental scheme matches or outperforms existing solvers.

03. Effective on large-scale machine learning problems.

Abstract

In this paper, we study optimization methods consisting of iteratively minimizing surrogates of an objective function. By proposing several algorithmic variants and simple convergence analyses, we make two main contributions. First, we provide a unified viewpoint for several first-order optimization techniques such as accelerated proximal gradient, block coordinate descent, or Frank-Wolfe algorithms. Second, we introduce a new incremental scheme that experimentally matches or outperforms state-of-the-art solvers for large-scale optimization problems typically arising in machine learning.
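The incremental idea in the abstract can be illustrated with a small sketch: for an objective that averages many terms, keep one quadratic surrogate per term, refresh a single randomly chosen surrogate per step, and move to the minimizer of the averaged surrogates. This is an illustration in the spirit of that description under stated assumptions, not the paper's exact algorithm; the least-squares terms, the constant `L`, the sampling rule, and all function names are hypothetical choices.

```python
import random


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def mean_loss(A, b, theta):
    """Average of f_i(theta) = 0.5 * (a_i^T theta - b_i)^2."""
    return sum(0.5 * (dot(a, theta) - bi) ** 2 for a, bi in zip(A, b)) / len(A)


def incremental_surrogate_minimization(A, b, n_epochs=50, seed=0):
    """Incrementally minimize an average of per-term quadratic surrogates.

    The surrogate of f_i anchored at kappa is
        f_i(kappa) + grad_i^T (theta - kappa) + (L / 2) * ||theta - kappa||^2,
    whose minimizer is z_i = kappa - grad_i / L. The average of all
    surrogates is minimized at the mean of the z_i.
    """
    rng = random.Random(seed)
    n, d = len(A), len(A[0])
    # Assumption: L = max_i ||a_i||^2 bounds the curvature of every f_i,
    # so each surrogate upper-bounds its term.
    L = max(dot(a, a) for a in A)

    def surrogate_minimizer(i, kappa):
        r = dot(A[i], kappa) - b[i]  # grad f_i(kappa) = r * a_i
        return [k - r * a / L for k, a in zip(kappa, A[i])]

    # Build every surrogate at the origin, then minimize their average.
    z = [surrogate_minimizer(i, [0.0] * d) for i in range(n)]
    theta = [sum(zi[j] for zi in z) / n for j in range(d)]

    for _ in range(n_epochs * n):
        i = rng.randrange(n)
        # Refresh only surrogate i at the current point.
        z_new = surrogate_minimizer(i, theta)
        # Update the mean of the z_i in O(d) instead of recomputing it.
        theta = [t + (zn - zo) / n for t, zn, zo in zip(theta, z_new, z[i])]
        z[i] = z_new
    return theta
```

Each step touches a single data point, which is what makes schemes of this kind attractive when the number of terms is large; the price is storing one anchor vector per term.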

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research