On Acceleration with Noise-Corrupted Gradients

Michael B. Cohen; Jelena Diakonikolas; Lorenzo Orecchia

arXiv:1805.12591·math.OC·August 1, 2018·6 cites

On Acceleration with Noise-Corrupted Gradients

Michael B. Cohen, Jelena Diakonikolas, Lorenzo Orecchia

PDF

Open Access

TL;DR

This paper introduces a new accelerated optimization method called AGDP, analyzes its stability under noisy gradient conditions, and provides modifications to improve robustness and reduce noise-induced errors.

Contribution

The paper presents AGDP, a generalized accelerated method, and offers a theoretical analysis of its performance with noisy gradients, including practical modifications for noise reduction.

Findings

01

AGDP outperforms previous methods in noisy settings

02

Theoretical analysis clarifies noise-acceleration interaction

03

Modified AGDP reduces gradient noise errors

Abstract

Accelerated algorithms have broad applications in large-scale optimization, due to their generality and fast convergence. However, their stability in the practical setting of noise-corrupted gradient oracles is not well-understood. This paper provides two main technical contributions: (i) a new accelerated method AGDP that generalizes Nesterov's AGD and improves on the recent method AXGD (Diakonikolas & Orecchia, 2018), and (ii) a theoretical study of accelerated algorithms under noisy and inexact gradient oracles, which is supported by numerical experiments. This study leverages the simplicity of AGDP and its analysis to clarify the interaction between noise and acceleration and to suggest modifications to the algorithm that reduce the mean and variance of the error incurred due to the gradient noise.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM