On a Conjecture Regarding the Adam Optimizer

Mohamed Akrout; Douglas Tweed

arXiv:2111.08162·cs.LG·September 12, 2022·1 cites

On a Conjecture Regarding the Adam Optimizer

Mohamed Akrout, Douglas Tweed

PDF

Open Access

TL;DR

This paper investigates the theoretical foundations of the Adam optimizer, disproves a key conjecture, and proposes a modified version that can underpin future analyses of Adam's effectiveness.

Contribution

It refutes Bock's conjecture about Adam and introduces a generalized version that can replace it in theoretical explanations.

Findings

01

Bock's conjecture is false.

02

A modified, generalized conjecture is proven.

03

The new conjecture supports Adam's theoretical analysis.

Abstract

Why does the Adam optimizer work so well in deep-learning applications? Adam's originators, Kingma and Ba, presented a mathematical argument that was meant to help explain its success, but Bock and colleagues have since reported that a key piece is missing from that argument $-$ an unproven lemma which we will call Bock's conjecture. Here we show that this conjecture is false, but we prove a modified version of it $-$ a generalization of a result of Reddi and colleagues $-$ which can take its place in analyses of Adam.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Constraint Satisfaction and Optimization · Neural Networks and Applications

MethodsAdam