Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba

TL;DR
Adam is a new stochastic optimization algorithm that adaptively estimates moments of gradients, offering efficiency, invariance, and robustness for large-scale, noisy, and sparse problems, with strong theoretical and empirical support.
Contribution
This paper introduces Adam, a novel adaptive gradient optimization method with theoretical convergence guarantees and practical advantages over existing algorithms.
Findings
Adam converges faster than traditional methods in experiments.
Adam performs well on large-scale and noisy datasets.
Theoretical analysis shows Adam has favorable convergence properties.
Abstract
We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Machine Learning and ELM
MethodsAdaMax · Adam
