EXAdam: The Power of Adaptive Cross-Moments
Ahmed M. Adly

TL;DR
EXAdam is an enhanced optimization algorithm building on Adam, introducing new debiasing and gradient acceleration techniques to improve convergence, robustness, and effectiveness in complex landscapes, demonstrated by faster training and better accuracy on CIFAR-10.
Contribution
The paper proposes EXAdam, a novel optimizer with debiasing and acceleration mechanisms, offering theoretical insights and empirical improvements over Adam.
Findings
38.46% faster convergence on CIFAR-10
Improved training, validation, and testing accuracies
Potential robustness to hyperparameters
Abstract
This paper introduces EXAdam (tended ), a novel optimization algorithm that builds upon the widely-used Adam optimizer. EXAdam incorporates two key enhancements: (1) new debiasing terms for improved moment estimation and (2) a gradient-based acceleration mechanism for increased responsiveness to the current loss landscape. These innovations work synergistically to address limitations of the original Adam algorithm, potentially offering improved convergence properties, enhanced ability to escape saddle points, and potentially greater robustness to hyperparameter choices, though this requires further investigation. We provide a theoretical analysis of EXAdam's components and their interactions, highlighting the algorithm's potential advantages in navigating complex optimization landscapes. Empirical evaluations demonstrate EXAdam's superiority over Adam,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Metaheuristic Optimization Algorithms Research · Machine Learning and Data Classification
MethodsAdam
