CompAdaGrad: A Compressed, Complementary, Computationally-Efficient Adaptive Gradient Method
Nishant A. Mehta, Alistair Rendell, Anish Varghese and, Christfried Webers

TL;DR
CompAdaGrad is a new adaptive gradient method that balances the theoretical benefits of full-matrix regularization with computational efficiency, performing well in high-dimensional online learning tasks.
Contribution
It introduces CompAdaGrad, which combines low-dimensional full-matrix regularization with diagonal regularization in the complementary space, improving performance over diagonal AdaGrad while maintaining linear complexity.
Findings
Outperforms diagonal AdaGrad in experiments
Maintains linear per-iteration complexity
Provides theoretical guarantees independent of regularizer choice
Abstract
The adaptive gradient online learning method known as AdaGrad has seen widespread use in the machine learning community in stochastic and adversarial online learning problems and more recently in deep learning methods. The method's full-matrix incarnation offers much better theoretical guarantees and potentially better empirical performance than its diagonal version; however, this version is computationally prohibitive and so the simpler diagonal version often is used in practice. We introduce a new method, CompAdaGrad, that navigates the space between these two schemes and show that this method can yield results much better than diagonal AdaGrad while avoiding the (effectively intractable) computational complexity of full-matrix AdaGrad for dimension . CompAdaGrad essentially performs full-matrix regularization in a low-dimensional subspace while performing diagonal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
MethodsAdaGrad
