Optimization using Parallel Gradient Evaluations on Multiple Parameters
Yash Chandak, Shiv Shankar, Venkata Gandikota, Philip S. Thomas, Arya, Mazumdar

TL;DR
This paper introduces a parallel gradient evaluation method for convex optimization that leverages multiple parameters simultaneously, achieving faster convergence and robustness with minimal additional complexity.
Contribution
It presents a novel first-order optimization method that uses multiple gradients in parallel, enhancing efficiency without increasing computational or memory costs.
Findings
Using gradients from as few as two parameters yields significant acceleration.
The method is robust to hyper-parameter settings.
Empirical results support the effectiveness of parallel gradient evaluations.
Abstract
We propose a first-order method for convex optimization, where instead of being restricted to the gradient from a single parameter, gradients from multiple parameters can be used during each step of gradient descent. This setup is particularly useful when a few processors are available that can be used in parallel for optimization. Our method uses gradients from multiple parameters in synergy to update these parameters together towards the optima. While doing so, it is ensured that the computational and memory complexity is of the same order as that of gradient descent. Empirical results demonstrate that even using gradients from as low as \textit{two} parameters, our method can often obtain significant acceleration and provide robustness to hyper-parameter settings. We remark that the primary goal of this work is less theoretical, and is instead aimed at exploring the understudied case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Memory and Neural Computing
