Distributed Extra-gradient with Optimal Complexity and Communication   Guarantees

Ali Ramezani-Kebrya; Kimon Antonakopoulos; Igor Krawczuk and; Justin Deschenaux; Volkan Cevher

arXiv:2308.09187·cs.LG·August 21, 2023

Distributed Extra-gradient with Optimal Complexity and Communication Guarantees

Ali Ramezani-Kebrya, Kimon Antonakopoulos, Igor Krawczuk and, Justin Deschenaux, Volkan Cevher

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a communication-efficient, quantized version of the extra-gradient algorithm for distributed monotone variational inequality problems, achieving optimal convergence rates and validated through real-world multi-GPU experiments.

Contribution

The paper proposes Q-GenX, an unbiased adaptive compression method for distributed VI problems, with adaptive step-size and optimal convergence guarantees.

Findings

01

Achieves ${ m O}(1/T)$ convergence under relative noise.

02

Achieves ${ m O}(1/ oot 2 T)$ convergence under absolute noise.

03

Validated through training GANs on multiple GPUs.

Abstract

We consider monotone variational inequality (VI) problems in multi-GPU settings where multiple processors/workers/clients have access to local stochastic dual vectors. This setting includes a broad range of important problems from distributed convex minimization to min-max and games. Extra-gradient, which is a de facto algorithm for monotone VI problems, has not been designed to be communication-efficient. To this end, we propose a quantized generalized extra-gradient (Q-GenX), which is an unbiased and adaptive compression method tailored to solve VIs. We provide an adaptive step-size rule, which adapts to the respective noise profiles at hand and achieve a fast rate of $O (1/ T)$ under relative noise, and an order-optimal $O (1/ T)$ under absolute noise and show distributed training accelerates convergence. Finally, we validate our theoretical results by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lions-epfl/qgenx
pytorchOfficial

Videos

Distributed Extra-gradient with Optimal Complexity and Communication Guarantees· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM