Stability and Generalization of the Decentralized Stochastic Gradient   Descent Ascent Algorithm

Miaoxi Zhu; Li Shen; Bo Du; Dacheng Tao

arXiv:2310.20369·cs.LG·November 1, 2023·1 cites

Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm

Miaoxi Zhu, Li Shen, Bo Du, Dacheng Tao

PDF

Open Access 1 Video

TL;DR

This paper studies the stability and generalization properties of the decentralized stochastic gradient descent ascent algorithm, showing it maintains good generalization similar to centralized methods under various network topologies.

Contribution

It provides the first theoretical analysis of the primal-dual generalization bounds for decentralized minimax algorithms, extending stability theory to decentralized settings.

Findings

01

Decentralized structure does not impair stability or generalization of D-SGDA.

02

Generalization bounds depend on network topology beyond sample size and learning rate.

03

Numerical experiments confirm theoretical predictions.

Abstract

The growing size of available data has attracted increasing interest in solving minimax problems in a decentralized manner for various machine learning tasks. Previous theoretical research has primarily focused on the convergence rate and communication complexity of decentralized minimax algorithms, with little attention given to their generalization. In this paper, we investigate the primal-dual generalization bound of the decentralized stochastic gradient descent ascent (D-SGDA) algorithm using the approach of algorithmic stability under both convex-concave and nonconvex-nonconcave settings. Our theory refines the algorithmic stability in a decentralized manner and demonstrates that the decentralized structure does not destroy the stability and generalization of D-SGDA, implying that it can generalize as well as the vanilla SGDA in certain situations. Our results analyze the impact of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Distributed Control Multi-Agent Systems