DIAMOND: Taming Sample and Communication Complexities in Decentralized   Bilevel Optimization

Peiwen Qiu; Yining Li; Zhuqing Liu; Prashant Khanduri; Jia Liu; Ness; B. Shroff; Elizabeth Serena Bentley; Kurt Turck

arXiv:2212.02376·cs.LG·January 23, 2023

DIAMOND: Taming Sample and Communication Complexities in Decentralized Bilevel Optimization

Peiwen Qiu, Yining Li, Zhuqing Liu, Prashant Khanduri, Jia Liu, Ness, B. Shroff, Elizabeth Serena Bentley, Kurt Turck

PDF

Open Access

TL;DR

This paper introduces DIAMOND, a decentralized bilevel optimization algorithm that reduces sample and communication complexities using a single-loop structure, momentum, and gradient tracking, suitable for edge networks.

Contribution

The paper presents DIAMOND, a novel decentralized bilevel optimization method with a single-loop design that avoids full gradient evaluations, lowering complexities and outperforming existing approaches.

Findings

01

Achieves $ ilde{O}( ext{epsilon}^{-3/2})$ sample and communication complexity.

02

Does not require full gradient evaluations, reducing computational cost.

03

Experimental results confirm theoretical complexity improvements.

Abstract

Decentralized bilevel optimization has received increasing attention recently due to its foundational role in many emerging multi-agent learning paradigms (e.g., multi-agent meta-learning and multi-agent reinforcement learning) over peer-to-peer edge networks. However, to work with the limited computation and communication capabilities of edge networks, a major challenge in developing decentralized bilevel optimization techniques is to lower sample and communication complexities. This motivates us to develop a new decentralized bilevel optimization called DIAMOND (decentralized single-timescale stochastic approximation with momentum and gradient-tracking). The contributions of this paper are as follows: i) our DIAMOND algorithm adopts a single-loop structure rather than following the natural double-loop structure of bilevel optimization, which offers low computation and implementation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research