DIAMOND: Taming Sample and Communication Complexities in Decentralized Bilevel Optimization
Peiwen Qiu, Yining Li, Zhuqing Liu, Prashant Khanduri, Jia Liu, Ness, B. Shroff, Elizabeth Serena Bentley, Kurt Turck

TL;DR
This paper introduces DIAMOND, a decentralized bilevel optimization algorithm that reduces sample and communication complexities using a single-loop structure, momentum, and gradient tracking, suitable for edge networks.
Contribution
The paper presents DIAMOND, a novel decentralized bilevel optimization method with a single-loop design that avoids full gradient evaluations, lowering complexities and outperforming existing approaches.
Findings
Achieves $ ilde{O}( ext{epsilon}^{-3/2})$ sample and communication complexity.
Does not require full gradient evaluations, reducing computational cost.
Experimental results confirm theoretical complexity improvements.
Abstract
Decentralized bilevel optimization has received increasing attention recently due to its foundational role in many emerging multi-agent learning paradigms (e.g., multi-agent meta-learning and multi-agent reinforcement learning) over peer-to-peer edge networks. However, to work with the limited computation and communication capabilities of edge networks, a major challenge in developing decentralized bilevel optimization techniques is to lower sample and communication complexities. This motivates us to develop a new decentralized bilevel optimization called DIAMOND (decentralized single-timescale stochastic approximation with momentum and gradient-tracking). The contributions of this paper are as follows: i) our DIAMOND algorithm adopts a single-loop structure rather than following the natural double-loop structure of bilevel optimization, which offers low computation and implementation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research
