Linear Convergence in Games with Delayed Feedback via Extra Prediction
Yuma Fujimoto, Kenshi Abe, and Kaito Ariu

TL;DR
This paper analyzes the impact of delayed feedback on convergence in bilinear games and demonstrates that extra optimism in prediction significantly accelerates convergence rates, offering a promising approach to mitigate delay effects.
Contribution
It introduces a novel analysis of WOGDA with extra optimism, showing improved convergence rates under feedback delays in bilinear games.
Findings
Extra optimism accelerates convergence rate to $ ext{exp}(- heta(t/(m^{2} ext{log} m)))$
Standard optimism achieves linear convergence at rate $ ext{exp}(- heta(t/m^{5}))$
Experiments confirm the theoretical acceleration due to extra optimism.
Abstract
Feedback delays are inevitable in real-world multi-agent learning. They are known to severely degrade performance, and the convergence rate under delayed feedback is still unclear, even for bilinear games. This paper derives the rate of linear convergence of Weighted Optimistic Gradient Descent-Ascent (WOGDA), which predicts future rewards with extra optimism, in unconstrained bilinear games. To analyze the algorithm, we interpret it as an approximation of the Extra Proximal Point (EPP), which is updated based on farther future rewards than the classical Proximal Point (PP). Our theorems show that standard optimism (predicting the next-step reward) achieves linear convergence to the equilibrium at a rate after iterations for delay . Moreover, employing extra optimism (predicting farther future reward) tolerates a larger step size and significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Game Theory and Applications
