Distributed Online Learning for Joint Regret with Communication Constraints
Dirk van der Hoeven, H\'edi Hadiji, Tim van Erven

TL;DR
This paper introduces a distributed online learning method that adapts to optimal graph partitions and efficiently manages communication constraints, achieving low joint regret in adversarial settings.
Contribution
It proposes a novel adaptive graph partitioning approach and a comparator-adaptive online convex optimization algorithm with delayed gradients.
Findings
The method adapts to the best graph partition among candidates.
It provides near-optimal gradient compression schemes based on communication bits.
The approach achieves low joint regret in adversarial online learning scenarios.
Abstract
We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a prediction and provides a corresponding gradient, and then the agents are allowed to send a -bit message to their neighbors in the graph. All agents cooperate to control the joint regret, which is the sum of the losses of the activated agents minus the losses evaluated at the best fixed common comparator parameters . We observe that it is suboptimal for agents to wait for gradients that take too long to arrive. Instead, the graph should be partitioned into local clusters that communicate among themselves. Our main result is a new method that can adapt to the optimal graph partition for the adversarial activations and gradients, where the graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Control Multi-Agent Systems · Long-Term Effects of COVID-19
