Accelerating Distributed Online Meta-Learning via Multi-Agent Collaboration under Limited Communication
Sen Lin, Mehmet Dedeoglu, Junshan Zhang

TL;DR
This paper introduces a multi-agent online meta-learning framework that leverages limited communication to significantly accelerate learning speed, achieving near-optimal regret bounds and demonstrating practical benefits through experiments.
Contribution
It proposes a novel multi-agent online meta-learning algorithm with limited communication, providing theoretical guarantees of improved regret bounds over single-agent methods.
Findings
Achieves a regret of $O(rac{1}{ oot{N}T})$, showing faster convergence with more agents.
Develops a distributed gradient tracking algorithm with $O( oot{T/N})$ regret per agent.
Experimental results validate the theoretical speedup and effectiveness of the proposed method.
Abstract
Online meta-learning is emerging as an enabling technique for achieving edge intelligence in the IoT ecosystem. Nevertheless, to learn a good meta-model for within-task fast adaptation, a single agent alone has to learn over many tasks, and this is the so-called 'cold-start' problem. Observing that in a multi-agent network the learning tasks across different agents often share some model similarity, we ask the following fundamental question: "Is it possible to accelerate the online meta-learning across agents via limited communication and if yes how much benefit can be achieved? " To answer this question, we propose a multi-agent online meta-learning framework and cast it as an equivalent two-level nested online convex optimization (OCO) problem. By characterizing the upper bound of the agent-task-averaged regret, we show that the performance of multi-agent online meta-learning depends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques
