Accelerating Distributed Online Meta-Learning via Multi-Agent Collaboration under Limited Communication

Sen Lin; Mehmet Dedeoglu; Junshan Zhang

arXiv:2012.08660 · cs.LG · December 22, 2020

TL;DR

This paper introduces a multi-agent online meta-learning framework that uses limited communication among agents to significantly accelerate learning, achieving near-optimal regret bounds and demonstrating practical benefits in experiments.

Contribution

It proposes a novel multi-agent online meta-learning algorithm with limited communication, providing theoretical guarantees of improved regret bounds over single-agent methods.

Findings

01

Achieves a task-average regret of $O(\frac{1}{\sqrt{NT}})$, where $N$ is the number of agents and $T$ the number of iterations, showing faster convergence with more agents.

02

Develops a distributed gradient tracking algorithm with $O(\sqrt{T/N})$ regret per agent.

03

Experimental results validate the theoretical speedup and effectiveness of the proposed method.
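
To make the gradient tracking idea in the findings above more concrete, here is a minimal sketch of distributed online gradient descent with gradient tracking. It is not the authors' algorithm or code. It assumes scalar decisions, quadratic per-round losses $f_{i,t}(x) = \frac{1}{2}(x - b_{i,t})^2$, a ring network with equal 1/3 weights, and a hand-picked step size; the names `ring_weights`, `mix`, and `run_gradient_tracking` are hypothetical. Each agent exchanges values with its neighbors once per round and keeps a tracker `y` whose network average always equals the network average of the current local gradients.

```python
import random


def ring_weights(n):
    """Doubly stochastic mixing matrix for a ring: self plus two neighbors, 1/3 each."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i][j % n] += 1.0 / 3.0
    return W


def mix(W, v):
    """One communication step: every agent takes a weighted average over its neighbors."""
    return [sum(w * vj for w, vj in zip(row, v)) for row in W]


def run_gradient_tracking(n_agents=8, rounds=200, step=0.1, seed=0):
    rng = random.Random(seed)
    W = ring_weights(n_agents)
    center = 2.0  # assumed shared optimum; stands in for model similarity across agents

    def draw_targets():
        # Each agent sees a noisy version of the shared optimum in every round.
        return [center + rng.gauss(0.0, 0.1) for _ in range(n_agents)]

    x = [0.0] * n_agents                      # local decisions
    targets = draw_targets()
    grad = [xi - bi for xi, bi in zip(x, targets)]  # gradient of 0.5 * (x - b)^2
    y = list(grad)                            # trackers start at the local gradients

    for _ in range(rounds):
        # Decision update: average with neighbors, then step along the tracked gradient.
        x = [mx - step * yi for mx, yi in zip(mix(W, x), y)]
        targets = draw_targets()
        new_grad = [xi - bi for xi, bi in zip(x, targets)]
        # Tracker update: average with neighbors, then add the local gradient change.
        y = [my + g1 - g0 for my, g1, g0 in zip(mix(W, y), new_grad, grad)]
        grad = new_grad
    return x, y, grad, center


if __name__ == "__main__":
    x, y, grad, center = run_gradient_tracking()
    print("mean decision:", sum(x) / len(x), "shared optimum:", center)
```

Because the mixing matrix is doubly stochastic, the tracker update preserves the network-wide average, which is what lets each agent approximate the global gradient using only local exchanges. The regret rates quoted above come from the paper's analysis and are not something this toy run demonstrates.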

Abstract

Online meta-learning is emerging as an enabling technique for achieving edge intelligence in the IoT ecosystem. Nevertheless, to learn a good meta-model for fast within-task adaptation, a single agent alone has to learn over many tasks; this is the so-called "cold-start" problem. Observing that in a multi-agent network the learning tasks across different agents often share some model similarity, we ask the following fundamental question: "Is it possible to accelerate online meta-learning across agents via limited communication and, if so, how much benefit can be achieved?" To answer this question, we propose a multi-agent online meta-learning framework and cast it as an equivalent two-level nested online convex optimization (OCO) problem. By characterizing the upper bound of the agent-task-averaged regret, we show that the performance of multi-agent online meta-learning depends…

Taxonomy

Topics: Advanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques