Multitask Online Mirror Descent

Nicolò Cesa-Bianchi, Pierre Laforgue, Andrea Paudice, Massimiliano Pontil

arXiv:2106.02393·cs.LG·November 2, 2022

TL;DR

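This paper introduces MT-OMD, a multitask generalization of Online Mirror Descent that shares updates across tasks. When tasks are similar, its regret bound improves on the one obtained by running an independent OMD on each task. The multitask versions of Online Gradient Descent and Exponentiated Gradient have closed-form updates, and experiments support the theory.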

Contribution

The paper develops a multitask extension of Online Mirror Descent with theoretical regret bounds, closed-form updates, and empirical evidence of improved performance for similar tasks.

Findings

01

Regret bound of order √(1 + σ²(N-1))√T

02

Improved bounds when tasks are similar (σ² ≤ 1)

03

Algorithms with closed-form updates for practical use
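
As a rough numerical check of the first two findings, the sketch below evaluates the multitask rate √(1 + σ²(N-1))√T against the √(NT) rate of independent per-task OMD. Constants and logarithmic factors are ignored, and the function names are hypothetical, chosen only for this illustration.

```python
import math


def multitask_rate(sigma2: float, n_tasks: int, horizon: int) -> float:
    """Order of the MT-OMD regret: sqrt(1 + sigma^2 (N - 1)) * sqrt(T)."""
    return math.sqrt(1.0 + sigma2 * (n_tasks - 1)) * math.sqrt(horizon)


def independent_rate(n_tasks: int, horizon: int) -> float:
    """Order of the regret of N independent OMD instances: sqrt(N T)."""
    return math.sqrt(n_tasks * horizon)


if __name__ == "__main__":
    n_tasks, horizon = 10, 10_000
    for sigma2 in (0.0, 0.25, 1.0):
        mt = multitask_rate(sigma2, n_tasks, horizon)
        ind = independent_rate(n_tasks, horizon)
        print(f"sigma^2={sigma2:.2f}  multitask={mt:.1f}  independent={ind:.1f}")
```

At σ² = 1 the two rates coincide, and at σ² = 0 (identical tasks) the multitask rate reduces to √T, a factor of √N smaller.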

Abstract

We introduce and analyze MT-OMD, a multitask generalization of Online Mirror Descent (OMD) which operates by sharing updates between tasks. We prove that the regret of MT-OMD is of order $\sqrt{1 + \sigma^{2} (N - 1)} \sqrt{T}$, where $\sigma^{2}$ is the task variance according to the geometry induced by the regularizer, $N$ is the number of tasks, and $T$ is the time horizon. Whenever tasks are similar, that is $\sigma^{2} \leq 1$, our method improves upon the $\sqrt{N T}$ bound obtained by running independent OMDs on each task. We further provide a matching lower bound, and show that our multitask extensions of Online Gradient Descent and Exponentiated Gradient, two major instances of OMD, enjoy closed-form updates, making them easy to use in practice. Finally, we present experiments which support our theoretical findings.
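
To make "sharing updates between tasks" concrete, here is a minimal sketch of one shared gradient step. It is not taken from this page: it assumes a task-interaction matrix of the form A = (1 + b) I - (b/N) 1 1ᵀ, whose inverse is (I + (b/N) 1 1ᵀ) / (1 + b), and it omits any projection onto a feasible set. The function name and the parameter b are hypothetical.

```python
def shared_gradient_step(points, task, grad, lr, b):
    """One unprojected gradient step in which the active task's gradient
    also moves the other tasks' points.

    points: list of N vectors (lists of floats), one per task.
    task:   index of the task that received feedback this round.
    grad:   gradient observed for that task.
    lr:     learning rate.
    b:      coupling strength (assumed); b = 0 gives independent updates.
    """
    n_tasks = len(points)
    # Column `task` of the assumed inverse (I + (b/N) 1 1^T) / (1 + b):
    # every task gets `shared`, the active task gets `own` in addition.
    shared = (b / n_tasks) / (1.0 + b)
    own = 1.0 / (1.0 + b)
    updated = []
    for j, x in enumerate(points):
        weight = shared + (own if j == task else 0.0)
        updated.append([xi - lr * weight * gi for xi, gi in zip(x, grad)])
    return updated


if __name__ == "__main__":
    start = [[0.0, 0.0] for _ in range(3)]
    print(shared_gradient_step(start, task=0, grad=[1.0, -2.0], lr=0.5, b=0.0))
    print(shared_gradient_step(start, task=0, grad=[1.0, -2.0], lr=0.5, b=3.0))
```

With b = 0 only the active task's point moves, which recovers independent Online Gradient Descent. With b > 0 every task takes a scaled-down version of the same step, so similar tasks benefit from each other's feedback.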

Taxonomy

Topics: Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques