Multitask Learning with No Regret: from Improved Confidence Bounds to   Active Learning

Pier Giuseppe Sessa; Pierre Laforgue; Nicol\`o Cesa-Bianchi; Andreas; Krause

arXiv:2308.01744·cs.LG·August 4, 2023

Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning

Pier Giuseppe Sessa, Pierre Laforgue, Nicol\`o Cesa-Bianchi, Andreas, Krause

PDF

Open Access 1 Video

TL;DR

This paper develops new confidence intervals for multitask learning in the agnostic setting, enabling improved online regret bounds and active learning strategies that adapt to task similarity without prior knowledge.

Contribution

It introduces novel multitask confidence intervals, an adaptive online learning algorithm, and a multitask active learning setup with theoretical guarantees and empirical validation.

Findings

01

Improved regret bounds depending on task similarity

02

Adaptive algorithm that does not require prior knowledge of task similarity

03

Validated effectiveness on synthetic and drug discovery data

Abstract

Multitask learning is a powerful framework that enables one to simultaneously learn multiple related tasks by sharing information between them. Quantifying uncertainty in the estimated tasks is of pivotal importance for many downstream applications, such as online or active learning. In this work, we provide novel multitask confidence intervals in the challenging agnostic setting, i.e., when neither the similarity between tasks nor the tasks' features are available to the learner. The obtained intervals do not require i.i.d. data and can be directly applied to bound the regret in online learning. Through a refined analysis of the multitask information gain, we obtain new regret guarantees that, depending on a task similarity parameter, can significantly improve over treating tasks independently. We further propose a novel online learning algorithm that achieves such improved regret…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques