Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning
Pier Giuseppe Sessa, Pierre Laforgue, Nicol\`o Cesa-Bianchi, Andreas, Krause

TL;DR
This paper develops new confidence intervals for multitask learning in the agnostic setting, enabling improved online regret bounds and active learning strategies that adapt to task similarity without prior knowledge.
Contribution
It introduces novel multitask confidence intervals, an adaptive online learning algorithm, and a multitask active learning setup with theoretical guarantees and empirical validation.
Findings
Improved regret bounds depending on task similarity
Adaptive algorithm that does not require prior knowledge of task similarity
Validated effectiveness on synthetic and drug discovery data
Abstract
Multitask learning is a powerful framework that enables one to simultaneously learn multiple related tasks by sharing information between them. Quantifying uncertainty in the estimated tasks is of pivotal importance for many downstream applications, such as online or active learning. In this work, we provide novel multitask confidence intervals in the challenging agnostic setting, i.e., when neither the similarity between tasks nor the tasks' features are available to the learner. The obtained intervals do not require i.i.d. data and can be directly applied to bound the regret in online learning. Through a refined analysis of the multitask information gain, we obtain new regret guarantees that, depending on a task similarity parameter, can significantly improve over treating tasks independently. We further propose a novel online learning algorithm that achieves such improved regret…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques
