Online Multi-task Learning with Hard Constraints

Gabor Lugosi; Omiros Papaspiliopoulos; Gilles Stoltz (DMA; GREGH)

arXiv:0902.3526·stat.ML·March 27, 2009·21 cites

TL;DR

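This paper studies online learning across M related tasks, where the M-tuple of actions chosen by the decision maker must satisfy hard constraints. For a general class of tractable constraints, it gives computationally efficient ways of selecting actions by reducing to an online shortest path problem, and it extends the model to tracking, bandit, non-additive-loss, and infinite-task settings.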

Contribution

It proposes a tractable approach to constrained multi-task online learning, reducing action selection to an online shortest path computation, and extends the model to non-additive global losses and uncountably infinite sets of tasks.

Findings

01. Efficient algorithms for constrained multi-task online learning.

02. Extension to tracking, bandit, and infinite-task scenarios.

03. Reduction of constrained decision-making to shortest path problems.

Abstract

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an on-line shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.
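The reduction to a path problem mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm as stated in the paper. It assumes additive losses, a constraint that only links the actions of consecutive tasks (a hypothetical predicate `allowed(a, b)`), and an exponentially weighted forecaster over all legal M-tuples. Under these assumptions, legal tuples correspond to paths in a layered graph, so the sum of weights over exponentially many tuples can be computed by dynamic programming and a tuple can be sampled task by task. The names `suffix_weights`, `sample_tuple`, `cum_loss`, and `eta` are illustrative and do not come from the paper.

```python
import math
import random


def suffix_weights(cum_loss, allowed, eta):
    """beta[j][a] = total weight of legal action suffixes for tasks j..M-1
    that start with action a, where a tuple's weight is
    exp(-eta * sum of cumulative losses of its actions).

    cum_loss[j][a] is the cumulative loss of action a on task j (assumed input).
    allowed(a, b) says whether action b may follow action a (assumed constraint).
    """
    M, N = len(cum_loss), len(cum_loss[0])
    beta = [[0.0] * N for _ in range(M)]
    for a in range(N):
        beta[M - 1][a] = math.exp(-eta * cum_loss[M - 1][a])
    # Backward pass over the layered graph: one layer per task.
    for j in range(M - 2, -1, -1):
        for a in range(N):
            tail = sum(beta[j + 1][b] for b in range(N) if allowed(a, b))
            beta[j][a] = math.exp(-eta * cum_loss[j][a]) * tail
    return beta


def sample_tuple(cum_loss, allowed, eta, rng):
    """Sample a legal M-tuple with probability proportional to its weight.

    Assumes every action has at least one allowed successor; otherwise
    all weights can be zero and rng.choices raises ValueError.
    """
    M, N = len(cum_loss), len(cum_loss[0])
    beta = suffix_weights(cum_loss, allowed, eta)
    # First task: no predecessor, so every action is a candidate.
    a = rng.choices(range(N), weights=beta[0])[0]
    actions = [a]
    for j in range(1, M):
        # Restrict to successors permitted by the constraint.
        w = [beta[j][b] if allowed(a, b) else 0.0 for b in range(N)]
        a = rng.choices(range(N), weights=w)[0]
        actions.append(a)
    return actions


if __name__ == "__main__":
    # Toy instance: 3 tasks, 3 actions, neighbouring tasks may differ by at most 1.
    cum_loss = [[0.0, 1.0, 2.0], [1.0, 0.0, 1.0], [2.0, 1.0, 0.0]]
    differ_by_at_most_one = lambda a, b: abs(a - b) <= 1
    print(sample_tuple(cum_loss, differ_by_at_most_one, 0.5, random.Random(0)))
```

The cost per round is on the order of M·N² operations instead of a sum over up to N^M tuples. For large cumulative losses the exponentials can underflow, so a practical version would work with log-weights.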

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics