Making the most of your day: online learning for optimal allocation of   time

Etienne Boursier; Tristan Garrec; Vianney Perchet; Marco; Scarsini

arXiv:2102.08087·stat.ML·November 5, 2021

Making the most of your day: online learning for optimal allocation of time

Etienne Boursier, Tristan Garrec, Vianney Perchet, Marco, Scarsini

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates online learning strategies for optimal time allocation in sequential task acceptance scenarios, analyzing regret bounds under different knowledge assumptions about rewards and task durations.

Contribution

It introduces a novel framework for online time allocation learning, addressing unknown reward functions and task duration distributions, with theoretical regret analysis.

Findings

01

Derived regret bounds for known reward functions

02

Analyzed regret when reward functions are unknown

03

Identified key differences from standard bandit problems

Abstract

We study online learning for optimal allocation when the resource to be allocated is time. %Examples of possible applications include job scheduling for a computing server, a driver filling a day with rides, a landlord renting an estate, etc. An agent receives task proposals sequentially according to a Poisson process and can either accept or reject a proposed task. If she accepts the proposal, she is busy for the duration of the task and obtains a reward that depends on the task duration. If she rejects it, she remains on hold until a new task proposal arrives. We study the regret incurred by the agent, first when she knows her reward function but does not know the distribution of the task duration, and then when she does not know her reward function, either. This natural setting bears similarities with contextual (one-armed) bandits, but with the crucial difference that the normalized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eboursier/making_most_of_your_time
noneOfficial

Videos

Making the most of your day: online learning for optimal allocation of time· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Smart Grid Energy Management