A note on continuous-time online learning

Lexing Ying

arXiv:2405.10399·stat.ML·May 20, 2024

A note on continuous-time online learning

Lexing Ying

PDF

Open Access

TL;DR

This paper extends discrete-time online learning algorithms to continuous-time models across various problems, providing concise proofs of optimal regret bounds in the process.

Contribution

It introduces continuous-time versions of algorithms for online linear optimization and adversarial bandits, with simplified proofs of their optimal regret bounds.

Findings

01

Extended algorithms to continuous-time setting

02

Provided concise proofs of optimal regret bounds

03

Unified treatment across multiple online learning problems

Abstract

In online learning, the data is provided in a sequential order, and the goal of the learner is to make online decisions to minimize overall regrets. This note is concerned with continuous-time models and algorithms for several online learning problems: online linear optimization, adversarial bandit, and adversarial linear bandit. For each problem, we extend the discrete-time algorithm to the continuous-time setting and provide a concise proof of the optimal regret bound.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline and Blended Learning · Intelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics