High-Probability Risk Bounds via Sequential Predictors

Dirk van der Hoeven; Nikita Zhivotovskiy; Nicol\`o Cesa-Bianchi

arXiv:2308.07588·cs.LG·August 16, 2023

High-Probability Risk Bounds via Sequential Predictors

Dirk van der Hoeven, Nikita Zhivotovskiy, Nicol\`o Cesa-Bianchi

PDF

Open Access

TL;DR

This paper introduces a method to convert online learning algorithms into statistical estimators that achieve nearly optimal high-probability risk bounds, improving upon traditional guarantees and offering computational benefits.

Contribution

It presents a general second-order correction technique for online algorithms to obtain tight high-probability risk bounds in statistical learning.

Findings

01

Achieves nearly optimal high-probability risk bounds for classical estimation problems.

02

Enables improvements in dependency on problem parameters.

03

Offers computational advantages over batch methods.

Abstract

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical counterparts, recent findings indicate that in many important cases, regret bounds may not guarantee tight high-probability risk bounds in the statistical setting. In this work we show that online to batch conversions applied to general online learning algorithms can bypass this limitation. Via a general second-order correction to the loss function defining the regret, we obtain nearly optimal high-probability risk bounds for several classical statistical estimation problems, such as discrete distribution estimation, linear regression, logistic regression, and conditional density estimation. Our analysis relies on the fact that many online learning algorithms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Machine Learning and Algorithms