No-Regret Learning with Unbounded Losses: The Case of Logarithmic Pooling
Eric Neyman, Tim Roughgarden

TL;DR
This paper develops an online learning algorithm for aggregating expert probability forecasts using logarithmic pooling, achieving sublinear regret in a semi-adversarial setting with unbounded losses.
Contribution
It introduces a novel semi-adversarial framework for online expert aggregation with unbounded log loss and proposes an online mirror descent algorithm with provable regret bounds.
Findings
Achieves $O( oot{T} ext{log} T)$ regret in the proposed setting.
Extends online learning theory to unbounded loss functions.
Demonstrates effectiveness of logarithmic pooling in adversarial scenarios.
Abstract
For each of time steps, experts report probability distributions over outcomes; we wish to learn to aggregate these forecasts in a way that attains a no-regret guarantee. We focus on the fundamental and practical aggregation method known as logarithmic pooling -- a weighted average of log odds -- which is in a certain sense the optimal choice of pooling method if one is interested in minimizing log loss (as we take to be our loss function). We consider the problem of learning the best set of parameters (i.e. expert weights) in an online adversarial setting. We assume (by necessity) that the adversarial choices of outcomes and forecasts are consistent, in the sense that experts report calibrated forecasts. Imposing this constraint creates a (to our knowledge) novel semi-adversarial setting in which the adversary retains a large amount of flexibility. In this setting, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Anomaly Detection Techniques and Applications
