Full Swap Regret and Discretized Calibration
Maxwell Fishelson, Robert Kleinberg, Princewill Okoroafor, Renato Paes, Leme, Jon Schneider, Yifeng Teng

TL;DR
This paper introduces new algorithms for minimizing swap regret in complex game settings and applies them to improve online forecasting calibration, achieving sublinear regret bounds.
Contribution
It develops efficient algorithms for full swap regret minimization in high-dimensional convex action sets and connects these to calibration in online forecasting.
Findings
Achieves swap regret bounds of O(T^{(d+1)/(d+3)}) in structured games.
Designs algorithms with full swap regret bounds from O(T^{d/(d+2)}) to O(T^{(d+1)/(d+2)}).
Provides online forecasting algorithms with (T^{1/3}) calibration error.
Abstract
We study the problem of minimizing swap regret in structured normal-form games. Players have a very large (potentially infinite) number of pure actions, but each action has an embedding into -dimensional space and payoffs are given by bilinear functions of these embeddings. We provide an efficient learning algorithm for this setting that incurs at most swap regret after rounds. To achieve this, we introduce a new online learning problem we call \emph{full swap regret minimization}. In this problem, a learner repeatedly takes a (randomized) action in a bounded convex -dimensional action set and then receives a loss from the adversary, with the goal of minimizing their regret with respect to the \emph{worst-case} swap function mapping to . For varied assumptions about the convexity and smoothness of the loss…
Peer Reviews
Decision·ALT 2025
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear reactor physics and engineering
MethodsSparse Evolutionary Training
