Polyhedral Instability Governs Regret in Online Learning
Yuetai Li, Fengqing Jiang, Yichen Feng, Kaiyuan Zheng, Luyao Niu, Bhaskar Ramasubramanian, Basel Alomair, Linda Bushnell, Radha Poovendran

TL;DR
This paper demonstrates that regret in online convex optimization with polyhedral structure is primarily determined by the number of active region changes, linking regret bounds to polyhedral instability measures.
Contribution
It introduces the concept of polyhedral instability as a key factor governing regret, providing new bounds that interpolate between classical rates and dimension-dependent rates.
Findings
Regret scales with the square root of the number of region switches and maximum vertices per region.
In submodular-concave games, regret depends on permutation-switch count, matching known rates.
Experiments confirm the theoretical scaling and show low-instability regimes can occur in practice.
Abstract
Many online decision problems over combinatorial actions are addressed via convex relaxations, leading to online convex optimization with piecewise linear objectives and induced polyhedral structure. We show that regret in such problems is governed by \emph{polyhedral instability}: the number of changes of the active region. Under full information feedback and fixed partition assumptions, if denotes the number of region switches and the maximum number of vertices per region, we prove interpolating between experts-like and dimension-dependent OCO rates. For online submodular--concave games under Lov\'{a}sz convexification, this reduces to the permutation-switch count , yielding the matching rate . Experiments on synthetic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
