Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning

Idan Attias; Steve Hanneke; Arvind Ramaswami

arXiv:2605.07155·cs.LG·May 11, 2026

Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning

Idan Attias, Steve Hanneke, Arvind Ramaswami

PDF

TL;DR

This paper introduces a new adaptive reduction method in agnostic online learning that reduces oracle complexity from doubly-exponential to polynomial, while maintaining near-optimal regret.

Contribution

It proposes a dynamic agnostic-to-realizable reduction using a weak-consistency oracle, significantly lowering oracle complexity and memory usage.

Findings

01

Reduces oracle complexity to O(T^{d_VC+1})

02

Maintains near-optimal expected regret

03

Provides bounds on regret-oracle complexity tradeoff

Abstract

Agnostic online learning is classically solved via a reduction to the realizable setting, utilizing Littlestone's Standard Optimal Algorithm (SOA) as a base learner. However, the SOA is computationally intractable to execute even for a single round. To overcome this barrier, recent work in oracle-efficient online learning replaces the SOA with a realizable base learner that accesses the concept class exclusively through an offline empirical risk minimization (ERM) oracle. While such agnostic learners achieve near-optimal expected regret, they suffer from a doubly-exponential oracle complexity of $O (T^{2^{O (d_{LD})}})$ , where $d_{LD}$ is the Littlestone dimension and $T$ is the number of rounds. In this work, we significantly improve this oracle complexity while relying on an even weaker primitive: a weak-consistency oracle, which merely decides whether a given…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.