Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning
Idan Attias, Steve Hanneke, Arvind Ramaswami

TL;DR
This paper introduces a new adaptive reduction method in agnostic online learning that reduces oracle complexity from doubly-exponential to polynomial, while maintaining near-optimal regret.
Contribution
It proposes a dynamic agnostic-to-realizable reduction using a weak-consistency oracle, significantly lowering oracle complexity and memory usage.
Findings
Reduces oracle complexity to O(T^{d_VC+1})
Maintains near-optimal expected regret
Provides bounds on regret-oracle complexity tradeoff
Abstract
Agnostic online learning is classically solved via a reduction to the realizable setting, utilizing Littlestone's Standard Optimal Algorithm (SOA) as a base learner. However, the SOA is computationally intractable to execute even for a single round. To overcome this barrier, recent work in oracle-efficient online learning replaces the SOA with a realizable base learner that accesses the concept class exclusively through an offline empirical risk minimization (ERM) oracle. While such agnostic learners achieve near-optimal expected regret, they suffer from a doubly-exponential oracle complexity of , where is the Littlestone dimension and is the number of rounds. In this work, we significantly improve this oracle complexity while relying on an even weaker primitive: a weak-consistency oracle, which merely decides whether a given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
