A second order regret bound for NormalHedge

Yoav Freund; Nicholas J. A. Harvey; Victor S. Portella; Yabing Qi; Yu-Xiang Wang

arXiv:2602.08151·cs.LG·February 10, 2026

A second order regret bound for NormalHedge

Yoav Freund, Nicholas J. A. Harvey, Victor S. Portella, Yabing Qi, Yu-Xiang Wang

PDF

Open Access

TL;DR

This paper introduces a variant of NormalHedge that achieves a second-order regret bound for easy sequences, leveraging continuous-time analysis and self-concordance techniques.

Contribution

It presents a novel second-order regret bound for NormalHedge, extending its theoretical guarantees for prediction with expert advice on easy sequences.

Findings

01

Achieves a second-order $oldsymbol{ extit{ extbf{O}}}(\sqrt{V_T ext{log}(V_T/ ext{ extbf{epsilon}})})$ regret bound.

02

Uses continuous-time limit and stochastic differential equations for analysis.

03

Employs self-concordance techniques in the discrete-time setting.

Abstract

We consider the problem of prediction with expert advice for ``easy'' sequences. We show that a variant of NormalHedge enjoys a second-order $ϵ$ -quantile regret bound of $O (V_{T} lo g (V_{T} / ϵ))$ when $V_{T} > lo g N$ , where $V_{T}$ is the cumulative second moment of instantaneous per-expert regret averaged with respect to a natural distribution determined by the algorithm. The algorithm is motivated by a continuous time limit using Stochastic Differential Equations. The discrete time analysis uses self-concordance techniques.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Optimization and Search Problems