Learning to Bet for Horizon-Aware Anytime-Valid Testing

Ege Onur Taga; Samet Oymak; Shubhanshu Shekhar

arXiv:2603.19551·stat.ME·March 23, 2026

Learning to Bet for Horizon-Aware Anytime-Valid Testing

Ege Onur Taga, Samet Oymak, Shubhanshu Shekhar

PDF

Open Access

TL;DR

This paper introduces horizon-aware anytime-valid tests for bounded means, utilizing a betting framework and deep reinforcement learning to optimize betting strategies across different horizons and deadlines.

Contribution

It develops a novel horizon-aware betting framework, formulates it as a finite-horizon control problem, and employs deep reinforcement learning to learn optimal betting policies.

Findings

01

Kelly betting is optimal in certain state regions.

02

Aggressive betting can be advantageous when behind schedule.

03

The learned DQN policy achieves state-of-the-art results in experiments.

Abstract

We develop horizon-aware anytime-valid tests and confidence sequences for bounded means under a strict deadline $N$ . Using the betting/e-process framework, we cast horizon-aware betting as a finite-horizon optimal control problem with state space $(t, lo g W_{t})$ , where $t$ is the time and $W_{t}$ is the test martingale value. We first show that in certain interior regions of the state space, policies that deviate significantly from Kelly betting are provably suboptimal, while Kelly betting reaches the threshold with high probability. We then identify sufficient conditions showing that outside this region, more aggressive betting than Kelly can be better if the bettor is behind schedule, and less aggressive can be better if the bettor is ahead. Taken together these results suggest a simple phase diagram in the $(t, lo g W_{t})$ plane, delineating regions where Kelly, fractional Kelly, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Advanced Causal Inference Techniques