Learning to Bet for Horizon-Aware Anytime-Valid Testing
Ege Onur Taga, Samet Oymak, Shubhanshu Shekhar

TL;DR
This paper introduces horizon-aware anytime-valid tests for bounded means, utilizing a betting framework and deep reinforcement learning to optimize betting strategies across different horizons and deadlines.
Contribution
It develops a novel horizon-aware betting framework, formulates it as a finite-horizon control problem, and employs deep reinforcement learning to learn optimal betting policies.
Findings
Kelly betting is optimal in certain state regions.
Aggressive betting can be advantageous when behind schedule.
The learned DQN policy achieves state-of-the-art results in experiments.
Abstract
We develop horizon-aware anytime-valid tests and confidence sequences for bounded means under a strict deadline . Using the betting/e-process framework, we cast horizon-aware betting as a finite-horizon optimal control problem with state space , where is the time and is the test martingale value. We first show that in certain interior regions of the state space, policies that deviate significantly from Kelly betting are provably suboptimal, while Kelly betting reaches the threshold with high probability. We then identify sufficient conditions showing that outside this region, more aggressive betting than Kelly can be better if the bettor is behind schedule, and less aggressive can be better if the bettor is ahead. Taken together these results suggest a simple phase diagram in the plane, delineating regions where Kelly, fractional Kelly, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Advanced Causal Inference Techniques
