Stopping on the last success with unknown odds: Impossibility barriers and quantitative oracle bounds
Davy Paindaveine

TL;DR
This paper analyzes the last-success problem with unknown success probability, deriving explicit bounds and impossibility results for oracle-free strategies in sequential Bernoulli trials.
Contribution
It introduces a recursive expression for the plug-in rule's success probability and establishes fundamental limits on oracle-free decision rules.
Findings
Exact expression for plug-in rule success probability via recursion.
Finite-horizon bounds and minimax lower bounds for unknown p.
Asymptotic oracle-optimality in sparse regimes where p→0.
Abstract
We consider the classical last-success problem for sequential Bernoulli trials in the homogeneous setting where are i.i.d. but the success probability is unknown to the decision maker. When is known, Bruss' sum-the-odds theorem yields an optimal threshold rule with value . We study a natural oracle-free plug-in rule that replaces by the online empirical estimate and we denote its win probability by . First, we derive an exact expression for via a recursion for the state probabilities, enabling explicit comparisons with and revealing a finite-horizon separation between plug-in and oracle performance. Next, we formalize a first decision-theoretic obstruction inherent to the unknown- formulation: for every fixed , the dominance partial order on -blind (possibly randomized)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
