Non-Asymptotic Analysis of (Sticky) Track-and-Stop
Riccardo Poiani, Martino Bernasconi, Andrea Celli

TL;DR
This paper provides non-asymptotic performance guarantees for the Track-and-Stop and Sticky Track-and-Stop algorithms in pure exploration problems, extending their theoretical understanding beyond asymptotic regimes.
Contribution
It offers the first non-asymptotic analysis of these algorithms, including the Sticky variant, in environments with multiple correct answers.
Findings
Non-asymptotic guarantees established for Track-and-Stop.
Non-asymptotic guarantees established for Sticky Track-and-Stop.
Results extend understanding of algorithm performance in finite-sample settings.
Abstract
In pure exploration problems, a statistician sequentially collects information to answer a question about some stochastic and unknown environment. The probability of returning a wrong answer should not exceed a maximum risk parameter and good algorithms make as few queries to the environment as possible. The Track-and-Stop algorithm is a pioneering method to solve these problems. Specifically, it is well-known that it enjoys asymptotic optimality sample complexity guarantees for whenever the map from the environment to its correct answers is single-valued (e.g., best-arm identification with a unique optimal arm). The Sticky Track-and-Stop algorithm extends these results to settings where, for each environment, there might exist multiple correct answers (e.g., -optimal arm identification). Although both methods are optimal in the asymptotic regime, their…
Peer Reviews
Decision·ICLR 2026 Oral
1. The paper is well-written, with a clear structure, strong motivation, and a thorough discussion of related work. Overall, it is very easy to follow. 2. The theoretical result is a welcome addition to this field, as a non-asymptotic analysis for Track and Stop has been long-missing.
I did not find any obvious weaknesses. However, due to time constraints, I was unable to review the proof details thoroughly.
- The paper is well written and clearly structured. - The assumptions are well motivated, and the mathematical arguments are rigorous and accompanied by clear intuition. - The work contributes to a deeper understanding of the stopping behavior of TaS-type algorithms in finite-sample regimes.
- *Relation to Degenne et al. (2019) and Barrier et al. (2022) is underdeveloped* Both works introduce algorithmic variants of Track-and-Stop that address its early-phase instability and already provide non-asymptotic analyses. The paper does not sufficiently motivate why analyzing the vanilla TaS (and S-TaS) provides new insight beyond those results. The authors claim that the work of Degenne et al. (2019) requires to solve a “more challenging optimization problem,” but do not specify what di
The paper provides non-asymptotic bounds for these popular pure exploration algorithms, thereby addressing a gap in our understanding of the finite-time performance of these algorithms. The analysis seems rigorous and is clearly explained. Overall, the paper is good and I like it.
I don't see any obvjous weakness in the paper, but I have questions as outlined in the next section.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Target Tracking and Data Fusion in Sensor Networks
