Non-Asymptotic Analysis of (Sticky) Track-and-Stop

Riccardo Poiani; Martino Bernasconi; Andrea Celli

arXiv:2505.22475·cs.LG·February 19, 2026

Non-Asymptotic Analysis of (Sticky) Track-and-Stop

Riccardo Poiani, Martino Bernasconi, Andrea Celli

PDF

Open Access 3 Reviews

TL;DR

This paper provides non-asymptotic performance guarantees for the Track-and-Stop and Sticky Track-and-Stop algorithms in pure exploration problems, extending their theoretical understanding beyond asymptotic regimes.

Contribution

It offers the first non-asymptotic analysis of these algorithms, including the Sticky variant, in environments with multiple correct answers.

Findings

01

Non-asymptotic guarantees established for Track-and-Stop.

02

Non-asymptotic guarantees established for Sticky Track-and-Stop.

03

Results extend understanding of algorithm performance in finite-sample settings.

Abstract

In pure exploration problems, a statistician sequentially collects information to answer a question about some stochastic and unknown environment. The probability of returning a wrong answer should not exceed a maximum risk parameter $δ$ and good algorithms make as few queries to the environment as possible. The Track-and-Stop algorithm is a pioneering method to solve these problems. Specifically, it is well-known that it enjoys asymptotic optimality sample complexity guarantees for $δ \to 0$ whenever the map from the environment to its correct answers is single-valued (e.g., best-arm identification with a unique optimal arm). The Sticky Track-and-Stop algorithm extends these results to settings where, for each environment, there might exist multiple correct answers (e.g., $ϵ$ -optimal arm identification). Although both methods are optimal in the asymptotic regime, their…

Peer Reviews

Decision·ICLR 2026 Oral

Reviewer 01Rating 6Confidence 3

Strengths

1. The paper is well-written, with a clear structure, strong motivation, and a thorough discussion of related work. Overall, it is very easy to follow. 2. The theoretical result is a welcome addition to this field, as a non-asymptotic analysis for Track and Stop has been long-missing.

Weaknesses

I did not find any obvious weaknesses. However, due to time constraints, I was unable to review the proof details thoroughly.

Reviewer 02Rating 4Confidence 2

Strengths

- The paper is well written and clearly structured. - The assumptions are well motivated, and the mathematical arguments are rigorous and accompanied by clear intuition. - The work contributes to a deeper understanding of the stopping behavior of TaS-type algorithms in finite-sample regimes.

Weaknesses

- *Relation to Degenne et al. (2019) and Barrier et al. (2022) is underdeveloped* Both works introduce algorithmic variants of Track-and-Stop that address its early-phase instability and already provide non-asymptotic analyses. The paper does not sufficiently motivate why analyzing the vanilla TaS (and S-TaS) provides new insight beyond those results. The authors claim that the work of Degenne et al. (2019) requires to solve a “more challenging optimization problem,” but do not specify what di

Reviewer 03Rating 8Confidence 3

Strengths

The paper provides non-asymptotic bounds for these popular pure exploration algorithms, thereby addressing a gap in our understanding of the finite-time performance of these algorithms. The analysis seems rigorous and is clearly explained. Overall, the paper is good and I like it.

Weaknesses

I don't see any obvjous weakness in the paper, but I have questions as outlined in the next section.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Target Tracking and Data Fusion in Sensor Networks