One step futher: an explicit solution to Robbins' problem when $n=4$
R\'emi Dendievel, Yvik Swan

TL;DR
This paper provides an explicit solution to Robbins' optimal stopping problem for four observations, where the goal is to minimize the expected rank of the selected uniform random variable.
Contribution
The paper derives the first explicit solution for Robbins' problem when the number of observations is four, advancing understanding of optimal stopping rules in this setting.
Findings
Explicit solution for n=4 Robbins' problem
Optimal stopping rule minimizes expected rank
Advances theoretical understanding of sequential decision-making
Abstract
Fix some and let be independent random variables drawn from the uniform distribution on . A decision maker is shown the variables sequentially and, after each observation, must decide whether or not to keep the current one, with payoff the overall rank of the selected observation. Decisions are final: no recall is allowed, no regret is tolerated. The objective is to act in such a way as to minimise the expected payoff. In this note we give the explicit solution to this problem, known as Robbins' problem of optimal stopping, when .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Computability, Logic, AI Algorithms · Algorithms and Data Compression
