Uniformly bounded regret in the multi-secretary problem
Alessandro Arlotto, Itai Gurvich

TL;DR
This paper introduces an adaptive policy for the multi-secretary problem that guarantees a uniformly bounded regret regardless of the number of candidates or budget, highlighting the importance of adaptiveness in online decision-making.
Contribution
The paper presents a novel adaptive Budget-Ratio policy that achieves uniformly bounded regret in the multi-secretary problem, a significant improvement over non-adaptive strategies.
Findings
The proposed policy achieves a regret that does not grow with n or k.
Non-adaptive policies have regret growing like the square root of n.
Adaptiveness is essential for minimizing regret in the multi-secretary problem.
Abstract
In the secretary problem of Cayley (1875) and Moser (1956), non-negative, independent, random variables with common distribution are sequentially presented to a decision maker who decides when to stop and collect the most recent realization. The goal is to maximize the expected value of the collected element. In the -choice variant, the decision maker is allowed to make selections to maximize the expected total value of the selected elements. Assuming that the values are drawn from a known distribution with finite support, we prove that the best regret---the expected gap between the optimal online policy and its offline counterpart in which all values are made visible at time ---is uniformly bounded in the the number of candidates and the budget . Our proof is constructive: we develop an adaptive Budget-Ratio policy that achieves this performance. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
