On the Optimal Amount of Experimentation in Sequential Decision Problems

Dinah Rosenberg; Eilon Solan; Nicolas Vieille

arXiv:0907.2002·math.PR·July 14, 2009

On the Optimal Amount of Experimentation in Sequential Decision Problems

Dinah Rosenberg, Eilon Solan, Nicolas Vieille

PDF

TL;DR

This paper establishes a precise bound on the optimal experimentation level in sequential decision-making, demonstrating its relevance through a bound on the cut-off in a one-arm bandit problem.

Contribution

It introduces a tight bound on experimentation in optimal strategies and applies it to a specific bandit problem scenario.

Findings

01

Derived a tight bound on experimentation in sequential decisions

02

Applied the bound to determine the cut-off in a one-arm bandit problem

03

Demonstrated the practical relevance of the theoretical result

Abstract

We provide a tight bound on the amount of experimentation under the optimal strategy in sequential decision problems. We show the applicability of the result by providing a bound on the cut-off in a one-arm bandit problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.