Diffusion Approximations for a Class of Sequential Testing Problems
Victor F. Araman, Rene Caldentey

TL;DR
This paper develops a diffusion approximation for Bayesian sequential experimentation problems, providing insights into optimal decision-making under uncertainty with applications in assortment selection and e-commerce.
Contribution
It introduces a diffusion asymptotic analysis for complex sequential testing problems, showing quadratic growth of complexity with action set size and demonstrating practical heuristics.
Findings
Diffusion approximation offers valuable insights into sequential testing.
Complexity grows quadratically with the number of actions.
Heuristics derived are effective and robust in practical scenarios.
Abstract
We consider a decision maker who must choose an action in order to maximize a reward function that depends also on an unknown parameter {\Theta}. The decision maker can delay taking the action in order to experiment and gather additional information on {\Theta}. We model the decision maker's problem using a Bayesian sequential experimentation framework and use dynamic programming and diffusion-asymptotic analysis to solve it. For that, we scale our problem in a way that both the average number of experiments that is conducted per unit of time is large and the informativeness of each individual experiment is low. Under such regime, we derive a diffusion approximation for the sequential experimentation problem, which provides a number of important insights about the nature of the problem and its solution. Our solution method also shows that the complexity of the problem grows only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Statistical Process Monitoring · Statistical Methods in Clinical Trials
MethodsDiffusion
