Optimizing Social Utility in Sequential Experiments
Ander Artola Velasco, Stratis Tsirtsis, Manuel Gomez-Rodriguez

TL;DR
This paper proposes a sequential experimental protocol with partial subsidy to improve social utility in high-cost, high-stakes product development trials, demonstrating significant efficiency gains.
Contribution
It introduces a belief MDP-based framework for optimal sequential experimentation with subsidies, enabling efficient computation of strategies and social welfare maximization.
Findings
Social utility can be increased by over 35% with the proposed protocol.
Optimal strategies can be computed efficiently via dynamic programming.
The social utility function is piecewise linear and convex over subsidy levels.
Abstract
Regulatory approval of products in high-stakes domains such as drug development requires statistical evidence of safety and efficacy through large-scale randomized controlled trials. However, the high financial cost of these trials may deter developers who lack absolute certainty in their product's efficacy, ultimately stifling the development of `moonshot' products that could offer high social utility. To address this inefficiency, in this paper, we introduce a statistical protocol for experimentation where the product developer (the agent) conducts a randomized controlled trial sequentially and the regulator (the principal) partially subsidizes its cost. By modeling the protocol using a belief Markov decision process, we show that the agent's optimal strategy can be found efficiently using dynamic programming. Further, we show that the social utility is a piecewise linear and convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
