Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models
Vincent Lim, Ellen Novoseller, Jeffrey Ichnowski, Huang Huang, Ken, Goldberg

TL;DR
This paper introduces RL-DAD, a reinforcement learning approach for optimal experimental design in non-differentiable implicit models, enabling efficient, simulation-based experiment planning without requiring likelihood computations.
Contribution
It reformulates Bayesian optimal experimental design as a Markov Decision Process and applies deep reinforcement learning to learn effective design policies for implicit models.
Findings
RL-DAD performs competitively with existing methods on benchmark tasks.
The approach enables quick online deployment of experiment policies.
It extends policy-based Bayesian experimental design to non-differentiable models.
Abstract
For applications in healthcare, physics, energy, robotics, and many other fields, designing maximally informative experiments is valuable, particularly when experiments are expensive, time-consuming, or pose safety hazards. While existing approaches can sequentially design experiments based on prior observation history, many of these methods do not extend to implicit models, where simulation is possible but computing the likelihood is intractable. Furthermore, they often require either significant online computation during deployment or a differentiable simulation system. We introduce Reinforcement Learning for Deep Adaptive Design (RL-DAD), a method for simulation-based optimal experimental design for non-differentiable implicit models. RL-DAD extends prior work in policy-based Bayesian Optimal Experimental Design (BOED) by reformulating it as a Markov Decision Process with a reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Optimal Experimental Design Methods · Machine Learning in Materials Science
