Policy-Based Bayesian Experimental Design for Non-Differentiable   Implicit Models

Vincent Lim; Ellen Novoseller; Jeffrey Ichnowski; Huang Huang; Ken; Goldberg

arXiv:2203.04272·cs.LG·March 9, 2022

Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models

Vincent Lim, Ellen Novoseller, Jeffrey Ichnowski, Huang Huang, Ken, Goldberg

PDF

Open Access

TL;DR

This paper introduces RL-DAD, a reinforcement learning approach for optimal experimental design in non-differentiable implicit models, enabling efficient, simulation-based experiment planning without requiring likelihood computations.

Contribution

It reformulates Bayesian optimal experimental design as a Markov Decision Process and applies deep reinforcement learning to learn effective design policies for implicit models.

Findings

01

RL-DAD performs competitively with existing methods on benchmark tasks.

02

The approach enables quick online deployment of experiment policies.

03

It extends policy-based Bayesian experimental design to non-differentiable models.

Abstract

For applications in healthcare, physics, energy, robotics, and many other fields, designing maximally informative experiments is valuable, particularly when experiments are expensive, time-consuming, or pose safety hazards. While existing approaches can sequentially design experiments based on prior observation history, many of these methods do not extend to implicit models, where simulation is possible but computing the likelihood is intractable. Furthermore, they often require either significant online computation during deployment or a differentiable simulation system. We introduce Reinforcement Learning for Deep Adaptive Design (RL-DAD), a method for simulation-based optimal experimental design for non-differentiable implicit models. RL-DAD extends prior work in policy-based Bayesian Optimal Experimental Design (BOED) by reformulating it as a Markov Decision Process with a reward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Optimal Experimental Design Methods · Machine Learning in Materials Science