Inverse Bayesian Optimization: Learning Human Acquisition Functions in   an Exploration vs Exploitation Search Task

Nathan Sandholtz; Yohsuke Miyamoto; Luke Bornn; Maurice Smith

arXiv:2104.09237·cs.HC·February 4, 2022

Inverse Bayesian Optimization: Learning Human Acquisition Functions in an Exploration vs Exploitation Search Task

Nathan Sandholtz, Yohsuke Miyamoto, Luke Bornn, Maurice Smith

PDF

1 Repo

TL;DR

This paper presents a probabilistic framework to infer human acquisition functions in exploration-exploitation tasks by modeling observed behavior as samples from Bayesian optimization, improving understanding of human decision-making.

Contribution

It introduces a novel inverse Bayesian optimization method to estimate human acquisition functions from behavioral data, allowing for deviations and augmentations to standard models.

Findings

01

Many subjects show exploration preferences beyond standard acquisition functions.

02

Augmented acquisition functions better fit human behavior in the task.

03

The framework enables inference of individual human strategies in optimization tasks.

Abstract

This paper introduces a probabilistic framework to estimate parameters of an acquisition function given observed human behavior that can be modeled as a collection of sample paths from a Bayesian optimization procedure. The methodology involves defining a likelihood on observed human behavior from an optimization task, where the likelihood is parameterized by a Bayesian optimization subroutine governed by an unknown acquisition function. This structure enables us to make inference on a subject's acquisition function while allowing their behavior to deviate around the solution to the Bayesian optimization subroutine. To test our methods, we designed a sequential optimization task which forced subjects to balance exploration and exploitation in search of an invisible target location. Applying our proposed methods to the resulting data, we find that many subjects tend to exhibit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nsandholtz/hotspot_paper
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.