Bob and Alice Go to a Bar: Reasoning About Future With Probabilistic   Programs

David Tolpin; Tomer Dobkin

arXiv:2108.03834·cs.AI·October 7, 2021

Bob and Alice Go to a Bar: Reasoning About Future With Probabilistic Programs

David Tolpin, Tomer Dobkin

PDF

Open Access

TL;DR

This paper presents a novel approach to reinforcement learning by formulating it as Bayesian inference based on stochastic agent preferences, eliminating the need for reward functions and enabling probabilistic reasoning about agent behaviors.

Contribution

It introduces a reward-free Bayesian inference framework for reinforcement learning using stochastic preferences, providing a solid probabilistic foundation for modeling agent behavior.

Findings

01

Successfully modeled a two-agent coordinate game

02

Demonstrated reasoning about a single agent in noisy environments

03

Unified different scenarios under the same probabilistic principles

Abstract

It is well known that reinforcement learning can be cast as inference in an appropriate probabilistic model. However, this commonly involves introducing a distribution over agent trajectories with probabilities proportional to exponentiated rewards. In this work, we formulate reinforcement learning as Bayesian inference without resorting to rewards, and show that rewards are derived from agent's preferences, rather than the other way around. We argue that agent preferences should be specified stochastically rather than deterministically. Reinforcement learning via inference with stochastic preferences naturally describes agent behaviors, does not require introducing rewards and exponential weighing of trajectories, and allows to reason about agents using the solid foundation of Bayesian statistics. Stochastic conditioning, a probabilistic programming paradigm for conditioning models on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Auction Theory and Applications