Bob and Alice Go to a Bar: Reasoning About Future With Probabilistic Programs
David Tolpin, Tomer Dobkin

TL;DR
This paper presents a novel approach to reinforcement learning by formulating it as Bayesian inference based on stochastic agent preferences, eliminating the need for reward functions and enabling probabilistic reasoning about agent behaviors.
Contribution
It introduces a reward-free Bayesian inference framework for reinforcement learning using stochastic preferences, providing a solid probabilistic foundation for modeling agent behavior.
Findings
Successfully modeled a two-agent coordinate game
Demonstrated reasoning about a single agent in noisy environments
Unified different scenarios under the same probabilistic principles
Abstract
It is well known that reinforcement learning can be cast as inference in an appropriate probabilistic model. However, this commonly involves introducing a distribution over agent trajectories with probabilities proportional to exponentiated rewards. In this work, we formulate reinforcement learning as Bayesian inference without resorting to rewards, and show that rewards are derived from agent's preferences, rather than the other way around. We argue that agent preferences should be specified stochastically rather than deterministically. Reinforcement learning via inference with stochastic preferences naturally describes agent behaviors, does not require introducing rewards and exponential weighing of trajectories, and allows to reason about agents using the solid foundation of Bayesian statistics. Stochastic conditioning, a probabilistic programming paradigm for conditioning models on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Auction Theory and Applications
