Dynamic Non-Bayesian Decision Making
D. Monderer, M. Tennenholtz

TL;DR
This paper studies non-Bayesian decision-making in repeated incomplete information games, establishing the existence of efficient stochastic policies for perfect monitoring and analyzing limitations in imperfect monitoring scenarios.
Contribution
It introduces a new framework for non-Bayesian agents, proving the existence of near-optimal stochastic policies in perfect monitoring and characterizing the absence of such policies in imperfect monitoring.
Findings
Efficient stochastic policies exist in perfect monitoring with high probability.
No deterministic policies can achieve long-run optimality in perfect monitoring.
Deterministic efficient strategies exist under maxmin criterion in imperfect monitoring.
Abstract
The model of a non-Bayesian agent who faces a repeated game with incomplete information against Nature is an appropriate tool for modeling general agent-environment interactions. In such a model the environment state (controlled by Nature) may change arbitrarily, and the feedback/reward function is initially unknown. The agent is not Bayesian, that is he does not form a prior probability neither on the state selection strategy of Nature, nor on his reward function. A policy for the agent is a function which assigns an action to every history of observations and actions. Two basic feedback structures are considered. In one of them -- the perfect monitoring case -- the agent is able to observe the previous environment state as part of his feedback, while in the other -- the imperfect monitoring case -- all that is available to the agent is the reward obtained. Both of these settings refer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Game Theory and Applications · Complex Systems and Time Series Analysis
