Equilibrium in Misspecified Markov Decision Processes

Ignacio Esponda; Demian Pouzo

arXiv:1502.06901·q-fin.EC·January 4, 2018

Equilibrium in Misspecified Markov Decision Processes

Ignacio Esponda, Demian Pouzo

PDF

TL;DR

This paper analyzes Markov decision processes where the agent's model is misspecified, developing an equilibrium concept to understand long-term behavior despite model uncertainty and updating beliefs via Bayes' rule.

Contribution

It introduces an equilibrium framework for misspecified Markov decision processes and characterizes steady state behavior under asymptotic analysis, extending static results to dynamic settings.

Findings

01

Equilibrium coincides with Berk-Nash equilibrium in static cases

02

Provides conditions for steady state characterization

03

Discusses issues related to negative experimentation value

Abstract

We study Markov decision problems where the agent does not know the transition probability function mapping current states and actions to future states. The agent has a prior belief over a set of possible transition functions and updates beliefs using Bayes' rule. We allow her to be misspecified in the sense that the true transition probability function is not in the support of her prior. This problem is relevant in many economic settings but is usually not amenable to analysis by the researcher. We make the problem tractable by studying asymptotic behavior. We propose an equilibrium notion and provide conditions under which it characterizes steady state behavior. In the special case where the problem is static, equilibrium coincides with the single-agent version of Berk-Nash equilibrium (Esponda and Pouzo (2016)). We also discuss subtle issues that arise exclusively in dynamic settings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.