On the Monotonicity and Rate of Convergence of the Markovian Persuasion Value
Dimitry Shaiderman

TL;DR
This paper analyzes the properties of the Markovian persuasion model, showing how the discounted value behaves as the discount factor varies and providing bounds on its convergence rate in ergodic cases.
Contribution
It establishes the monotonicity of the discounted value trajectory and derives an upper bound on the convergence rate for ergodic Markov chains in persuasion games.
Findings
The discounted value decreases monotonically with the discount factor.
An upper bound on the convergence rate as the discount factor approaches 1.
Extension of results to Markov chain games.
Abstract
We study a dynamic Bayesian persuasion model called Markovian persuasion. In such a model, the belief of the receiver regarding the current state of a Markov chain , over a finite state space , is controlled through signals she obtains from a sender, who observes in real time. At each stage , the receiver takes an action based on his current belief, which together with the realized state of , determines the 'th stage payoff of the sender. The sender's goal in a Markovian persuasion game is to find a signaling policy that maximizes her expected -discounted sum of stage payoffs for a discount factor . We show that starting from any invariant distribution the trajectory of the -discounted value is a monotone decreasing in . By combining this result with the opposite increasing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Queuing Theory Analysis · Advanced Bandit Algorithms Research
