Smooth Fictitious Play in Stochastic Games with Perturbed Payoffs and Unknown Transitions
Lucas Baudin, Rida Laraki

TL;DR
This paper introduces a family of decentralized, model-free learning algorithms for stochastic games that extend smooth fictitious play, ensuring convergence to regularized Nash equilibria even with perturbed payoffs and unknown transitions.
Contribution
It develops new regularized, decentralized algorithms for stochastic games that do not require knowledge of transition probabilities or exact payoffs, extending classical fictitious play to more realistic settings.
Findings
Algorithms converge to stationary regularized Nash equilibria in stochastic games.
Procedures also converge to optimal policies in single-player MDPs, serving as an alternative to Q-learning.
The methods are model-free and handle perturbed payoffs, broadening applicability.
Abstract
Recent extensions to dynamic games of the well-known fictitious play learning procedure in static games were proved to globally converge to stationary Nash equilibria in two important classes of dynamic games (zero-sum and identical-interest discounted stochastic games). However, those decentralized algorithms need the players to know exactly the model (the transition probabilities and their payoffs at every stage). To overcome these strong assumptions, our paper introduces regularizations of the systems in (Leslie 2020; Baudin 2022) to construct a family of new decentralized learning algorithms which are model-free (players don't know the transitions and their payoffs are perturbed at every stage). Our procedures can be seen as extensions to stochastic games of the classical smooth fictitious play learning procedures in static games (where the players best responses are regularized,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExperimental Behavioral Economics Studies · Decision-Making and Behavioral Economics · Economic theories and models
