Logit-Q Dynamics for Efficient Learning in Stochastic Teams
Ahmed Said Donmez, Onur Unlu, and Muhammed O. Sayin

TL;DR
This paper introduces logit-Q dynamics, a novel approach combining log-linear learning and Q-learning, to achieve efficient learning and near-optimal equilibrium in stochastic games with unknown dynamics.
Contribution
It proposes a new family of logit-Q dynamics that effectively combines existing learning methods for stochastic games, providing convergence guarantees and approximation bounds.
Findings
Achieves near-efficient equilibrium in stochastic teams with unknown dynamics.
Proves convergence of logit-Q dynamics in potential stochastic games.
Quantifies the approximation error of the proposed dynamics.
Abstract
We present a new family of logit-Q dynamics for efficient learning in stochastic games by combining the log-linear learning (also known as logit dynamics) for the repeated play of normal-form games with Q-learning for unknown Markov decision processes within the auxiliary stage-game framework. In this framework, we view stochastic games as agents repeatedly playing some stage game associated with the current state of the underlying game while the agents' Q-functions determine the payoffs of these stage games. We show that the logit-Q dynamics presented reach (near) efficient equilibrium in stochastic teams with unknown dynamics and quantify the approximation error. We also show the rationality of the logit-Q dynamics against agents following pure stationary strategies and the convergence of the dynamics in stochastic games where the stage-payoffs induce potential games, yet only a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Economic Policies and Impacts · Experimental Behavioral Economics Studies
MethodsSoftmax
