Logit-Q Dynamics for Efficient Learning in Stochastic Teams

Ahmed Said Donmez; Onur Unlu; and Muhammed O. Sayin

arXiv:2302.09806·cs.GT·March 17, 2025·1 cites

Logit-Q Dynamics for Efficient Learning in Stochastic Teams

Ahmed Said Donmez, Onur Unlu, and Muhammed O. Sayin

PDF

Open Access

TL;DR

This paper introduces logit-Q dynamics, a novel approach combining log-linear learning and Q-learning, to achieve efficient learning and near-optimal equilibrium in stochastic games with unknown dynamics.

Contribution

It proposes a new family of logit-Q dynamics that effectively combines existing learning methods for stochastic games, providing convergence guarantees and approximation bounds.

Findings

01

Achieves near-efficient equilibrium in stochastic teams with unknown dynamics.

02

Proves convergence of logit-Q dynamics in potential stochastic games.

03

Quantifies the approximation error of the proposed dynamics.

Abstract

We present a new family of logit-Q dynamics for efficient learning in stochastic games by combining the log-linear learning (also known as logit dynamics) for the repeated play of normal-form games with Q-learning for unknown Markov decision processes within the auxiliary stage-game framework. In this framework, we view stochastic games as agents repeatedly playing some stage game associated with the current state of the underlying game while the agents' Q-functions determine the payoffs of these stage games. We show that the logit-Q dynamics presented reach (near) efficient equilibrium in stochastic teams with unknown dynamics and quantify the approximation error. We also show the rationality of the logit-Q dynamics against agents following pure stationary strategies and the convergence of the dynamics in stochastic games where the stage-payoffs induce potential games, yet only a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Economic Policies and Impacts · Experimental Behavioral Economics Studies

MethodsSoftmax