Linear Convergence of Independent Natural Policy Gradient in Games with   Entropy Regularization

Youbang Sun; Tao Liu; P. R. Kumar; Shahin Shahrampour

arXiv:2405.02769·cs.LG·May 7, 2024

Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization

Youbang Sun, Tao Liu, P. R. Kumar, Shahin Shahrampour

PDF

Open Access

TL;DR

This paper proves that the entropy-regularized independent natural policy gradient algorithm converges linearly to the quantal response equilibrium in multi-agent games, supported by theoretical analysis and empirical validation.

Contribution

It establishes the linear convergence of entropy-regularized independent NPG to QRE in multi-agent settings, extending understanding beyond Nash equilibrium approximation.

Findings

01

Convergence to QRE occurs at a linear rate under sufficient entropy regularization.

02

The results apply to various game types, including cooperative and potential games.

03

Empirical results confirm theoretical convergence in multiple game scenarios.

Abstract

This work focuses on the entropy-regularized independent natural policy gradient (NPG) algorithm in multi-agent reinforcement learning. In this work, agents are assumed to have access to an oracle with exact policy evaluation and seek to maximize their respective independent rewards. Each individual's reward is assumed to depend on the actions of all the agents in the multi-agent system, leading to a game between agents. We assume all agents make decisions under a policy with bounded rationality, which is enforced by the introduction of entropy regularization. In practice, a smaller regularization implies the agents are more rational and behave closer to Nash policies. On the other hand, agents with larger regularization acts more randomly, which ensures more exploration. We show that, under sufficient entropy regularization, the dynamics of this system converge at a linear rate to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic Policies and Impacts