Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning   Algorithms

Liyuan Zheng; Tanner Fiez; Zane Alumbaugh; Benjamin Chasnov and; Lillian J. Ratliff

arXiv:2109.12286·cs.LG·September 28, 2021·1 cites

Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Liyuan Zheng, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov and, Lillian J. Ratliff

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a game-theoretic Stackelberg framework for actor-critic reinforcement learning, leading to algorithms with improved convergence and performance over traditional methods.

Contribution

It models actor-critic interactions as a Stackelberg game and develops a meta-framework with theoretical convergence guarantees and empirical performance improvements.

Findings

01

Mitigates cycling in learning dynamics

02

Accelerates convergence compared to gradient dynamics

03

Outperforms standard actor-critic algorithms in experiments

Abstract

The hierarchical interaction between the actor and critic in actor-critic based reinforcement learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this viewpoint and model the actor and critic interaction as a two-player general-sum game with a leader-follower structure known as a Stackelberg game. Given this abstraction, we propose a meta-framework for Stackelberg actor-critic algorithms where the leader player follows the total derivative of its objective instead of the usual individual gradient. From a theoretical standpoint, we develop a policy gradient theorem for the refined update and provide a local convergence guarantee for the Stackelberg actor-critic algorithms to a local Stackelberg equilibrium. From an empirical standpoint, we demonstrate via simple examples that the learning dynamics we study mitigate cycling and accelerate convergence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leozhengzly/stackelberg-actor-critic-algos
pytorchOfficial

Videos

Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Neural Networks and Reservoir Computing