Performative Reinforcement Learning

Debmalya Mandal, Stelios Triantafyllou, and Goran Radanovic

arXiv:2207.00046·cs.LG·June 8, 2023

TL;DR

This paper introduces performative reinforcement learning, analyzing how policies influence environment dynamics, and demonstrates convergence to stable policies through theoretical proofs and experiments.

Contribution

It formalizes performative reinforcement learning, introduces performatively stable policies, and provides convergence analysis for various optimization settings.
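The stability notion named above can be written compactly. The block below is a sketch of the definition; the symbols (the policy-dependent MDP $M(\pi)$, the value $V^{\pi'}_{\pi}$, the discount factor $\gamma$, and the initial distribution $\rho$) are notation assumed here for illustration and do not appear in the text on this page.

```latex
% Deploying policy \pi induces an environment with policy-dependent
% transitions P_\pi and rewards r_\pi (assumed notation).
\[
  M(\pi) = (\mathcal{S}, \mathcal{A}, P_\pi, r_\pi, \rho)
\]
% Value of a policy \pi' evaluated in the environment induced by \pi.
\[
  V^{\pi'}_{\pi}(\rho) = \mathbb{E}\Big[ \sum_{t \ge 0} \gamma^{t}\, r_\pi(s_t, a_t)
  \,\Big|\, s_0 \sim \rho,\ a_t \sim \pi'(\cdot \mid s_t),\
  s_{t+1} \sim P_\pi(\cdot \mid s_t, a_t) \Big]
\]
% A performatively stable policy is optimal for the environment it induces.
\[
  \pi_S \in \arg\max_{\pi'} V^{\pi'}_{\pi_S}(\rho)
\]
```

In words: once $\pi_S$ is deployed and the environment has responded, re-optimizing against that environment gives the learner no reason to change policy.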

Findings

01 Repeated optimization converges to performatively stable policies.

02 Gradient ascent steps also lead to convergence under certain conditions.

03 Experimental results show convergence depends on regularization, smoothness, and sample size.
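Finding 01 can be illustrated with a toy sketch of repeated retraining. Everything in it is an assumption made for illustration: the two-state MDP, the response model in which only rewards react to the deployed policy (`r0 - eps * pi`), and the use of entropy-regularized value iteration as a stand-in for the paper's regularized objective. It is not the authors' algorithm or experimental setup.

```python
import numpy as np

def soft_optimal_policy(P, r, gamma=0.9, tau=1.0, iters=500):
    """Entropy-regularized value iteration on a fixed MDP.

    P has shape (S, A, S), r has shape (S, A). Returns a policy of shape (S, A).
    """
    Q = np.zeros_like(r)
    for _ in range(iters):
        m = Q.max(axis=1, keepdims=True)
        # Soft value: tau * logsumexp(Q / tau), computed stably.
        V = (m + tau * np.log(np.exp((Q - m) / tau).sum(axis=1, keepdims=True)))[:, 0]
        Q = r + gamma * P @ V
    logits = (Q - Q.max(axis=1, keepdims=True)) / tau
    pi = np.exp(logits)
    return pi / pi.sum(axis=1, keepdims=True)

def environment_response(pi, r0, eps):
    """Hypothetical response model: actions played more often pay less."""
    return r0 - eps * pi

def repeated_retraining(P, r0, eps=0.1, rounds=30):
    """Deploy a policy, observe the induced rewards, re-optimize, repeat."""
    n_states, n_actions = r0.shape
    pi = np.full((n_states, n_actions), 1.0 / n_actions)
    gaps = []
    for _ in range(rounds):
        r = environment_response(pi, r0, eps)
        new_pi = soft_optimal_policy(P, r)
        gaps.append(np.abs(new_pi - pi).max())  # change between deployments
        pi = new_pi
    return pi, gaps

# Toy base environment (assumed numbers, not from the paper).
P0 = np.array([[[0.9, 0.1], [0.2, 0.8]],
               [[0.7, 0.3], [0.1, 0.9]]])
r0 = np.array([[1.0, 0.0],
               [0.0, 0.5]])

pi_stable, gaps = repeated_retraining(P0, r0)
print("last policy change:", gaps[-1])
```

With a weak response (`eps`) relative to the regularization strength (`tau`), each retraining round changes the policy less than the previous one, so the iterates settle at a policy that is optimal for the rewards it induces. Raising `eps` or lowering `tau` weakens that guarantee in this sketch, which is in line with the dependence on regularization noted in finding 03.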

Abstract

We introduce the framework of performative reinforcement learning where the policy chosen by the learner affects the underlying reward and transition dynamics of the environment. Following the recent literature on performative prediction (Perdomo et al., 2020), we introduce the concept of performatively stable policy. We then consider a regularized version of the reinforcement learning problem and show that repeatedly optimizing this objective converges to a performatively stable policy under reasonable assumptions on the transition dynamics. Our proof utilizes the dual perspective of the reinforcement learning problem and may be of independent interest in analyzing the convergence of other algorithms with decision-dependent environments. We then extend our results for the setting where the learner just performs gradient ascent steps instead of fully optimizing the objective, and…

Videos

Performative Reinforcement Learning · SlidesLive

Taxonomy

Topics: Reinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Neural dynamics and brain function