Experience-replay Innovative Dynamics

Tuo Zhang; Leonardo Stella; Julian Barreiro-Gomez

arXiv:2501.12199·cs.LG·January 28, 2025

Experience-replay Innovative Dynamics

Tuo Zhang, Leonardo Stella, Julian Barreiro-Gomez

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent reinforcement learning algorithm based on innovative dynamics like BNN and Smith, which can potentially improve convergence properties beyond traditional replicator dynamics, supported by theoretical and empirical evidence.

Contribution

The paper develops a new MARL algorithm using experience replay and revision protocols that emulate innovative dynamics, extending theoretical guarantees beyond replicator dynamics.

Findings

01

The algorithm's trajectories can be tuned to match innovative dynamics behaviors.

02

Theoretical framework extends convergence guarantees for MARL.

03

Empirical results validate the theoretical insights.

Abstract

Despite its groundbreaking success, multi-agent reinforcement learning (MARL) still suffers from instability and nonstationarity. Replicator dynamics, the most well-known model from evolutionary game theory (EGT), provide a theoretical framework for the convergence of the trajectories to Nash equilibria and, as a result, have been used to ensure formal guarantees for MARL algorithms in stable game settings. However, they exhibit the opposite behavior in other settings, which poses the problem of finding alternatives to ensure convergence. In contrast, innovative dynamics, such as the Brown-von Neumann-Nash (BNN) or Smith, result in periodic trajectories with the potential to approximate Nash equilibria. Yet, no MARL algorithms based on these dynamics have been proposed. In response to this challenge, we develop a novel experience replay-based MARL algorithm that incorporates revision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making