Solving Two-Player General-Sum Games Between Swarms

Mukesh Ghimire, Lei Zhang, Wenlong Zhang, Yi Ren, and Zhe Xu

arXiv:2310.01682 · cs.MA · November 6, 2023 · 1 citation

Open Access

TL;DR

This paper extends physics-informed neural network (PINN) methods to two-player swarm-level general-sum games. The resulting policies achieve a higher payoff than Nash DDQN, a reinforcement learning baseline, and perform comparably to traditional numerical solvers.

Contribution

It applies PINNs to swarm-level games, using the Kolmogorov forward equation to model how the swarm densities evolve, as a way to mitigate the curse of dimensionality that makes high-dimensional HJ PDEs intractable.

Findings

01

PINN-based policies outperform Nash DDQN in payoff.

02

PINN results are comparable to numerical solvers.

03

The approach extends agent-level two-player games to swarm-level games between two sub-swarms.

Abstract

Hamilton-Jacobi-Isaacs (HJI) PDEs are the governing equations for two-player general-sum games. Unlike Reinforcement Learning (RL) methods, which are data-intensive methods for learning value functions, learning HJ PDEs provides guaranteed convergence to the Nash Equilibrium value of the game when it exists. However, a caveat is that solving HJ PDEs becomes intractable as the state dimension increases. To circumvent the curse of dimensionality (CoD), physics-informed machine learning methods with supervision can be used and have been shown to be effective in generating equilibrial policies in two-player general-sum games. In this work, we extend the existing work on agent-level two-player games to a two-player swarm-level game, where two sub-swarms play a general-sum game. We consider the Kolmogorov forward equation as the dynamic model for the evolution of the densities…
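To make the idea of a physics-informed residual concrete, here is a minimal sketch in Python. It is not the paper's model: it assumes a 1D Kolmogorov forward (Fokker-Planck) equation with a constant drift `V` and diffusion coefficient `D`, both hypothetical values chosen for illustration, and it uses finite differences in place of the automatic differentiation a PINN would normally use. The mean squared residual computed here is the kind of quantity a physics-informed loss would penalize.

```python
import numpy as np

# Hypothetical toy parameters (not from the paper).
V = 0.5    # constant drift velocity
D = 0.1    # diffusion coefficient
S0 = 1.0   # variance of the initial Gaussian density


def density(x, t):
    """Analytic solution of rho_t + V*rho_x - D*rho_xx = 0 for a Gaussian start."""
    var = S0 + 2.0 * D * t
    return np.exp(-(x - V * t) ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)


def static_density(x, t):
    """A deliberately wrong candidate: the initial density frozen in time."""
    return density(x, np.zeros_like(t))


def kfe_residual(rho, x, t, h=1e-3):
    """Finite-difference residual of the 1D Kolmogorov forward equation."""
    d_t = (rho(x, t + h) - rho(x, t - h)) / (2.0 * h)
    d_x = (rho(x + h, t) - rho(x - h, t)) / (2.0 * h)
    d_xx = (rho(x + h, t) - 2.0 * rho(x, t) + rho(x - h, t)) / h ** 2
    return d_t + V * d_x - D * d_xx


# Collocation points: a grid in space at one fixed time.
x = np.linspace(-3.0, 3.0, 61)
t = np.full_like(x, 0.7)

res = kfe_residual(density, x, t)
loss = float(np.mean(res ** 2))          # near zero for the true solution
bad_loss = float(np.mean(kfe_residual(static_density, x, t) ** 2))

print(f"residual loss, analytic solution: {loss:.3e}")
print(f"residual loss, frozen density:    {bad_loss:.3e}")
```

The analytic density drives the residual loss to roughly zero, while the frozen density does not, which is the signal a physics-informed training loop would use to steer a neural approximation toward a valid density evolution.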

Taxonomy

Topics: Model Reduction and Neural Networks · Neural Networks and Reservoir Computing · Reinforcement Learning in Robotics