Specification-Guided Learning of Nash Equilibria with High Social   Welfare

Kishor Jothimurugan; Suguman Bansal; Osbert Bastani; Rajeev Alur

arXiv:2206.03348·cs.GT·June 8, 2022·1 cites

Specification-Guided Learning of Nash Equilibria with High Social Welfare

Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning framework that uses high-level specifications to efficiently find Nash equilibrium policies in multi-agent systems, maximizing social welfare.

Contribution

It presents a new specification-guided approach for training Nash equilibrium policies that prioritize social welfare, outperforming existing methods.

Findings

01

Successfully computes Nash equilibrium policies with high social welfare

02

Outperforms state-of-the-art baselines in equilibrium quality

03

Demonstrates effectiveness in challenging control problems

Abstract

Reinforcement learning has been shown to be an effective strategy for automatically training policies for challenging control problems. Focusing on non-cooperative multi-agent systems, we propose a novel reinforcement learning framework for training joint policies that form a Nash equilibrium. In our approach, rather than providing low-level reward functions, the user provides high-level specifications that encode the objective of each agent. Then, guided by the structure of the specifications, our algorithm searches over policies to identify one that provably forms an $ϵ$ -Nash equilibrium (with high probability). Importantly, it prioritizes policies in a way that maximizes social welfare across all agents. Our empirical evaluation demonstrates that our algorithm computes equilibrium policies with high social welfare, whereas state-of-the-art baselines either fail to compute Nash…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Data Stream Mining Techniques