Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning   under Policy Uncertainty

Hannes Eriksson; Debabrota Basu; Mina Alibeigi; Christos Dimitrakakis

arXiv:2203.10045·cs.LG·March 21, 2022·1 cites

Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning under Policy Uncertainty

Hannes Eriksson, Debabrota Basu, Mina Alibeigi, Christos Dimitrakakis

PDF

Open Access

TL;DR

This paper introduces risk-sensitive algorithms for Bayesian games in multi-agent reinforcement learning, focusing on uncertainty over agent types and demonstrating improved performance over risk-neutral methods.

Contribution

It develops risk-sensitive variants of existing algorithms like IBR, FP, and DAPG to handle type uncertainty in stochastic games, advancing the field of risk-aware multi-agent RL.

Findings

01

Risk-sensitive DAPG outperforms risk-neutral algorithms in experiments.

02

Focus on type uncertainty offers new insights into stochastic game risks.

03

Algorithms improve social welfare in general-sum stochastic games.

Abstract

In stochastic games with incomplete information, the uncertainty is evoked by the lack of knowledge about a player's own and the other players' types, i.e. the utility function and the policy space, and also the inherent stochasticity of different players' interactions. In existing literature, the risk in stochastic games has been studied in terms of the inherent uncertainty evoked by the variability of transitions and actions. In this work, we instead focus on the risk associated with the \textit{uncertainty over types}. We contrast this with the multi-agent reinforcement learning framework where the other agents have fixed stationary policies and investigate risk-sensitiveness due to the uncertainty about the other agents' adaptive policies. We propose risk-sensitive versions of existing algorithms proposed for risk-neutral stochastic games, such as Iterated Best Response (IBR),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDecision-Making and Behavioral Economics · Energy, Environment, and Transportation Policies · Reinforcement Learning in Robotics