Prosocial learning agents solve generalized Stag Hunts better than   selfish ones

Alexander Peysakhovich; Adam Lerer

arXiv:1709.02865·cs.AI·December 11, 2017·25 cites

Prosocial learning agents solve generalized Stag Hunts better than selfish ones

Alexander Peysakhovich, Adam Lerer

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that making a single reinforcement learning agent prosocial by caring about its partners' rewards can significantly improve group outcomes in Stag Hunt environments, including complex scenarios with raw inputs.

Contribution

The study introduces a simple modification to reward functions, making one agent prosocial, which enhances convergence to optimal outcomes in multi-agent Stag Hunt games.

Findings

01

Prosocial agents increase likelihood of cooperative convergence.

02

Single-agent prosociality improves long-term group payoffs.

03

Results extend to complex environments with raw pixel inputs.

Abstract

Deep reinforcement learning has become an important paradigm for constructing agents that can enter complex multi-agent situations and improve their policies through experience. One commonly used technique is reactive training - applying standard RL methods while treating other agents as a part of the learner's environment. It is known that in general-sum games reactive training can lead groups of agents to converge to inefficient outcomes. We focus on one such class of environments: Stag Hunt games. Here agents either choose a risky cooperative policy (which leads to high payoffs if both choose it but low payoffs to an agent who attempts it alone) or a safe one (which leads to a safe payoff no matter what). We ask how we can change the learning rule of a single agent to improve its outcomes in Stag Hunts that include other reactive learners. We extend existing work on reward-shaping in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sharedcare/MARL-Cooperative
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Game Theory and Cooperation · Experimental Behavioral Economics Studies · Reinforcement Learning in Robotics