Prosocial learning agents solve generalized Stag Hunts better than selfish ones
Alexander Peysakhovich, Adam Lerer

TL;DR
This paper demonstrates that making a single reinforcement learning agent prosocial by caring about its partners' rewards can significantly improve group outcomes in Stag Hunt environments, including complex scenarios with raw inputs.
Contribution
The study introduces a simple modification to reward functions, making one agent prosocial, which enhances convergence to optimal outcomes in multi-agent Stag Hunt games.
Findings
Prosocial agents increase likelihood of cooperative convergence.
Single-agent prosociality improves long-term group payoffs.
Results extend to complex environments with raw pixel inputs.
Abstract
Deep reinforcement learning has become an important paradigm for constructing agents that can enter complex multi-agent situations and improve their policies through experience. One commonly used technique is reactive training - applying standard RL methods while treating other agents as a part of the learner's environment. It is known that in general-sum games reactive training can lead groups of agents to converge to inefficient outcomes. We focus on one such class of environments: Stag Hunt games. Here agents either choose a risky cooperative policy (which leads to high payoffs if both choose it but low payoffs to an agent who attempts it alone) or a safe one (which leads to a safe payoff no matter what). We ask how we can change the learning rule of a single agent to improve its outcomes in Stag Hunts that include other reactive learners. We extend existing work on reward-shaping in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Experimental Behavioral Economics Studies · Reinforcement Learning in Robotics
