Reciprocal Reward Influence Encourages Cooperation From Self-Interested   Agents

John L. Zhou; Weizhe Hong; Jonathan C. Kao

arXiv:2406.01641·cs.MA·January 16, 2025

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

John L. Zhou, Weizhe Hong, Jonathan C. Kao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Reciprocators, a new reinforcement learning approach where agents are motivated to reciprocate influence, promoting cooperation among self-interested agents without complex modeling or privileged access, effective in social dilemmas.

Contribution

The paper proposes Reciprocators, a novel learning rule-agnostic and sample-efficient method that encourages cooperation by reciprocating influence, without differentiating through opponents' policies.

Findings

01

Reciprocators successfully promote cooperation in social dilemmas.

02

The approach is learning rule-agnostic and sample-efficient.

03

Reciprocators influence opponents' Q-values to favor mutual benefit.

Abstract

Cooperation between self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naive reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging literature on opponent shaping has demonstrated the ability to reach prosocial outcomes by influencing the learning of other agents. However, such methods differentiate through the learning step of other agents or optimize for meta-game dynamics, which rely on privileged access to opponents' learning algorithms or exponential sample complexity, respectively. To provide a learning rule-agnostic and sample-efficient alternative, we introduce Reciprocators, reinforcement learning agents which are intrinsically motivated to reciprocate the influence of opponents' actions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

johnlyzhou/reciprocator
pytorchOfficial

Videos

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents· slideslive

Taxonomy

TopicsEvolutionary Game Theory and Cooperation