Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou, Weizhe Hong, Jonathan C. Kao

TL;DR
This paper introduces Reciprocators, a new reinforcement learning approach where agents are motivated to reciprocate influence, promoting cooperation among self-interested agents without complex modeling or privileged access, effective in social dilemmas.
Contribution
The paper proposes Reciprocators, a novel learning rule-agnostic and sample-efficient method that encourages cooperation by reciprocating influence, without differentiating through opponents' policies.
Findings
Reciprocators successfully promote cooperation in social dilemmas.
The approach is learning rule-agnostic and sample-efficient.
Reciprocators influence opponents' Q-values to favor mutual benefit.
Abstract
Cooperation between self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naive reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging literature on opponent shaping has demonstrated the ability to reach prosocial outcomes by influencing the learning of other agents. However, such methods differentiate through the learning step of other agents or optimize for meta-game dynamics, which rely on privileged access to opponents' learning algorithms or exponential sample complexity, respectively. To provide a learning rule-agnostic and sample-efficient alternative, we introduce Reciprocators, reinforcement learning agents which are intrinsically motivated to reciprocate the influence of opponents' actions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsEvolutionary Game Theory and Cooperation
