Tackling Asymmetric and Circular Sequential Social Dilemmas with Reinforcement Learning and Graph-based Tit-for-Tat
Tangui Le Gl\'eau, Xavier Marjou, Tayeb Lemlouma, Benoit Radier

TL;DR
This paper introduces Circular Sequential Social Dilemmas (CSSD) to model asymmetric cooperation and proposes a graph-based Tit-for-Tat strategy combined with reinforcement learning to promote cooperation among self-interested agents.
Contribution
It extends existing social dilemma frameworks to include circular and asymmetric cooperation and develops a novel graph-based TFT method for multi-agent reinforcement learning.
Findings
Graph-based TFT encourages mutual cooperation in circular dilemmas.
The approach effectively handles asymmetric cooperation scenarios.
Experiments show improved cooperation among agents in a grid world environment.
Abstract
In many societal and industrial interactions, participants generally prefer their pure self-interest at the expense of the global welfare. Known as social dilemmas, this category of non-cooperative games offers situations where multiple actors should all cooperate to achieve the best outcome but greed and fear lead to a worst self-interested issue. Recently, the emergence of Deep Reinforcement Learning (RL) has generated revived interest in social dilemmas with the introduction of Sequential Social Dilemma (SSD). Cooperative agents mixing RL policies and Tit-for-tat (TFT) strategies have successfully addressed some non-optimal Nash equilibrium issues. However, this kind of paradigm requires symmetrical and direct cooperation between actors, conditions that are not met when mutual cooperation become asymmetric and is possible only with at least a third actor in a circular way. To tackle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Experimental Behavioral Economics Studies · Gambling Behavior and Treatments
Methods1x1 Convolution · Non Maximum Suppression · Convolution · SSD
