Tackling Asymmetric and Circular Sequential Social Dilemmas with   Reinforcement Learning and Graph-based Tit-for-Tat

Tangui Le Gl\'eau; Xavier Marjou; Tayeb Lemlouma; Benoit Radier

arXiv:2206.12909·cs.AI·June 28, 2022·1 cites

Tackling Asymmetric and Circular Sequential Social Dilemmas with Reinforcement Learning and Graph-based Tit-for-Tat

Tangui Le Gl\'eau, Xavier Marjou, Tayeb Lemlouma, Benoit Radier

PDF

Open Access 1 Repo

TL;DR

This paper introduces Circular Sequential Social Dilemmas (CSSD) to model asymmetric cooperation and proposes a graph-based Tit-for-Tat strategy combined with reinforcement learning to promote cooperation among self-interested agents.

Contribution

It extends existing social dilemma frameworks to include circular and asymmetric cooperation and develops a novel graph-based TFT method for multi-agent reinforcement learning.

Findings

01

Graph-based TFT encourages mutual cooperation in circular dilemmas.

02

The approach effectively handles asymmetric cooperation scenarios.

03

Experiments show improved cooperation among agents in a grid world environment.

Abstract

In many societal and industrial interactions, participants generally prefer their pure self-interest at the expense of the global welfare. Known as social dilemmas, this category of non-cooperative games offers situations where multiple actors should all cooperate to achieve the best outcome but greed and fear lead to a worst self-interested issue. Recently, the emergence of Deep Reinforcement Learning (RL) has generated revived interest in social dilemmas with the introduction of Sequential Social Dilemma (SSD). Cooperative agents mixing RL policies and Tit-for-tat (TFT) strategies have successfully addressed some non-optimal Nash equilibrium issues. However, this kind of paradigm requires symmetrical and direct cooperation between actors, conditions that are not met when mutual cooperation become asymmetric and is possible only with at least a third actor in a circular way. To tackle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

submission-conf/neurips_cooperativeai
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Game Theory and Cooperation · Experimental Behavioral Economics Studies · Gambling Behavior and Treatments

Methods1x1 Convolution · Non Maximum Suppression · Convolution · SSD