Social Influence as Intrinsic Motivation for Multi-Agent Deep   Reinforcement Learning

Natasha Jaques; Angeliki Lazaridou; Edward Hughes; Caglar Gulcehre,; Pedro A. Ortega; DJ Strouse; Joel Z. Leibo; Nando de Freitas

arXiv:1810.08647·cs.LG·June 20, 2019·263 cites

Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning

Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre,, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas

PDF

Open Access 3 Repos

TL;DR

This paper introduces a causal influence-based reward mechanism for multi-agent deep reinforcement learning, promoting coordination and communication by incentivizing agents to have a significant impact on others' actions, leading to improved social behaviors.

Contribution

It presents a novel decentralized influence reward method using counterfactual reasoning, enabling agents to learn diverse policies and emergent communication without centralized training.

Findings

01

Enhanced coordination and communication in social dilemma environments.

02

Dramatic improvements in learning speed and policy diversity.

03

Agents develop meaningful communication protocols.

Abstract

We propose a unified mechanism for achieving coordination and communication in Multi-Agent Reinforcement Learning (MARL), through rewarding agents for having causal influence over other agents' actions. Causal influence is assessed using counterfactual reasoning. At each timestep, an agent simulates alternate actions that it could have taken, and computes their effect on the behavior of other agents. Actions that lead to bigger changes in other agents' behavior are considered influential and are rewarded. We show that this is equivalent to rewarding agents for having high mutual information between their actions. Empirical results demonstrate that influence leads to enhanced coordination and communication in challenging social dilemma environments, dramatically increasing the learning curves of the deep RL agents, and leading to more meaningful learned communication protocols. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies