Multiagent Cooperation and Competition with Deep Reinforcement Learning

Ardi Tampuu; Tambet Matiisen; Dorian Kodelja; Ilya Kuzovkin; Kristjan; Korjus; Juhan Aru; Jaan Aru; Raul Vicente

arXiv:1511.08779·cs.AI·November 30, 2015

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan, Korjus, Juhan Aru, Jaan Aru, Raul Vicente

PDF

4 Repos

TL;DR

This paper extends Deep Q-Learning to multiagent settings, demonstrating how independent agents can learn competitive or collaborative behaviors in Pong, revealing emergent strategies and the transition between behaviors.

Contribution

It introduces a multiagent Deep Q-Network framework and explores how different reward schemes lead to diverse emergent behaviors in a classic game.

Findings

01

Competitive agents learn to score efficiently.

02

Collaborative agents maximize game duration.

03

Behavior transitions from competition to cooperation.

Abstract

Multiagent systems appear in most social, economical, and political situations. In the present work we extend the Deep Q-Learning Network architecture proposed by Google DeepMind to multiagent environments and investigate how two agents controlled by independent Deep Q-Networks interact in the classic videogame Pong. By manipulating the classical rewarding scheme of Pong we demonstrate how competitive and collaborative behaviors emerge. Competitive agents learn to play and score efficiently. Agents trained under collaborative rewarding schemes find an optimal strategy to keep the ball in the game as long as possible. We also describe the progression from competitive to collaborative behavior. The present work demonstrates that Deep Q-Networks can become a practical tool for studying the decentralized learning of multiagent systems living in highly complex environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning