Learning cooperative behaviours in adversarial multi-agent systems

Ni Wang; Gautham P. Das; Alan G. Millard

arXiv:2302.05528·cs.AI·February 14, 2023

TL;DR

This paper introduces TripleSumo, an extended multi-agent platform for studying cooperative behaviors in adversarial environments with continuous actions, demonstrating how agents can learn cooperation through reinforcement learning.

Contribution

It extends RoboSumo to TripleSumo, enabling investigation of cooperative behaviors in continuous, contact-rich adversarial settings with a new training scenario.

Findings

01. Agents can learn effective cooperation strategies.

02. Hybrid reward structures improve learning efficiency.

03. Cooperative behaviors increase winning probabilities.

Abstract

This work extends an existing virtual multi-agent platform called RoboSumo to create TripleSumo, a platform for investigating multi-agent cooperative behaviors in continuous action spaces, with physical contact in an adversarial environment. In this paper we investigate a scenario in which two agents, namely 'Bug' and 'Ant', must team up and push another agent, 'Spider', out of the arena. To achieve this goal, the newly added agent 'Bug' is trained during an ongoing match between 'Ant' and 'Spider'. 'Bug' must develop awareness of the other agents' actions, infer the strategy of both sides, and eventually learn an action policy to cooperate. The reinforcement learning algorithm Deep Deterministic Policy Gradient (DDPG) is implemented with a hybrid reward structure combining dense and sparse rewards. The cooperative behavior is quantitatively evaluated by the mean probability of winning…
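To make the phrase "hybrid reward structure combining dense and sparse rewards" concrete, here is a minimal Python sketch. It is not the paper's reward function: the shaping term (how far the opponent moved from the arena centre in one step), the terminal bonus and penalty values, and every function and parameter name below are assumptions chosen for illustration. The last helper shows one plausible way to estimate a mean probability of winning from a batch of match outcomes.

```python
import math
from typing import Sequence, Tuple

Vec2 = Tuple[float, float]


def dense_reward(prev_opp: Vec2, curr_opp: Vec2,
                 centre: Vec2 = (0.0, 0.0), scale: float = 1.0) -> float:
    """Hypothetical per-step shaping term.

    Positive when the opponent moved away from the arena centre during
    this step, negative when it moved back towards the centre.
    """
    return scale * (math.dist(curr_opp, centre) - math.dist(prev_opp, centre))


def sparse_reward(team_won: bool, team_lost: bool,
                  win_bonus: float = 100.0, loss_penalty: float = 100.0) -> float:
    """Hypothetical terminal term: non-zero only when the match ends."""
    if team_won:
        return win_bonus
    if team_lost:
        return -loss_penalty
    return 0.0


def hybrid_reward(prev_opp: Vec2, curr_opp: Vec2,
                  team_won: bool = False, team_lost: bool = False) -> float:
    """Sum of the dense shaping term and the sparse terminal term."""
    return dense_reward(prev_opp, curr_opp) + sparse_reward(team_won, team_lost)


def mean_win_probability(outcomes: Sequence[bool]) -> float:
    """Fraction of matches won; an empirical estimate of win probability."""
    if not outcomes:
        raise ValueError("need at least one match outcome")
    return sum(outcomes) / len(outcomes)
```

The dense term gives the learner a signal on every step, while the sparse term ties the return to the actual match result; how the two are weighted in TripleSumo is not stated in the abstract above.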

Code & Models

Repositories

niart/triplesumo
Official
