Multi-Agent Training for Pommerman: Curriculum Learning and   Population-based Self-Play Approach

Nhat-Minh Huynh; Hoang-Giang Cao; I-Chen Wu

arXiv:2407.00662·cs.MA·January 9, 2025

Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach

Nhat-Minh Huynh, Hoang-Giang Cao, I-Chen Wu

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent training system for Pommerman that combines curriculum learning, population-based self-play, adaptive reward adjustment, and Elo-based matchmaking, leading to improved agent performance.

Contribution

It presents a new training framework integrating curriculum learning and self-play with adaptive reward and matchmaking, addressing key challenges in multi-agent reinforcement learning for Pommerman.

Findings

01

Trained agents outperform existing top agents.

02

Adaptive reward mechanism improves learning efficiency.

03

Elo-based matchmaking effectively pairs competitive agents.

Abstract

Pommerman is a multi-agent environment that has received considerable attention from researchers in recent years. This environment is an ideal benchmark for multi-agent training, providing a battleground for two teams with communication capabilities among allied agents. Pommerman presents significant challenges for model-free reinforcement learning due to delayed action effects, sparse rewards, and false positives, where opponent players can lose due to their own mistakes. This study introduces a system designed to train multi-agent systems to play Pommerman using a combination of curriculum learning and population-based self-play. We also tackle two challenging problems when deploying the multi-agent training system for competitive games: sparse reward and suitable matchmaking mechanism. Specifically, we propose an adaptive annealing factor based on agents' performance to adjust the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Game Theory and Cooperation

MethodsSoftmax · Attention Is All You Need