SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC
Yue Deng, Yan Yu, Weiyu Ma, Zirui Wang, Wenhui Zhu, Jian Zhao, Yin, Zhang

TL;DR
SMAC-HARD is a new benchmark for multi-agent reinforcement learning in StarCraft that introduces diverse opponent strategies and self-play to better evaluate and improve algorithm robustness against varied adversaries.
Contribution
We propose SMAC-HARD, a novel benchmark with customizable opponent strategies and self-play support, addressing overfitting and evaluation limitations in existing SMAC environments.
Findings
State-of-the-art algorithms struggle with mixed strategy opponents.
Black-box testing reveals difficulty in policy transfer to unseen adversaries.
SMAC-HARD enhances robustness and generalization in MARL algorithms.
Abstract
The availability of challenging simulation environments is pivotal for advancing the field of Multi-Agent Reinforcement Learning (MARL). In cooperative MARL settings, the StarCraft Multi-Agent Challenge (SMAC) has gained prominence as a benchmark for algorithms following centralized training with decentralized execution paradigm. However, with continual advancements in SMAC, many algorithms now exhibit near-optimal performance, complicating the evaluation of their true effectiveness. To alleviate this problem, in this work, we highlight a critical issue: the default opponent policy in these environments lacks sufficient diversity, leading MARL algorithms to overfit and exploit unintended vulnerabilities rather than learning robust strategies. To overcome these limitations, we propose SMAC-HARD, a novel benchmark designed to enhance training robustness and evaluation comprehensiveness.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
