Exploiting Semantic Epsilon Greedy Exploration Strategy in Multi-Agent Reinforcement Learning
Hon Tik Tse, Ho-fung Leung

TL;DR
This paper introduces QMIX(SEG), a novel MARL approach that enhances exploration by semantically grouping actions in a hierarchical epsilon greedy strategy, significantly improving performance on complex benchmarks.
Contribution
It proposes a new semantic epsilon greedy exploration strategy integrated with QMIX, enabling more effective exploration in multi-agent reinforcement learning.
Findings
QMIX(SEG) outperforms standard QMIX on SMAC benchmark.
Semantic grouping improves exploration efficiency.
Approach is competitive with state-of-the-art MARL methods.
Abstract
Multi-agent reinforcement learning (MARL) can model many real world applications. However, many MARL approaches rely on epsilon greedy for exploration, which may discourage visiting advantageous states in hard scenarios. In this paper, we propose a new approach QMIX(SEG) for tackling MARL. It makes use of the value function factorization method QMIX to train per-agent policies and a novel Semantic Epsilon Greedy (SEG) exploration strategy. SEG is a simple extension to the conventional epsilon greedy exploration strategy, yet it is experimentally shown to greatly improve the performance of MARL. We first cluster actions into groups of actions with similar effects and then use the groups in a bi-level epsilon greedy exploration hierarchy for action selection. We argue that SEG facilitates semantic exploration by exploring in the space of groups of actions, which have richer semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Multimodal Machine Learning Applications
MethodsEpsilon Greedy Exploration
