Individual specialization in multi-task environments with multiagent   reinforcement learners

Marco Jerome Gasparrini; Ricard Sol\'e; Mart\'i S\'anchez-Fibla

arXiv:1912.12671·cs.AI·January 1, 2020·1 cites

Individual specialization in multi-task environments with multiagent reinforcement learners

Marco Jerome Gasparrini, Ricard Sol\'e, Mart\'i S\'anchez-Fibla

PDF

Open Access

TL;DR

This paper investigates how multi-agent reinforcement learning agents can develop specialization in multi-task environments, highlighting the limitations of epsilon-greedy exploration and proposing entropy-regularized policy methods for better convergence.

Contribution

It introduces a novel approach using entropy-regularized policy updates for independent learners, improving specialization and convergence in multi-task multi-agent settings.

Findings

01

Epsilon-greedy exploration synchronizes agents, hindering specialization.

02

Entropy-regularized policies lead to smoother convergence.

03

Increased number of agents correlates with higher likelihood of specialization.

Abstract

There is a growing interest in Multi-Agent Reinforcement Learning (MARL) as the first steps towards building general intelligent agents that learn to make low and high-level decisions in non-stationary complex environments in the presence of other agents. Previous results point us towards increased conditions for coordination, efficiency/fairness, and common-pool resource sharing. We further study coordination in multi-task environments where several rewarding tasks can be performed and thus agents don't necessarily need to perform well in all tasks, but under certain conditions may specialize. An observation derived from the study is that epsilon greedy exploration of value-based reinforcement learning methods is not adequate for multi-agent independent learners because the epsilon parameter that controls the probability of selecting a random action synchronizes the agents artificially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Game Theory and Applications

MethodsEpsilon Greedy Exploration