Disentangling Successor Features for Coordination in Multi-agent   Reinforcement Learning

Seung Hyun Kim; Neale Van Stralen; Girish Chowdhary; Huy T. Tran

arXiv:2202.07741·cs.MA·February 17, 2022·1 cites

Disentangling Successor Features for Coordination in Multi-agent Reinforcement Learning

Seung Hyun Kim, Neale Van Stralen, Girish Chowdhary, Huy T. Tran

PDF

Open Access

TL;DR

This paper introduces a method using successor features to improve coordination among decentralized agents in multi-agent reinforcement learning, especially in unstructured tasks with sparse rewards.

Contribution

It proposes a novel approach to disentangle individual agent impacts on the global value function, enabling stable decentralized training.

Findings

01

Improved performance over existing methods.

02

Reduced training time.

03

Effective in unstructured, sparse reward environments.

Abstract

Multi-agent reinforcement learning (MARL) is a promising framework for solving complex tasks with many agents. However, a key challenge in MARL is defining private utility functions that ensure coordination when training decentralized agents. This challenge is especially prevalent in unstructured tasks with sparse rewards and many agents. We show that successor features can help address this challenge by disentangling an individual agent's impact on the global value function from that of all other agents. We use this disentanglement to compactly represent private utilities that support stable training of decentralized agents in unstructured tasks. We implement our approach using a centralized training, decentralized execution architecture and test it in a variety of multi-agent environments. Our results show improved performance and training time relative to existing methods and suggest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning