Actor-Critic Algorithms for Constrained Multi-agent Reinforcement   Learning

Raghuram Bharadwaj Diddigi; Sai Koti Reddy Danda; Prabuchandran K.J.,; Shalabh Bhatnagar

arXiv:1905.02907·cs.MA·July 14, 2020·5 cites

Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning

Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran K.J.,, Shalabh Bhatnagar

PDF

Open Access

TL;DR

This paper introduces nested actor-critic algorithms for multi-agent reinforcement learning with constraints, enabling agents to optimize joint actions while satisfying specified constraints through a Lagrangian relaxation approach.

Contribution

It proposes a novel nested actor-critic framework that jointly learns policies and Lagrange multipliers for constrained multi-agent reinforcement learning.

Findings

01

Algorithms effectively handle constraints in cooperative tasks

02

Proposed methods outperform baseline approaches

03

Demonstrated convergence and robustness in experiments

Abstract

In cooperative stochastic games multiple agents work towards learning joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are often imposed on the actions that can be jointly taken by the agents. In such scenarios the agents aim to learn joint actions to achieve a common goal (minimizing a specified cost function) while meeting the given constraints (specified via certain penalty functions). In this paper, we consider the relaxation of the constrained optimization problem by constructing the Lagrangian of the cost and penalty functions. We propose a nested actor-critic solution approach to solve this relaxed problem. In this approach, an actor-critic scheme is employed to improve the policy for a given Lagrange parameter update on a faster timescale as in the classical actor-critic architecture. A meta…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Game Theory and Cooperation · Traffic control and management