Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning
Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran K.J.,, Shalabh Bhatnagar

TL;DR
This paper introduces nested actor-critic algorithms for multi-agent reinforcement learning with constraints, enabling agents to optimize joint actions while satisfying specified constraints through a Lagrangian relaxation approach.
Contribution
It proposes a novel nested actor-critic framework that jointly learns policies and Lagrange multipliers for constrained multi-agent reinforcement learning.
Findings
Algorithms effectively handle constraints in cooperative tasks
Proposed methods outperform baseline approaches
Demonstrated convergence and robustness in experiments
Abstract
In cooperative stochastic games multiple agents work towards learning joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are often imposed on the actions that can be jointly taken by the agents. In such scenarios the agents aim to learn joint actions to achieve a common goal (minimizing a specified cost function) while meeting the given constraints (specified via certain penalty functions). In this paper, we consider the relaxation of the constrained optimization problem by constructing the Lagrangian of the cost and penalty functions. We propose a nested actor-critic solution approach to solve this relaxed problem. In this approach, an actor-critic scheme is employed to improve the policy for a given Lagrange parameter update on a faster timescale as in the classical actor-critic architecture. A meta…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Game Theory and Cooperation · Traffic control and management
