Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning
P.Parnika, Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda and, Shalabh Bhatnagar

TL;DR
This paper extends the Attention Actor-Critic algorithm to multi-agent constrained reinforcement learning, enabling agents to optimize shared goals while satisfying action constraints through different attention modes, leading to improved performance.
Contribution
It introduces a novel multi-mode attention mechanism within Actor-Critic for constrained multi-agent RL, enhancing the ability to optimize goals and satisfy constraints simultaneously.
Findings
Effective in benchmark environments
Improves action quality under constraints
Demonstrates better convergence behavior
Abstract
In this work, we consider the problem of computing optimal actions for Reinforcement Learning (RL) agents in a co-operative setting, where the objective is to optimize a common goal. However, in many real-life applications, in addition to optimizing the goal, the agents are required to satisfy certain constraints specified on their actions. Under this setting, the objective of the agents is to not only learn the actions that optimize the common objective but also meet the specified constraints. In recent times, the Actor-Critic algorithm with an attention mechanism has been successfully applied to obtain optimal actions for RL agents in multi-agent environments. In this work, we extend this algorithm to the constrained multi-agent RL setting. The idea here is that optimizing the common goal and satisfying the constraints may require different modes of attention. By incorporating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing
