Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment
Kartik Nagpal, Dayi Dong, Jean-Baptiste Bouvier, Negar Mehr

TL;DR
This paper introduces LLM-MCA, a novel method using large language models to improve credit assignment in multi-agent reinforcement learning, achieving superior performance and explainability across various benchmarks.
Contribution
The paper proposes a new LLM-based approach for credit assignment in multi-agent systems, including a reward-critic and explicit task assignment extension, outperforming existing methods.
Findings
Outperforms state-of-the-art on multiple benchmarks
Generates annotated trajectory datasets for analysis
Enhances explainability of multi-agent credit assignment
Abstract
Recent work, spanning from autonomous vehicle coordination to in-space assembly, has shown the importance of learning collaborative behavior for enabling robots to achieve shared goals. A common approach for learning this cooperative behavior is to utilize the centralized-training decentralized-execution paradigm. However, this approach also introduces a new challenge: how do we evaluate the contributions of each agent's actions to the overall success or failure of the team. This credit assignment problem has remained open, and has been extensively studied in the Multi-Agent Reinforcement Learning literature. In fact, humans manually inspecting agent behavior often generate better credit evaluations than existing methods. We combine this observation with recent works which show Large Language Models demonstrate human-level performance at many pattern recognition tasks. Our key idea is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
