Leveraging Large Language Models for Effective and Explainable   Multi-Agent Credit Assignment

Kartik Nagpal; Dayi Dong; Jean-Baptiste Bouvier; Negar Mehr

arXiv:2502.16863·cs.MA·February 25, 2025

Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment

Kartik Nagpal, Dayi Dong, Jean-Baptiste Bouvier, Negar Mehr

PDF

Open Access

TL;DR

This paper introduces LLM-MCA, a novel method using large language models to improve credit assignment in multi-agent reinforcement learning, achieving superior performance and explainability across various benchmarks.

Contribution

The paper proposes a new LLM-based approach for credit assignment in multi-agent systems, including a reward-critic and explicit task assignment extension, outperforming existing methods.

Findings

01

Outperforms state-of-the-art on multiple benchmarks

02

Generates annotated trajectory datasets for analysis

03

Enhances explainability of multi-agent credit assignment

Abstract

Recent work, spanning from autonomous vehicle coordination to in-space assembly, has shown the importance of learning collaborative behavior for enabling robots to achieve shared goals. A common approach for learning this cooperative behavior is to utilize the centralized-training decentralized-execution paradigm. However, this approach also introduces a new challenge: how do we evaluate the contributions of each agent's actions to the overall success or failure of the team. This credit assignment problem has remained open, and has been extensively studied in the Multi-Agent Reinforcement Learning literature. In fact, humans manually inspecting agent behavior often generate better credit evaluations than existing methods. We combine this observation with recent works which show Large Language Models demonstrate human-level performance at many pattern recognition tasks. Our key idea is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling