Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning

Shuai Han; Mehdi Dastani; Shihan Wang

arXiv:2505.08630·cs.LG·May 14, 2025

Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning

Shuai Han, Mehdi Dastani, Shihan Wang

PDF

TL;DR

This paper presents a novel multi-agent reinforcement learning method that improves credit assignment and exploration in sparse-reward environments by calculating the influence scope of agents on states, leading to significant performance gains.

Contribution

The paper introduces the Influence Scope of Agents (ISA) algorithm, which enhances credit assignment and exploration by quantifying agents' influence on state attributes in sparse-reward MARL.

Findings

01

ISA significantly outperforms baseline methods in various scenarios.

02

The method effectively improves credit assignment accuracy.

03

Enhanced exploration leads to better policy learning.

Abstract

Training cooperative agents in sparse-reward scenarios poses significant challenges for multi-agent reinforcement learning (MARL). Without clear feedback on actions at each step in sparse-reward setting, previous methods struggle with precise credit assignment among agents and effective exploration. In this paper, we introduce a novel method to deal with both credit assignment and exploration problems in reward-sparse domains. Accordingly, we propose an algorithm that calculates the Influence Scope of Agents (ISA) on states by taking specific value of the dimensions/attributes of states that can be influenced by individual agents. The mutual dependence between agents' actions and state attributes are then used to calculate the credit assignment and to delimit the exploration space for each individual agent. We then evaluate ISA in a variety of sparse-reward multi-agent scenarios. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.