Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning

Dogan Urgun; Gokhan Gungor

arXiv:2603.24324·cs.LG·April 7, 2026

Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning

Dogan Urgun, Gokhan Gungor

PDF

TL;DR

This paper presents an automated reward design framework using large language models to improve coordination in cooperative multi-agent reinforcement learning, reducing manual effort.

Contribution

It introduces a novel LLM-guided reward synthesis method that generates effective auxiliary rewards for multi-agent systems, evaluated across diverse environments.

Findings

01

Higher task returns and delivery counts achieved

02

Most gains in environments with interaction bottlenecks

03

Stronger interdependence and better signal alignment in synthesized rewards

Abstract

Designing effective auxiliary rewards for cooperative multi-agent systems remains challenging, as misaligned incentives can induce suboptimal coordination, particularly when sparse task rewards provide insufficient grounding for coordinated behavior. This study introduces an automated reward design framework that uses large language models to synthesize executable reward programs from environment instrumentation. The procedure constrains candidate programs within a formal validity envelope and trains policies from scratch using MAPPO under a fixed computational budget. The candidates are then evaluated based on their performance, and selection across generations relies solely on the sparse task returns. The framework is evaluated in four Overcooked-AI layouts characterized by varying levels of corridor congestion, handoff dependencies, and structural asymmetries. The proposed reward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.