Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning
Dogan Urgun, Gokhan Gungor

TL;DR
This paper presents an automated reward design framework using large language models to improve coordination in cooperative multi-agent reinforcement learning, reducing manual effort.
Contribution
It introduces a novel LLM-guided reward synthesis method that generates effective auxiliary rewards for multi-agent systems, evaluated across diverse environments.
Findings
Higher task returns and delivery counts achieved
Most gains in environments with interaction bottlenecks
Stronger interdependence and better signal alignment in synthesized rewards
Abstract
Designing effective auxiliary rewards for cooperative multi-agent systems remains challenging, as misaligned incentives can induce suboptimal coordination, particularly when sparse task rewards provide insufficient grounding for coordinated behavior. This study introduces an automated reward design framework that uses large language models to synthesize executable reward programs from environment instrumentation. The procedure constrains candidate programs within a formal validity envelope and trains policies from scratch using MAPPO under a fixed computational budget. The candidates are then evaluated based on their performance, and selection across generations relies solely on the sparse task returns. The framework is evaluated in four Overcooked-AI layouts characterized by varying levels of corridor congestion, handoff dependencies, and structural asymmetries. The proposed reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
