Embodiment-Induced Coordination Regimes in Tabular Multi-Agent Q-Learning

Muhammad Ahmed Atif; Nehal Naeem Haji; Mohammad Shahid Shaikh; Muhammad Ebad Atif

arXiv:2601.17454·cs.MA·January 27, 2026

Embodiment-Induced Coordination Regimes in Tabular Multi-Agent Q-Learning

Muhammad Ahmed Atif, Nehal Naeem Haji, Mohammad Shahid Shaikh, Muhammad Ebad Atif

PDF

Open Access

TL;DR

This study critically evaluates the effectiveness of centralized versus independent Q-learning in multi-agent systems under embodiment constraints, revealing that centralized methods do not always enhance coordination and can sometimes hinder it.

Contribution

It provides a controlled, tabular analysis showing that embodiment constraints influence the relative performance of centralized and independent multi-agent Q-learning, challenging common assumptions.

Findings

01

Centralized learning often outperforms independent learning but not consistently.

02

Embodiment constraints can turn coordination into a liability.

03

Role asymmetry affects the stability of coordination regimes.

Abstract

Centralized value learning is often assumed to improve coordination and stability in multi-agent reinforcement learning, yet this assumption is rarely tested under controlled conditions. We directly evaluate it in a fully tabular predator-prey gridworld by comparing independent and centralized Q-learning under explicit embodiment constraints on agent speed and stamina. Across multiple kinematic regimes and asymmetric agent roles, centralized learning fails to provide a consistent advantage and is frequently outperformed by fully independent learning, even under full observability and exact value estimation. Moreover, asymmetric centralized-independent configurations induce persistent coordination breakdowns rather than transient learning instability. By eliminating confounding effects from function approximation and representation learning, our tabular analysis isolates coordination…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Game Theory and Cooperation · Embodied and Extended Cognition