MARL-CC: A Mathematical Framework forMulti-Agent Reinforcement Learning in ConnectedAutonomous Vehicles: Addressing Nonlinearity,Partial Observability, and Credit Assignment forOptimal Control

Mazyar Taghavi; Javad Vahidi

arXiv:2511.17653·math.GM·November 25, 2025

MARL-CC: A Mathematical Framework forMulti-Agent Reinforcement Learning in ConnectedAutonomous Vehicles: Addressing Nonlinearity,Partial Observability, and Credit Assignment forOptimal Control

Mazyar Taghavi, Javad Vahidi

PDF

Open Access

TL;DR

This paper introduces MARL-CC, a comprehensive mathematical framework for multi-agent reinforcement learning in connected autonomous vehicles, addressing key challenges like nonlinearity, partial observability, and credit assignment to improve stability and cooperation.

Contribution

The paper presents MARL-CC, a novel unified framework integrating control theory, Bayesian inference, and credit assignment, with proven convergence and stability guarantees for multi-agent systems.

Findings

01

Up to 40% faster convergence compared to baselines

02

Enhanced cooperative efficiency in simulations and real-world tests

03

Robustness to stochastic disturbances and communication delays

Abstract

Multi-Agent Reinforcement Learning (MARL) has emerged as a powerfulparadigm for cooperative decision-making in connected autonomous vehicles(CAVs); however, existing approaches often fail to guarantee stability, optimality,and interpretability in systems characterized by nonlinear dynamics,partial observability, and complex inter-agent coupling. This study addressesthese foundational challenges by introducing MARL-CC, a unified MathematicalFramework for Multi-Agent Reinforcement Learning with Control Coordination.The proposed framework integrates differential geometric control, Bayesian inference,and Shapley-value-based credit assignment within a coherent optimizationarchitecture, ensuring bounded policy updates, decentralized belief estimation,and equitable reward distribution. Theoretical analyses establish convergence andstability guarantees under stochastic disturbances and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Traffic control and management