Non-Exchangeable Mean Field Markov Decision Processes with common noise : from Bellman equation to quantitative propagation of chaos

Samy Mekkaoui; Huy\^en Pham

arXiv:2603.00009·math.OC·March 3, 2026

Non-Exchangeable Mean Field Markov Decision Processes with common noise : from Bellman equation to quantitative propagation of chaos

Samy Mekkaoui, Huy\^en Pham

PDF

Open Access

TL;DR

This paper develops a new framework for infinite-horizon Markov Decision Processes involving heterogeneous agents with common noise, establishing Bellman equations and quantitative propagation of chaos without assuming exchangeability.

Contribution

It introduces the CNEMF-MDP framework, showing equivalence of formulations, characterizing the value function, and providing finite-population bounds for near-optimal policies.

Findings

01

Established the equivalence between strong and label-state formulations.

02

Derived sharp finite-population bounds for propagation of chaos.

03

Constructed near-optimal policies from the limiting MDP.

Abstract

We study infinite-horizon Markov Decision Processes (MDPs) with a continuum of heterogeneous agents interacting through a common noise, without assuming exchangeability. We introduce the framework of Conditional Non-Exchangeable Mean Field MDPs (CNEMF-MDPs) in both a strong formulation and a label-state formulation. We establish the equivalence between these two formulations by showing that the control problem can be lifted to a standard MDP defined on the Wasserstein space of probability measures over the product of the label and state spaces. Here, the label space represents agent heterogeneity, the state space is the individual state space, and a fixed distribution specifies the population of agent labels. Within this framework, we characterize the value function as the unique fixed point of an appropriate Bellman operator acting on this Wasserstein space. Our second contribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Stochastic processes and financial applications