Discrete-Time Mean Field Control with Environment States
Kai Cui, Anam Tahir, Mark Sinzger, Heinz Koeppl

TL;DR
This paper develops a theoretical framework for discrete-time mean field control with environment states, proving approximate optimality and applying deep reinforcement learning for solutions, with results validated against multi-agent RL methods.
Contribution
It introduces a rigorous theoretical analysis of mean field control with environment states and demonstrates the effectiveness of deep RL methods for solving such problems.
Findings
Approximate optimality established as number of agents increases.
Deep RL methods effectively solve the mean field control problem.
Learned policies converge to mean field performance with many agents.
Abstract
Multi-agent reinforcement learning methods have shown remarkable potential in solving complex multi-agent problems but mostly lack theoretical guarantees. Recently, mean field control and mean field games have been established as a tractable solution for large-scale multi-agent problems with many agents. In this work, driven by a motivating scheduling problem, we consider a discrete-time mean field control model with common environment states. We rigorously establish approximate optimality as the number of agents grows in the finite agent case and find that a dynamic programming principle holds, resulting in the existence of an optimal stationary policy. As exact solutions are difficult in general due to the resulting continuous action space of the limiting mean field Markov decision process, we apply established deep reinforcement learning methods to solve the associated mean field…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
