Kernel Based Maximum Entropy Inverse Reinforcement Learning for Mean-Field Games

Berkay Anahtarci; Can Deha Kariksiz; Naci Saldi

arXiv:2507.14529·cs.LG·March 6, 2026

Kernel Based Maximum Entropy Inverse Reinforcement Learning for Mean-Field Games

Berkay Anahtarci, Can Deha Kariksiz, Naci Saldi

PDF

Open Access

TL;DR

This paper introduces a kernel-based maximum entropy inverse reinforcement learning method for mean-field games, enabling the inference of complex reward functions from expert data and demonstrating significant improvements over linear models.

Contribution

It develops a novel RKHS-based IRL framework for mean-field games, providing theoretical guarantees and extending to finite-horizon non-stationary scenarios.

Findings

01

Kernel-based IRL reduces policy recovery error by over an order of magnitude.

02

The framework is validated on a mean-field traffic routing game.

03

The method extends to finite-horizon non-stationary settings with convergence guarantees.

Abstract

We consider the maximum causal entropy inverse reinforcement learning (IRL) problem for infinite-horizon stationary mean-field games (MFG), in which we model the unknown reward function within a reproducing kernel Hilbert space (RKHS). This allows the inference of rich and potentially nonlinear reward structures directly from expert demonstrations, in contrast to most existing approaches for MFGs that typically restrict the reward to a linear combination of a fixed finite set of basis functions and rely on finite-horizon formulations. We introduce a Lagrangian relaxation that enables us to reformulate the problem as an unconstrained log-likelihood maximization and obtain a solution via a gradient ascent algorithm. To establish the theoretical consistency of the algorithm, we prove the smoothness of the log-likelihood objective through the Fr\'echet differentiability of the related soft…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Game Theory and Applications