$N$-player games and mean-field games with smooth dependence on past absorptions

Luciano Campi; Maddalena Ghio; Giulia Livieri

arXiv:1902.02670·math.PR·November 5, 2021


TL;DR

This paper advances the theory of mean-field games with absorption by allowing more general state dynamics and costs, including infinite-dimensional dependence and linear growth, and proves existence, uniqueness, and approximate Nash equilibria.

Contribution

It extends mean-field game models with absorption to more general, possibly infinite-dimensional settings with relaxed boundedness conditions, and establishes fundamental existence and uniqueness results.

Findings

01

Proved existence of solutions in strict and relaxed feedback forms.

02

Established uniqueness under monotonicity conditions.

03

Showed approximate Nash equilibria for large N-player games.


Equations

$$X_{t}^{N,i}=X_{0}^{N,i}$$

$$\mu_{t}^{N}(\cdot)\doteq\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{t}^{N,i}}(\cdot)\,\mathbf{1}_{[0,\tau^{X^{N,i}})}(t).$$

$$J^{N,i}\left(\textbf{{u}}^{N}\right)\doteq\mathbb{E}\Biggl[\int_{0}^{\tau^{N,i}}\bar{f}\left(s,X_{s}^{N,i},\mu_{s}^{N},u^{N,i}\left(s,\textbf{X}^{N}\right)\right)ds+F\left(\tau^{N,i},X^{N,i}_{\tau^{N,i}}\right)\Biggr]$$

$$\mathcal{P}_{1}(E)$$

$$\mathcal{M}_{\leq 1,1}(E)$$

$$W_{1}(\mu,\nu)\doteq\inf_{\pi\in\Pi(\mu,\nu)}\int_{E\times E}d_{E}(x,y)\,d\pi(x,y)=\sup_{f\in\text{Lip}_{1}(E;\mathbb{R})}\int_{E}f(x)\,d(\mu-\nu)(x)$$

$$\bar{b}:[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma\to\mathbb{R}^{d},\qquad\sigma\in\mathbb{R}^{d\times d},$$

$$\bar{f}:[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma\to[0,\infty),\qquad F:[0,T]\times\mathbb{R}^{d}\to[0,\infty).$$

$$b(t,\varphi,\theta,u)$$

$$f(t,\varphi,\theta,u)$$

$$b:[0,T]\times\mathcal{X}\times\mathcal{P}_{1}(\mathcal{X})\times\Gamma\to\mathbb{R}^{d},$$

$$f:[0,T]\times\mathcal{X}\times\mathcal{P}_{1}(\mathcal{X})\times\Gamma\to[0,\infty)$$

$$\int_{\mathbb{R}^{d}}\psi(x)\,g(t,\theta)(dx)\doteq\int_{\mathcal{X}}\psi(\varphi(t))\,\mathbf{1}_{[0,\tau^{\varphi})}(t)\,\theta(d\varphi).$$

$$(t,\varphi,\theta,u)\mapsto(t,\varphi(t),g(t,\theta),u).$$

$$m(\mu)\doteq\int_{\mathbb{R}^{d}}|x|\,\mu(dx)\quad\text{and}\quad m(t;\theta)\doteq\int_{\mathcal{X}}|\varphi(t)|\,\mathbf{1}_{[0,\tau^{\varphi})}(t)\,\theta(d\varphi).$$

$$\left|\bar{b}(t,x,\mu,u)-\bar{b}(t,x^{\prime},\mu,u)\right|\leq L\,|x-x^{\prime}|,\qquad x,x^{\prime}\in\mathbb{R}^{d}$$

$$\left|\bar{b}(t,x,\mu,u)\right|\leq C\left(1+|x|+m(\mu)\right)$$

$$\bar{f}(t,x,\mu,u)$$

$$X_{t}=X_{0}+\int_{0}^{t}\bar{b}\left(s,X_{s},\mu_{s},u(s,X)\right)ds+\sigma W_{t},\qquad t\in[0,T],$$

$$\mathcal{U}_{fb}\doteq\left\{u:[0,T]\times\mathcal{X}\to\Gamma\,:\,u\text{ is progressively measurable}\right\}.$$

$$J^{\mu}\left(u\right)\doteq\mathbb{E}\Biggl[\int_{0}^{\tau}\bar{f}\left(s,X_{s},\mu_{s},u\left(s,X\right)\right)ds+F\left(\tau,X_{\tau}\right)\Biggr]$$

$$V^{\mu}\doteq\inf_{u\in\mathcal{U}_{fb}}J^{\mu}(u).$$

$$\mu_{t}(\cdot)=\mathbb{P}\left(\{X_{t}\in\cdot\}\cap\{\tau^{X}>t\}\right),\qquad t\in[0,T].$$

$$\mathcal{V}\doteq\left\{q\in\mathcal{M}_{f}([0,T]\times\Gamma)\,:\,q(dt,d\gamma)=dt\,q_{t}(d\gamma),\ t\mapsto q_{t}\in\mathcal{P}(\Gamma)\text{ Borel measurable}\right\}$$

$$X_{t}$$

$$J^{\mu}(\lambda)$$

$$\mu_{t}(\cdot)=\mathbb{Q}\left(\{X_{t}\in\cdot\}\cap\{\tau^{X}>t\}\right),\qquad t\in[0,T].$$

$$X_{t}=X_{0}+\int_{0}^{t}\bar{b}\left(s,X_{s},\mu_{s},u_{s}\right)ds+\sigma W_{t},\qquad t\in[0,T]$$

$$X_{t}$$

$$X_{t}=X_{0}+\int_{0}^{t}\bar{b}^{n}\left(s,X_{s},\mu_{s},u(s,X)\right)ds+\sigma W_{t},\qquad t\in[0,T]$$

$$V^{n,\mu}\doteq\inf_{u\in\mathcal{U}_{fb}}J^{n,\mu}(u).$$

$$h^{n}(t,x,\theta,z,u)$$

$$H^{n}(t,x,\theta,z)$$


Full text

$N$-player games and mean-field games with smooth dependence on past absorptions

Luciano Campi: London School of Economics, Department of Statistics, Columbia House, Houghton Street, London, WC2A 2AE; Università degli Studi di Milano, Dipartimento di Matematica “Federigo Enriques”, Via Saldini 50, 20133, Milano, Italy.

Maddalena Ghio: Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa.

Giulia Livieri: Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa.

Abstract

Mean-field games with absorption form a class of games, introduced in [9], that can be viewed as natural limits of symmetric stochastic differential games with a large number of players who, interacting through a mean-field, leave the game as soon as their private states hit some given boundary.

In this paper, we push the study of such games further, extending their scope along two main directions. First, we allow the state dynamics and the costs to have a very general, possibly infinite-dimensional, dependence on the (non-normalized) empirical sub-probability measure of the survivors’ states. This includes the particularly relevant case where the mean-field interaction among the players takes place through the empirical measure of the survivors together with the fraction of absorbed players over time. Second, we considerably relax the boundedness of coefficients and costs, allowing drift and costs with linear growth in the state variables and hence more realistic dynamics for the players’ private states. We prove the existence of solutions of the MFG in strict as well as relaxed feedback form, and we establish uniqueness of the MFG solutions under monotonicity conditions of Lasry-Lions type. Finally, we show in a setting with finite-dimensional interaction that such solutions induce approximate Nash equilibria for the $N$-player game with vanishing error as $N\to\infty$.

Key words and phrases: Nash equilibrium, mean-field game, absorbing boundary, McKean-Vlasov limit, controlled martingale problem, relaxed control.

2000 AMS subject classifications: 60B10, 60K35, 91A06, 93E20.

1 Introduction
2 Preliminaries and assumptions
3 Existence of solutions of the mean-field game
3.1 Approximating MFGs
3.2 Convergence of the approximating MFGs
3.3 Optimality of the limit points
3.4 Existence of solutions
4 Uniqueness of solutions of the mean-field game
5 Approximate Nash equilibria for the $N$-player game with finite-dimensional interaction
5.1 The setting with finite-dimensional interaction
5.2 The $N$-player approximation theorem
5.3 Propagation of chaos
5.4 Proof of the $N$-player approximation theorem
A Appendix
A.1 Existence and uniqueness of solution of SDEs with sub-linear drift
A.2 Characterization of the set $\mathcal{Q}$
A.3 Additional convergence results

1 Introduction

Mean-field games (MFGs for short) are, loosely speaking, limits of symmetric stochastic differential games with a large number of players, where each of them interacts with the average behaviour of his/her competitors. They were introduced in the seminal papers by Lasry and Lions [45, 46, 47] and, simultaneously, by Huang et al. [36]. An increasing stream of research has been flourishing since then, producing theoretical results as well as a wide range of applications in many fields such as economics, finance, crowd dynamics and social sciences in general. For an excellent presentation of the theory we refer to the lecture notes of Cardaliaguet [10] and the two-volume monograph by Carmona and Delarue [11].

Motivation. In most of the literature on MFGs, all players stay in the game until the end of the period, while in many applications, especially in economics and finance, it is natural to have a mechanism deciding when some player has to leave. Such a mechanism can be modelled by introducing an absorbing boundary for the state space as in Campi and Fischer [9], which is the starting point of our study (other related references will be discussed later in detail). Therein, existence of solutions of the MFG and construction of approximate Nash equilibria for the $N$ -player games were provided under some boundedness assumptions on the coefficients and without including the effect of past absorption on the survivors’ behaviour. The present paper continues the investigation of this kind of games, with the following main extensions.

(i)

We recast MFGs with absorption in a more general setting, more common in the MFG literature, where the dependence of the dynamics and costs on the empirical measure is infinite-dimensional.

(ii)

We introduce a direct dependence on past absorptions in the drift of the Stochastic Differential Equations (SDEs) describing the evolution of the players’ states by letting the initial distribution of players lose mass over time. Such a loss of mass corresponds to the exit of the absorbed players from the game, so that the proportion of the absorbed players has an effect on the future evolution of the survivors. This feature was not present in [9], where the empirical measure of the survivors was re-normalized at each time. Such a dependence on past absorptions is also included in the costs.

(iii)

We allow both the drift and the cost functional of the players to grow at most linearly with the state, hence they are not necessarily bounded unlike in [9]. Moreover, the set of non-absorbing states $\mathcal{O}$ can also be unbounded. Dropping the boundedness of the game data increases the flexibility of our setting, which can include more realistic dynamics from the viewpoint of applications (for more details, see later in this introduction).

To be more precise, the purpose of this paper is to study $N$-player games and related MFGs in the presence of an absorbing set (i.e. a player is eliminated from the game once his/her private state leaves a given open set $\mathcal{O}\subset\mathbb{R}^{d}$), and where the vector of private states $\textbf{X}^{N}\doteq(X^{N,1},\ldots,X^{N,N})$ evolves according to

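$$X_{t}^{N,i}=X_{0}^{N,i}+\int_{0}^{t}\bar{b}\left(s,X_{s}^{N,i},\mu_{s}^{N},u^{N,i}\left(s,\textbf{X}^{N}\right)\right)ds+\sigma W_{t}^{N,i},\qquad t\in[0,T],\tag{1.1}$$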

for $i\in\left\{1,\ldots,N\right\}$, where $\textbf{{u}}^{N}\doteq(u^{N,1},\ldots,u^{N,N})$ is a vector of feedback strategies, $W^{N,1},\ldots,W^{N,N}$ are independent $d$-dimensional Wiener processes defined on some filtered probability space, $\sigma$ is the (non-degenerate) diffusion matrix and $\bar{b}$ is a given drift functional. Finally, $\mu^{N}$ is the random flow of empirical sub-probability measures representing the empirical distribution of the survivors:

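$$\mu_{t}^{N}(\cdot)\doteq\frac{1}{N}\sum_{i=1}^{N}\delta_{X_{t}^{N,i}}(\cdot)\,\mathbf{1}_{[0,\tau^{X^{N,i}})}(t).$$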

Each player evaluates a strategy vector $\textbf{{u}}^{N}$ according to his/her expected costs

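$$J^{N,i}\left(\textbf{{u}}^{N}\right)\doteq\mathbb{E}\Biggl[\int_{0}^{\tau^{N,i}}\bar{f}\left(s,X_{s}^{N,i},\mu_{s}^{N},u^{N,i}\left(s,\textbf{X}^{N}\right)\right)ds+F\left(\tau^{N,i},X^{N,i}_{\tau^{N,i}}\right)\Biggr]\tag{1.2}$$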

over a random time horizon. In Eq.(1.2), $\textbf{X}^{N}$ is the $N$ -player dynamics under $\textbf{{u}}^{N}$ and $\tau^{N,i}\doteq\tau^{X^{N,i}}\wedge T$ . In the present work, we are interested in drifts $\bar{b}$ and costs $\bar{f}$ with sub-linear growth, hence possibly unbounded. Further details on the setting with all the technical assumptions will be given in Section 2.
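To make the role of the non-normalized empirical measure concrete, the following is a minimal simulation sketch and not the authors' construction. It assumes a one-dimensional state, the absorbing domain $\mathcal{O}=(0,\infty)$, a hypothetical mean-reverting drift $\kappa(m(\mu)-x)+u$, a placeholder feedback control equal to zero, and an Euler-Maruyama time grid (so absorption is only detected at grid points). The names `survivor_stats` and `simulate` are illustrative.

```python
import math
import random


def survivor_stats(states, alive):
    """Total mass and first moment of mu^N = (1/N) * sum of Diracs at survivors.

    The sums are divided by the initial number of players N, not by the
    current number of survivors, so the measure loses mass over time.
    """
    n = len(states)
    mass = sum(alive) / n
    first_moment = sum(abs(x) for x, a in zip(states, alive) if a) / n
    return mass, first_moment


def simulate(n_players=200, n_steps=100, horizon=1.0, kappa=1.0, sigma=0.5, seed=0):
    """Return the survivor fraction mu_t^N(O) on the time grid (assumed model)."""
    rng = random.Random(seed)
    dt = horizon / n_steps
    # i.i.d. initial states inside O = (0, infinity); illustrative choice
    states = [0.5 + rng.random() for _ in range(n_players)]
    alive = [True] * n_players
    masses = [survivor_stats(states, alive)[0]]
    for _ in range(n_steps):
        _, moment = survivor_stats(states, alive)
        for i in range(n_players):
            if not alive[i]:
                continue  # absorbed players are frozen and no longer counted
            control = 0.0  # placeholder for a feedback strategy u^{N,i}
            drift = kappa * (moment - states[i]) + control  # hypothetical drift
            noise = sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
            states[i] += drift * dt + noise
            if states[i] <= 0.0:
                alive[i] = False  # the state left O: the player is absorbed
        masses.append(survivor_stats(states, alive)[0])
    return masses
```

Calling `simulate()` returns a non-increasing sequence starting at 1, which is the total mass of $\mu^{N}_{t}$ along the grid; the lost mass is the fraction of absorbed players.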

The dynamics above are also motivated by economic models for corporate finance, systemic risk, and asset allocation. For instance, we can interpret players as firms whose values are represented by the state variables $X^{N,i}$ for $i\in\{1,\ldots,N\}$. Each company is affected by the fraction of both defaulted and non-defaulted firms and takes strategic decisions accordingly. Moreover, sub-linearity of the drift allows us to include a mean-reversion term representing some herding behaviour. A possible application is the pricing of portfolio credit derivatives, where the pricing depends upon the so-called distance-to-default of the assets in the portfolio (Hambly and Ledger [32]). Alternatively, each player can be interpreted as a bank, whose monetary reserve evolves according to the stochastic dynamics in Eq.(1.1), where the drift depends on both the rate of interbank borrowing/lending and on a controlled borrowing/lending rate to a central bank, as in [13]. However, in [13] no absorbing boundary conditions are considered. Such features could be incorporated in the model by introducing absorbing boundary conditions at the default level, similarly to [32]. This would make it possible to study the impact of defaults on systemic risk and on the stability of the financial system described by the game. Last but not least, the proposed set-up allows the private state to be modelled as a Brownian motion with an Ornstein–Uhlenbeck type drift, a model that has been used (for instance) for the notion of flocking to default in the financial literature (Fouque and Sun [26]). However, in the present paper we focus on the mathematical properties of the proposed family of games and we leave the applications for future research.

Main results. The main contributions of the paper can be summarized as follows:

•

We introduce the MFG with smooth dependence on past absorptions, i.e. the limit model corresponding to the above $N$ -player games as $N$ tends to infinity. For a solution of the MFG, the empirical sub-probability measures $(\mu_{t}^{N})_{t\in[0,T]}$ are replaced by flows of sub-probability measures on $\mathbb{R}^{d}$ ; see Definition 2.1.

•

We prove existence of a relaxed feedback MFG solution and, under an additional convexity assumption, we show that there are optimal feedback strategies in strict form; see Theorem 3.1, Proposition 3.4 and Proposition 3.5. Additionally, we show that there exist relaxed and strict feedback solutions that are Markovian up to the exit time; see Proposition 3.6.

•

We prove uniqueness of the MFG solution under standard monotonicity conditions of the Lasry-Lions type formulated for sub-probability measures; see Theorem 4.1.

•

We study approximate Nash equilibria for the $N$ -player game in a setting where the dependence on the measure variable is finite-dimensional. Precisely, we show that if we have a feedback solution of the MFG (either relaxed or strict), we can construct a sequence of approximate Nash equilibria for the corresponding $N$ -player games with a vanishing approximation error as $N\rightarrow\infty$ ; see Theorem 5.1 and Corollary 5.2. It is worth stressing that the construction produces approximate $N$ -player equilibria in feedback strategies (instead of the more common open-loop strategies).

The proof of the existence of feedback solutions of the MFG is inspired by the truncation procedure introduced by [41]. We construct a sequence of approximating MFGs, each one with bounded drift and cost functional, to which we can apply the results of [9]. Then, we prove convergence of the solutions of these approximating MFGs to a solution of the original one. Nonetheless, the procedure in [41] cannot be applied directly to our case mainly due to the history dependency and the discontinuities induced by past absorptions. In particular, a different instance of the mimicking result of [8] applies to our framework.

To establish the uniqueness result we follow standard monotonicity arguments, with some adjustments due to the dependence of the coefficients on a flow of sub-probability measures instead of probability measures. In particular, the uniqueness result relies on an additional (standard) monotonicity assumption on the running cost of the Lasry-Lions type.

The proof of the construction of approximate Nash equilibria for the $N$-player game is based on weak convergence arguments and controlled martingale problems. The use of martingale problems in proving convergence to the McKean-Vlasov limit and propagation of chaos for weakly interacting systems goes back to [27], [54] and [50]. We observe that, whereas standard results prove convergence in law of the empirical measures, in the present paper we follow the approach of [42] to obtain a strong form of propagation of chaos with possibly unbounded and path-dependent drift. We show that the empirical measures converge in a stronger topology (the $\tau$-topology), a result that enables us to take the limit as $N\rightarrow\infty$ without assuming any regularity of the feedback strategies with respect to the state process. In our framework, unlike [9], the continuity of the MFG optimal control for almost every path of the state variable with respect to the Wiener measure is no longer feasible. Indeed, the PDE-based estimates that were used in [9] to get such a regularity are not available anymore due to the possible unboundedness of the drift and the running cost.

Related literature. We have already discussed the paper [9], so here we focus on some other contributions in the literature of mean-field models and games related to our study. First, we cite the works of [29] and [30] where a model based on point processes for correlated defaults timing in a portfolio of firms is introduced and analysed. [29] prove a LLN for the default rate as the number $N$ of firms goes to infinity.

The works [32], [33] and [34] are also motivated by modelling contagion effects. The first work provides a LLN for the empirical measure of a system of finitely many (uncontrolled) diffusions on the half-line, absorbed when they hit zero and correlated through the proportion of absorbed processes. In [33] the model is extended to include a positive feedback mechanism when the particles hit the barrier, thus modelling contagious blow-ups. A mathematical complement to the previous work is provided in [48]. More recently, [34] have proposed a general model for systemic (or macroscopic) events. By working on a set-up similar to [32], they interpret the diffusions as distances-to-default of financial institutions and model the correlation effect through a common source of noise and a form of mean-reversion in the drift. A form of endogenous contagion mechanism is also considered.

On the side of applications to economics, [16] and [17] study oligopolistic models with exhaustible resources formulated as MFGs with absorption at zero. Their model keeps track of the fraction of active players at each time. However, this fraction appears in the objective functions but not in the state variable.

Two more papers are those by [19] and [20], where a particle system approach is used to study the mathematical properties of an integrate-and-fire model from neurology. The particles’ dynamics have some resetting mechanism which activates as soon as some particle hits a given boundary. Besides, we cite two recent papers by Nadtochiy and Shkolnikov [51, 52]. The first one focuses on the cascade effect in an interbank mean-field model with defaults and a contagion effect modelled via a singular interaction through hitting times. The second one investigates the associated mean-field game also including more general dynamics and connection structures.

Finally, we mention a class of MFGs that has been considered quite recently especially in relation to bank run models, that is MFGs of optimal stopping or timing; see, for instance, [5], [7], [12] and [53]. Therein, the agents solve an optimal stopping problem so that the terminal time is directly chosen by them instead of being determined by the evolution of the controlled state as in our setting. In both settings the terminal time is in fact a random time and the state evolution might be affected by the fraction of leavers and the empirical measure of the remainers.

Structure of the paper. In Section 2 we introduce the notation and present both the $N$-player games and the MFGs along with the main assumptions. Section 3 contains the results on the existence of feedback MFG solutions. In Section 4 we prove the uniqueness of MFG solutions under a monotonicity condition of the Lasry-Lions type. In Section 5 we specialize to a finite-dimensional setting and construct approximate Nash equilibria in feedback form for the $N$-player game using the MFG solutions. The technical results used in the paper can be found in Appendix A.

2 Preliminaries and assumptions

In this section, we provide the definitions of the different spaces of trajectories and measures used in the paper along with the corresponding topologies, distances and notions of convergence. In addition, we describe the MFG with smooth dependence on past absorptions and give the definition of solution of the MFG. We conclude the section by introducing the MFGs with truncated coefficients, which will be used in the proof of existence of MFG solutions.

Spaces of trajectories. Let $d\in\mathbb{N}$ . We denote by $\mathcal{O}\subset\mathbb{R}^{d}$ an open subset of $\mathbb{R}^{d}$ representing the space of the players’ private states and by $\mathcal{X}\doteq C([0,T];\mathbb{R}^{d})$ the space of $\mathbb{R}^{d}$ -valued continuous trajectories on the time interval $[0,T]$ , $T<\infty$ . The space $\mathbb{R}^{d}$ is equipped with the standard Euclidean norm, always indicated by $|\cdot|$ , while $\mathcal{X}$ with the sup-norm, denoted by $\|\cdot\|_{\infty}$ , which makes $\mathcal{X}$ separable and complete. We use the notation $\|\cdot\|_{\infty,t}$ whenever the sup-norm is computed over the time interval $[0,t]$ , $t<T$ . Besides, we denote with $\mathcal{X}^{N}\doteq C([0,T];\mathbb{R}^{d\times N})$ the space of $N$ -dimensional vectors of continuous trajectories and identify it with $\mathcal{X}^{\times N}$ .

Spaces of measures. We use flows of probability and sub-probability measures to describe the distribution of players and its time evolution in $\mathcal{O}$ . For $E$ a Polish space, let $\mathcal{M}_{f}(E)$ denote the space of finite Borel measures on $E$ , $\mathcal{P}(E)$ the space of Borel probability measures on $E$ and $\mathcal{M}_{\leq 1}(E)$ the space of Borel sub-probability measures on $E$ , i.e. measures $\mu\in\mathcal{M}_{f}(E)$ such that $\mu(E)\leq 1$ . These spaces are endowed with the weak convergence of measures (Billingsley [6]). We will often write $\mu^{n}\overset{w}{\rightharpoonup}\mu$ to indicate weak convergence of $\mu^{n}$ towards $\mu$ as $n\to\infty$ and $\xi_{n}\overset{\mathcal{L}}{\longrightarrow}\xi$ to denote convergence in law of a sequence of random variables $(\xi_{n})_{n\in\mathbb{N}}$ (defined on possibly different probability spaces) to a limit random variable $\xi$ .

We define by $\Upsilon_{\mathcal{P}}^{T}(E)$ (resp. by $\Upsilon_{\leq 1}^{T}(E)$ ) the spaces of measurable flows of probability (resp. sub-probability) measures on $E$ , i.e. the space of Borel measurable maps $\pi$ (resp. $\mu$ ) from the time interval $[0,T]$ to $\mathcal{P}(E)$ (resp. $\mathcal{M}_{\leq 1}(E)$ ). Wherever possible without confusion, we use $\Upsilon_{\mathcal{P}}^{T}$ (resp. $\Upsilon_{\leq 1}^{T}$ ) when $E=\mathbb{R}^{d}$ . We denote by $\mathcal{P}_{1}(E)$ and by $\mathcal{M}_{\leq 1,1}(E)$ the following subsets of $\mathcal{P}(E)$ and $\mathcal{M}_{\leq 1}(E)$ :

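$$\mathcal{P}_{1}(E)\doteq\Big\{\mu\in\mathcal{P}(E):\int_{E}d_{E}(x,x_{0})\,\mu(dx)<\infty\Big\},\qquad\mathcal{M}_{\leq 1,1}(E)\doteq\Big\{\mu\in\mathcal{M}_{\leq 1}(E):\int_{E}d_{E}(x,x_{0})\,\mu(dx)<\infty\Big\},$$

for some (and hence every) $x_{0}\in E$, where $d_{E}$ denotes the metric on $E$.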

We endow $\mathcal{P}_{1}(E)$ with the 1-Wasserstein distance $W_{1}$

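$$W_{1}(\mu,\nu)\doteq\inf_{\pi\in\Pi(\mu,\nu)}\int_{E\times E}d_{E}(x,y)\,d\pi(x,y)=\sup_{f\in\text{Lip}_{1}(E;\mathbb{R})}\int_{E}f(x)\,d(\mu-\nu)(x),\tag{2.1}$$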

where $\Pi(\mu,\nu)\subset\mathcal{P}_{1}(E\times E)$ represents the set of probability measures with given marginals $\mu$ and $\nu$ , and $\text{Lip}_{1}(E;\mathbb{R})$ the set of Lipschitz functions on $E$ with unitary Lipschitz constant. The second equality in Eq.(2.1) is due to the Kantorovich-Rubinstein Theorem (see, for instance, Theorem 6.1.1 in Ambrosio et al. [2]). Notice that $(\mathcal{P}_{1}(E),W_{1})$ is a separable and complete metric space whenever $(E,d_{E})$ is separable and complete. Finally, let $\Upsilon_{\mathcal{P},1}^{T}(E)$ (resp. $\Upsilon_{\leq 1,1}^{T}(E)$ ) denote the space of measurable flows of probability measures in $\mathcal{P}_{1}(E)$ (resp. in $\mathcal{M}_{\leq 1,1}(E)$ ). Again, wherever possible without confusion, we use $\Upsilon_{\mathcal{P},1}^{T}$ and $\Upsilon_{\leq 1,1}^{T}$ when $E=\mathbb{R}^{d}$ .
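The two expressions for $W_{1}$ can be checked numerically in a simple special case. The sketch below is an illustration under stated assumptions, not part of the original text: it takes $E=\mathbb{R}$ and two empirical measures with the same number of equally weighted atoms, for which the optimal coupling is known to match sorted samples. The helper names `wasserstein1_empirical` and `dual_lower_bound` are hypothetical.

```python
def wasserstein1_empirical(xs, ys):
    """W_1 between two empirical measures on the real line with equally many atoms.

    In one dimension the infimum over couplings is attained by pairing the
    sorted samples, so the primal problem reduces to a sum.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes samples of equal size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)


def dual_lower_bound(xs, ys, f):
    """Value of the dual functional for a test function f.

    If f is 1-Lipschitz, Kantorovich-Rubinstein duality guarantees that this
    value never exceeds W_1; the supremum over all such f equals W_1.
    """
    mean_x = sum(f(x) for x in xs) / len(xs)
    mean_y = sum(f(y) for y in ys) / len(ys)
    return abs(mean_x - mean_y)


xs = [0.0, 1.0, 3.0]
ys = [1.0, 2.0, 5.0]
w1 = wasserstein1_empirical(xs, ys)
bound_identity = dual_lower_bound(xs, ys, lambda x: x)
bound_capped = dual_lower_bound(xs, ys, lambda x: min(x, 2.0))
```

Here the identity test function already attains the supremum because every sorted atom of `ys` lies to the right of the matching atom of `xs`, while the capped function gives a strictly smaller value.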

The canonical space. We will often work on the canonical filtered probability space, denoted by $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P})$ and defined as follows. Set $\Omega\doteq\mathcal{X}$, let $\xi$ be an $\mathbb{R}^{d}$-valued random variable with law $\nu\in\mathcal{P}(\mathbb{R}^{d})$ and let $W$ be a $d$-dimensional Wiener process on $\mathcal{X}$ independent of $\xi$. Define $\mathcal{W}^{\nu}\in\mathcal{P}(\mathcal{X})$ as the law of $\xi+\sigma W$. Set $\mathcal{F}$ as the $\mathcal{W}^{\nu}$-completion of the Borel $\sigma$-algebra $\mathcal{B}(\mathcal{X})$ and $(\mathcal{F}_{t})_{t\in[0,T]}$ as the $\mathcal{W}^{\nu}$-augmentation of the filtration generated by the canonical process $\hat{X}$ on $\mathcal{X}$, i.e. $\hat{X}_{t}(\varphi)\doteq\varphi(t)$ for all $(t,\varphi)\in[0,T]\times\mathcal{X}$. In particular, $(\mathcal{F}_{t})_{t\in[0,T]}$ satisfies the usual conditions. Finally set $\mathbb{P}\doteq\mathcal{W}^{\nu}$ and $W\doteq\sigma^{-1}(\xi-\hat{X})$, which is a Wiener process on $\mathcal{X}$. Where no confusion is possible, we will write $X$ for $\hat{X}$.

Now, let $\mathcal{O}\subset\mathbb{R}^{d}$ be a non-empty open set, the set of non-absorbing states, and let $\Gamma\subset\mathbb{R}^{d}$ be the set of control actions. For each $\varphi\in\mathcal{X}$ we set $\tau^{\varphi}\doteq\inf\{t\in[0,T]:\,\varphi(t)\not\in\mathcal{O}\}$ , with the convention $\inf\emptyset=\infty$ , and $\tau(\varphi)\doteq\tau^{\varphi}\wedge T$ . In order to set up the dynamics of the players’ states, we need to introduce the following functions:

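$$\bar{b}:[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma\to\mathbb{R}^{d},\qquad\sigma\in\mathbb{R}^{d\times d},$$

$$\bar{f}:[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma\to[0,\infty),\qquad F:[0,T]\times\mathbb{R}^{d}\to[0,\infty).$$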

Since we will have to impose some joint continuity property for the functions above, in particular with respect to the $\mu$ -variable, and there is no natural metrizable topology over the set of sub-probability measures $\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})$ , it will be convenient to work with the following reparameterization of a suitable restriction of $\bar{b}$ and $\bar{f}$ :

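$$b(t,\varphi,\theta,u)\doteq\bar{b}\left(t,\varphi(t),g(t,\theta),u\right),\qquad f(t,\varphi,\theta,u)\doteq\bar{f}\left(t,\varphi(t),g(t,\theta),u\right),$$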

where $b$ and $f$ are progressively measurable functionals such that

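$$b:[0,T]\times\mathcal{X}\times\mathcal{P}_{1}(\mathcal{X})\times\Gamma\to\mathbb{R}^{d},\qquad f:[0,T]\times\mathcal{X}\times\mathcal{P}_{1}(\mathcal{X})\times\Gamma\to[0,\infty),$$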

while $g:[0,T]\times\mathcal{P}_{1}(\mathcal{X})\rightarrow\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})$ is defined by its action on the test functions of the 1-Wasserstein convergence, i.e., on the functions $\psi\in C(\mathbb{R}^{d})$ with sub-linear growth, as

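$$\int_{\mathbb{R}^{d}}\psi(x)\,g(t,\theta)(dx)\doteq\int_{\mathcal{X}}\psi(\varphi(t))\,\mathbf{1}_{[0,\tau^{\varphi})}(t)\,\theta(d\varphi).$$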

In words, the functions $b$ and $f$ above are reparameterizations of the restrictions of $\bar{b}$ and $\bar{f}$, respectively, to the range of the map

$$(t,\varphi,\theta,u)\mapsto\left(t,\varphi(t),g(t,\theta),u\right).$$

Moreover, for each $\mu\in\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})$ and $\theta\in\mathcal{P}_{1}(\mathcal{X})$ we introduce the notation

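$$m(\mu)\doteq\int_{\mathbb{R}^{d}}|x|\,\mu(dx)\quad\text{and}\quad m(t;\theta)\doteq\int_{\mathcal{X}}|\varphi(t)|\,\mathbf{1}_{[0,\tau^{\varphi})}(t)\,\theta(d\varphi).$$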
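The action of the map $g$ on a path measure can be illustrated on discretized data. The following sketch is only an assumption-laden illustration: $\theta$ is taken to be the uniform measure on finitely many paths sampled on a common time grid, the exit time $\tau^{\varphi}$ is approximated by the first grid index at which the path is outside $\mathcal{O}$, and the helper names `exit_index` and `integrate_against_g` are hypothetical. Taking $\psi\equiv 1$ gives the surviving mass, and $\psi(x)=|x|$ gives the quantity $m(t;\theta)$.

```python
def exit_index(path, in_domain):
    """First grid index at which the path is outside O, or None if it never exits."""
    for k, x in enumerate(path):
        if not in_domain(x):
            return k
    return None


def integrate_against_g(psi, paths, k, in_domain):
    """Integral of psi against g(t_k, theta), theta uniform on the given paths.

    A path contributes psi(path[k]) only if it has not exited O strictly
    before or at grid time t_k, mirroring the indicator of [0, tau).
    """
    total = 0.0
    for path in paths:
        j = exit_index(path, in_domain)
        if j is None or k < j:
            total += psi(path[k])
    return total / len(paths)


# Three one-dimensional paths on a grid of three time points, O = (0, infinity)
paths = [[1.0, 2.0, 3.0], [1.0, -1.0, 2.0], [2.0, 1.0, 0.5]]
positive = lambda x: x > 0
mass_at_end = integrate_against_g(lambda x: 1.0, paths, 2, positive)
moment_at_end = integrate_against_g(abs, paths, 2, positive)
```

The second path leaves $\mathcal{O}$ at the middle grid point and is never counted again, even though it later returns to positive values; this is the loss of mass that distinguishes $g(t,\theta)$ from the time-$t$ marginal of $\theta$.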

Now, we collect the necessary assumptions on all initial data in order to state our main results. Some further assumptions will be given later in the paper when necessary.

(H1)

The drift $\bar{b}$ satisfies the following uniform Lipschitz continuity:

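$$\left|\bar{b}(t,x,\mu,u)-\bar{b}(t,x^{\prime},\mu,u)\right|\leq L\,|x-x^{\prime}|,\qquad x,x^{\prime}\in\mathbb{R}^{d},$$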

for any $(t,\mu,u)\in[0,T]\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma$ . Moreover it has sub-linear growth, i.e.

$$\left|\bar{b}(t,x,\mu,u)\right|\leq C\left(1+|x|+m(\mu)\right)$$

for all $(t,x,\mu,u)\in[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma$ and for a positive constant $C>0$ .

(H2)

The running costs $\bar{f}$ and the terminal cost $F$ have sub-linear growth, i.e.

$$\bar{f}(t,x,\mu,u)\leq C\left(1+|x|+m(\mu)\right),\qquad F(t,x)\leq C\left(1+|x|\right),$$

for all $(t,x,\mu,u)\in[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma$ , $(t,x)\in[0,T]\times\mathbb{R}^{d}$ and for a positive constant $C>0$ .

(H3)

$\bar{b}$ and $\bar{f}$ are such that their reparametrizations $b$ and $f$ are jointly continuous at points $(t,\varphi,\theta,u)\in\left[0,T\right]\times\mathcal{X}\times\mathcal{P}_{1}(\mathcal{X})\times\Gamma$ such that $\theta\ll\mathcal{W}^{\nu}$ . Moreover, $F$ is jointly continuous on $[0,T]\times\mathbb{R}^{d}$ .

(H4)

The set $\mathcal{O}$ is open, convex and strictly included in $\mathbb{R}^{d}$ with $\mathcal{C}^{2}$ -boundary, i.e. $\partial\mathcal{O}$ is the graph of a $\mathcal{C}^{2}$ function. Alternatively, $\mathcal{O}=(0,\infty)^{\times d}$ is also allowed.

(H5)

The set $\Gamma\subset\mathbb{R}^{d}$ is compact.

(H6)

The diffusion matrix $\sigma\in\mathbb{R}^{d\times d}$ has full rank.

(H7)

The initial distribution $\nu\in\mathcal{P}(\mathbb{R}^{d})$ has support in $\mathcal{O}$ and satisfies $\int_{\mathcal{O}}\text{e}^{\lambda|x|^{2}}\nu(dx)<\infty$ for some $\lambda>0$ .

(H8)

The initial conditions of the $N$-player game $X^{N,i}_{0}$, $i\in\{1,\ldots,N\}$, are i.i.d. and, together with the initial condition $X_{0}$ of the MFG, are all distributed as $\nu\in\mathcal{P}(\mathbb{R}^{d})$.
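As an illustration of (H1), and not an example taken from the original text, consider a hypothetical mean-reverting drift of Ornstein–Uhlenbeck type with an assumed constant $\kappa>0$:

$$\bar{b}(t,x,\mu,u)\doteq\kappa\Big(\int_{\mathbb{R}^{d}}y\,\mu(dy)-x\Big)+u.$$

Writing $R\doteq\sup_{u\in\Gamma}|u|$, which is finite by the compactness of $\Gamma$ in (H5), and using $\big|\int_{\mathbb{R}^{d}}y\,\mu(dy)\big|\leq\int_{\mathbb{R}^{d}}|y|\,\mu(dy)=m(\mu)$, one can check both requirements directly:

$$\begin{aligned}\left|\bar{b}(t,x,\mu,u)-\bar{b}(t,x^{\prime},\mu,u)\right|&=\kappa\,|x-x^{\prime}|,\\ \left|\bar{b}(t,x,\mu,u)\right|&\leq\kappa\,m(\mu)+\kappa\,|x|+R\leq C\left(1+|x|+m(\mu)\right),\qquad C\doteq\max\{\kappa,R\}.\end{aligned}$$

So this drift is Lipschitz in $x$ with constant $L=\kappa$ and has at most linear growth in $|x|$ and $m(\mu)$, although it is unbounded. The continuity requirement in (H3) would have to be verified separately.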

Before turning to the MFG dynamics, some remarks on the assumptions above are in order.

Remark 2.1.

The growth assumptions in (H1) and (H2) could be further refined. For instance, one could assume sub-linear and sub-polynomial growth of the drift and diffusion matrix with suitable exponents as, e.g., in [41]. Moreover, the running cost $f$ could certainly take real values; however, without loss of generality and given the interpretation as a cost term, we have assumed $f\geq 0$ .

Remark 2.2.

The continuity properties in (H3) are crucial in the passage to the limit performed in Proposition 3.2. Since the laws of the processes that we consider are absolutely continuous with respect to the Wiener measure $\mathcal{W}^{\nu}$ (they belong to the set $\mathcal{Q}\subset\mathcal{P}(\mathcal{X})$ of laws of Brownian-driven processes with sub-linear drift that we introduce and characterize in Appendix A, cf. Lemma A.3), it is sufficient to require continuity at points $\theta\ll\mathcal{W}^{\nu}$. The passage to the limit in the measure argument can then be performed by Lemma A.4 together with Lemma A.5.

Remark 2.3.

Admittedly, compactness of $\Gamma$ is a strong assumption, but it plays an important role in obtaining existence and uniqueness of weak solutions of the SDEs for the players’ state dynamics in both the MFG and the $N$ -player games. In particular, it enables a line of argument based on Beneš’ condition, which is ensured by the boundedness of the coefficient in the control variable, and on Girsanov’s theorem (see Remark 2.5 for more precise references), which is one of the main tools of our approach.

Remark 2.4.

The nondegeneracy of $\sigma$ as in (H6) is justified by the counter-example in [9], Section 7, where it was shown that a feedback MFG solution does not necessarily induce a sequence of approximate Nash equilibria with vanishing error. A careful inspection of such a counter-example reveals that it can be easily adapted to our setting since, in that particular context, dividing by the initial number of players $N$ (as in our setting) or renormalizing each time by the current number of players (as in the counter-example) turn out to be equivalent for $N$ large. Finally, even though state dependency of the diffusion matrix can be handled using very similar techniques, we have decided to leave it out and focus on other more interesting aspects of the model. For the same reason we leave aside a possible dependence of $\sigma$ on the control, as it would just increase the level of technicality of the proofs due to the use of martingale measures (see [41]).

The mean-field dynamics. Given a flow of sub-probability measures $\mu\in\Upsilon^{T}_{\leq 1,1}$ and a feedback progressively measurable control $u:\left[0,T\right]\times\mathcal{X}\rightarrow\Gamma$ , the representative player’s state evolves according to the equation

[TABLE]

where $X$ is a $d$ -dimensional stochastic process starting at $X_{0}\overset{d}{\sim}\nu\in\mathcal{P}(\mathbb{R}^{d})$ and $W$ is a $d$ -dimensional Wiener process on some filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P})$ . Solutions of Eq.(2.3) are understood to be in the weak sense (see Remark 2.5 below).

Let $\mathcal{U}_{fb}$ denote the set of all feedback controls defined as

[TABLE]

The cost associated with a strategy $u\in\mathcal{U}_{fb}$ , a flow of sub-probability measures $\mu\in\Upsilon^{T}_{\leq 1,1}$ and an initial distribution $\nu\in\mathcal{P}(\mathbb{R}^{d})$ is given by (we omit, for the sake of simplicity, the explicit dependence on $\nu$ )

[TABLE]

where $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},W,X)$ is a solution of Eq.(2.3) under $u$ with initial distribution $\nu$ , and $\tau\doteq\tau^{X}\wedge T$ is the random time horizon. Finally we set

[TABLE]

Remark 2.5.

For a given flow of sub-probability measures $\mu$ , thanks to the linear growth of $\bar{b}$ in the state variable $\varphi$ and to the boundedness of the action space $\Gamma$ , both existence and uniqueness in law of a weak solution of Eq.(2.3) are guaranteed by Lemma A.1, and by Proposition 5.3.6, Remark 5.3.8 and Proposition 5.3.10 in [39] (see our Lemma A.2). Precisely, this can be proved by means of Girsanov’s theorem and Beneš’ condition [4].

The notion of solution we consider for the MFG is the following.

Definition 2.1 (Feedback MFG solution).

A feedback solution of the MFG is a pair $(u,\mu)\in\mathcal{U}_{fb}\times\Upsilon_{\leq 1,1}^{T}$ such that:

(i)

Strategy $u$ is optimal for $\mu$ , i.e. $V^{\mu}=J^{\mu}(u)$ .

(ii)

Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,W)$ be a weak solution of Eq.(2.3) with flow of sub-probability measures $\mu$ , strategy $u$ and initial condition $\nu$ . Then

[TABLE]
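As a numerical illustration of condition (ii), one can approximate the flow $\mu$ by Monte Carlo, assuming the consistency condition reads $\mu_{t}(\cdot)=\mathbb{P}(\{X_{t}\in\cdot\}\cap\{\tau^{X}>t\})$ , in line with the definition of $\mu^{\theta}$ used in the proof of Theorem 3.2. The sketch below is not part of the paper: the dimension ( $d=1$ ), the domain $\mathcal{O}=(0,\infty)$ , the drift, the initial law and the Euler discretization are all hypothetical choices, and only the total mass $\mu_{t}(\mathcal{O})=\mathbb{P}(\tau^{X}>t)$ is tracked.

```python
import math
import random

def simulate_survival_mass(n_paths=2000, n_steps=200, T=1.0, sigma=1.0, seed=0):
    """Euler-Maruyama paths of dX = b(t, X) dt + sigma dW on O = (0, inf),
    stopped at the first (discretely monitored) exit from O.
    Returns mass[k], an estimate of mu_{t_k}(O) = P(tau^X > t_k)."""
    rng = random.Random(seed)
    dt = T / n_steps

    def drift(t, x):
        # Hypothetical bounded feedback drift, chosen only for illustration.
        return -math.tanh(x - 1.0)

    alive_counts = [0] * (n_steps + 1)
    for _path in range(n_paths):
        x = rng.uniform(0.5, 1.5)      # hypothetical initial law nu, supported in O
        alive_counts[0] += 1
        for k in range(1, n_steps + 1):
            noise = sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
            x += drift((k - 1) * dt, x) * dt + noise
            if x <= 0.0:               # the path left O: absorbed from now on
                break
            alive_counts[k] += 1
    return [c / n_paths for c in alive_counts]

mass = simulate_survival_mass()
print(mass[0], mass[-1])
```

The estimated mass starts at one and can only decrease in time, which is the reason why $\mu$ is a flow of sub-probability measures rather than of probability measures.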

Relaxed controls. It will be very convenient to use relaxed controls (see [23] for a precise definition), which allow us to view progressively measurable controls with values on a compact set $\Gamma$ as elements of the space of probability measures on $\Gamma$ . The latter space is compact when endowed with the weak convergence of measures. The space $\mathcal{V}$ of relaxed controls is given by

[TABLE]

i.e. it is the set of all finite positive measures on $[0,T]\times\Gamma$ with Lebesgue time marginal. With a slight abuse of notation, we denote with $\hat{\Lambda}$ both the identity map and the canonical process on $\mathcal{V}$ (where no confusion is possible, we drop the hat and write $\Lambda$ in place of $\hat{\Lambda}$ ). Precisely, a single-player relaxed control is a $\mathcal{V}$ -valued random variable $\Lambda$ such that $(\Lambda_{t})_{t\in[0,T]}$ is a progressively measurable $\mathcal{P}(\Gamma)$ -valued stochastic process. We say that $\Lambda$ is a feedback control if there exists a progressively measurable functional $\lambda:[0,T]\times\mathcal{X}\rightarrow\mathcal{P}(\Gamma)$ such that $\Lambda_{t}=\lambda(t,X)$ for all $t\in[0,T]$ , with $X$ denoting the player’s dynamics. Moreover, we say that $\Lambda$ is a strict and feedback control if there exists $u\in\mathcal{U}_{fb}$ such that $\lambda(t,X)=\delta_{u(t,X)}$ for all $t\in[0,T]$ .
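The distinction between strict and relaxed controls can be made concrete on a finite action set. The following sketch is illustrative only: the finite grid standing in for $\Gamma$ , the drift `b` and the helper names are hypothetical. It represents a relaxed control at a fixed time as a probability vector on the actions, and integrates the drift against it, which is the usual way relaxed controls enter the dynamics.

```python
def strict_to_relaxed(u_value, actions):
    """Dirac measure delta_u on a finite action grid, as a dict action -> weight."""
    return {a: (1.0 if a == u_value else 0.0) for a in actions}

def relaxed_drift(b, t, x, lam):
    """Drift integrated against the relaxed control: sum_a b(t, x, a) * lam[a]."""
    return sum(w * b(t, x, a) for a, w in lam.items())

def b(t, x, a):
    # Hypothetical drift, nonlinear in the action a.
    return a * a - x

actions = [-1.0, 0.0, 1.0]                  # finite stand-in for the compact set Gamma
dirac = strict_to_relaxed(1.0, actions)     # strict control embedded as a Dirac mass
mixed = {-1.0: 0.5, 0.0: 0.0, 1.0: 0.5}     # genuinely relaxed control with mean action 0

print(relaxed_drift(b, 0.0, 2.0, dirac))    # coincides with b(0, 2, 1)
print(relaxed_drift(b, 0.0, 2.0, mixed))    # differs from b(0, 2, 0), the drift at the mean action
```

The second value shows that, when the drift is nonlinear in the action, a relaxed control is not equivalent to playing its mean action; this is why an extra convexity assumption is typically needed to pass from relaxed back to strict controls.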

Let $\widetilde{\mathcal{U}}_{fb}$ be the set of relaxed feedback controls for the MFG. We rewrite the dynamics and the cost functional of the MFG (Eq.(2.3) and Eq.(2.4)) using relaxed controls:

[TABLE]

where $t\in[0,T]$ and $\lambda\in\widetilde{\mathcal{U}}_{fb}$ . Moreover, we extend accordingly the notion of feedback solutions of the MFG.

Definition 2.2 (Relaxed feedback MFG solution).

A relaxed feedback solution of the MFG is a pair $(\lambda,\mu)\in\tilde{\mathcal{U}}_{fb}\times\Upsilon_{\leq 1,1}^{T}$ such that:

(i)

$\lambda$ is optimal, i.e. $V^{\mu}=J^{\mu}(\lambda)$ .

(ii)

Let $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,W)$ be a weak solution of Eq.(2.5) with flow of sub-probability measures $\mu$ , control $\lambda$ and initial condition $\nu$ . Then

[TABLE]

Feedback and open-loop controls. Feedback controls induce stochastic open-loop controls, i.e. tuples $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,u,W)$ that are weak solutions of

[TABLE]

where $u$ is a progressively measurable $\Gamma$ -valued stochastic process. Since the class of stochastic open-loop controls contains the controls induced by feedback ones, taking the infimum of $J^{\mu}(\cdot)$ over this larger class could a priori yield a lower value than $V^{\mu}$ . However, thanks to Proposition 2.6 in [23], the two minimization problems have the same value.

A similar argument also holds in the case of feedback relaxed controls, which induce relaxed stochastic open-loop controls, i.e. tuples $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ that are weak solutions of

[TABLE]

where $\Lambda$ is a progressively measurable $\mathcal{P}(\Gamma)$ -valued stochastic process.

In the rest of the paper we will call $\mathbb{U}$ the set of open-loop controls and, for the sake of brevity and where no confusion is possible, denote with $u$ an element of $\mathbb{U}$ implying the whole tuple $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,u,W)$ . Similarly, we will call $\tilde{\mathbb{U}}$ the set of open-loop relaxed controls and denote with $\Lambda$ an element of $\tilde{\mathbb{U}}$ implying the whole tuple $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ .

The extended canonical probability space. When dealing with relaxed controls we will work on the following extension of the canonical probability space $\mathcal{X}$ . Set $\tilde{\Omega}\doteq\mathcal{X}\times\mathcal{V}$ , let $\mathcal{F}$ and $(\mathcal{F}_{t})_{t\in[0,T]}$ be the canonical $\sigma$ -algebra and the canonical filtration on $\mathcal{X}$ , respectively, whereas $\mathcal{G}$ and $(\mathcal{G}_{t})_{t\in[0,T]}$ denote the Borel $\sigma$ -algebra and the filtration generated by the canonical process $\hat{\Lambda}$ on $\mathcal{V}$ , respectively. Finally, we set $\tilde{\mathcal{F}}_{t}\doteq\mathcal{F}_{t}\otimes\mathcal{G}_{t}$ for all $t\in[0,T]$ , and $\tilde{\mathcal{F}}\doteq\mathcal{F}\otimes\mathcal{G}$ .

Approximating MFGs. We conclude this preliminary section by introducing a suitable sequence of approximating MFGs, which is obtained by truncation of the coefficients of the original MFG similarly as in [41]. Such a sequence will be useful in the proof of existence of a MFG solution along the following lines: we will prove existence of feedback MFG solutions of the approximating MFGs in the sequence by extending the existence result of [9]. Then, by letting the truncation threshold go to infinity, we will obtain a solution of the original MFG. This approach relies on two additional assumptions (Assumptions (C1) and (C2) below) that will be introduced later in this part.

Let $(K_{n})_{n\in\mathbb{N}}\subset\mathbb{R}_{+}$ be an increasing sequence such that $K_{n}\nearrow+\infty$ . The $n^{\rm th}$ approximating MFG model, denoted by MFG( $n$ ), is obtained as follows.

$\left(\mathbf{T}_{n}\right)$

$\bar{b}^{n}(x)=\bar{b}(x)$ when $|\bar{b}(x)|\leq K_{n}$ , while it is continuously truncated at level $K_{n}$ , i.e. $|\bar{b}^{n}(x)|=K_{n}$ , otherwise. Similarly for the costs $\bar{f}^{n}$ and $F^{n}$ and for the associated functions $b^{n}$ and $f^{n}$ .
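One concrete way to realize a continuous truncation with the property $|\bar{b}^{n}(x)|=K_{n}$ outside the region $\{|\bar{b}|\leq K_{n}\}$ is the radial projection onto the ball of radius $K_{n}$ . The sketch below assumes this particular choice, which is one of several possible ones and is not necessarily the one used in [41]; the drift `b_bar` is hypothetical.

```python
import math

def truncate(v, K):
    """Return v if |v| <= K, else the radial projection of v onto the sphere
    of radius K, so that the result has Euclidean norm K."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm <= K:
        return list(v)
    return [K * c / norm for c in v]

def truncated_drift(b_bar, K):
    """Wrap a drift function so that its values are truncated at level K."""
    return lambda *args: truncate(b_bar(*args), K)

# Hypothetical drift with linear growth in the state.
b_bar = lambda x: [2.0 * x[0], -x[1]]
b_n = truncated_drift(b_bar, K=5.0)

print(b_n([1.0, 1.0]))   # below the threshold: left unchanged
print(b_n([3.0, 8.0]))   # above the threshold: rescaled to norm 5
```

The map `truncate` is continuous in `v`, so the truncated drift inherits the continuity properties of the original one, while being bounded by the threshold.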

Notice that we do not truncate the possibly unbounded set $\mathcal{O}$ of non-absorbing states. In each MFG( $n$ ) the representative player’s state evolves as in Eq.(2.3) with $\bar{b}$ replaced by $\bar{b}^{n}$ , i.e.

[TABLE]

when the player is using the strict control $u$ , and similarly when he/she is using a relaxed control. Moreover, in the cost functional $\bar{f}$ and $F$ are replaced by their truncated counterpart $\bar{f}^{n}$ and $F^{n}$ . The associated cost functional is denoted by $J^{n,\mu}\left(u\right)$ or $J^{n,\mu}\left(\lambda\right)$ depending on whether the player is implementing a strict strategy $u$ or a relaxed one $\lambda$ . The optimal values are defined, accordingly, by

[TABLE]

The definitions of strict and relaxed MFG solutions given above for the (un-truncated) MFG can clearly be applied to the approximating MFG( $n$ )s with the obvious modifications. We associate to the MFG( $n$ )s the following Hamiltonians:

[TABLE]

and the set of minimizers

[TABLE]

for $(t,x,\theta,z)\in[0,T]\times\mathbb{R}^{d}\times\mathcal{P}_{1}(\mathcal{X})\times\mathbb{R}^{d}$ . In the next section on existence of MFG solutions we will rely on the following additional convexity assumptions:

(C1)

For each $n\in\mathbb{N}$ , $A^{n}(t,x,\theta,z)$ is convex for all $(t,x,\theta,z)\in[0,T]\times\mathbb{R}^{d}\times\mathcal{P}_{1}(\mathcal{X})\times\mathbb{R}^{d}$ .

(C2)

The running cost $f$ is convex in the control variable $u\in\Gamma$ .

Remark 2.6.

Assumption (C1) is common in control theory and it is crucial in order to apply fixed point theorems. In our case it is satisfied if, for instance, the running cost $f$ is bounded and convex in the control variable $u\in\Gamma$ . Indeed in this case, due to the flexibility in the choice of the truncation thresholds, choosing $K_{n}\geq\|f\|_{\infty}$ for all $n\in\mathbb{N}$ we have $f^{n}=f$ for all $n\in\mathbb{N}$ . Then convexity is preserved by adding any sub-linear term. Finally, we observe that Assumption (C2) will be used in Section 3.4 for obtaining the existence of strict MFG solutions.
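To see why the set of minimizers is treated as a set-valued object, consider the following toy computation. It assumes, purely for illustration, a one-dimensional Hamiltonian of the form $h(u)=f(u)+z\,\sigma^{-1}b(u)$ with $(t,x,\theta)$ frozen; this form, the grid on $\Gamma=[-1,1]$ and the cost functions are hypothetical and are not taken from the displayed definitions.

```python
def argmin_set(h, actions, tol=1e-12):
    """All actions on a finite grid whose value is within tol of the minimum."""
    values = [h(a) for a in actions]
    m = min(values)
    return [a for a, v in zip(actions, values) if v - m <= tol]

def make_hamiltonian(f, b, z, sigma=1.0):
    # Assumed form h(u) = f(u) + z * b(u) / sigma in dimension one.
    return lambda a: f(a) + z * b(a) / sigma

actions = [i / 4.0 for i in range(-4, 5)]   # grid on Gamma = [-1, 1]
b = lambda a: a                             # hypothetical drift, linear in the action

# Convex cost with a flat bottom and z = 0: many minimizers.
flat = make_hamiltonian(lambda a: max(abs(a) - 0.5, 0.0), b, z=0.0)
# Strictly convex cost and z = 1: one minimizer.
strict = make_hamiltonian(lambda a: a * a, b, z=1.0)

print(argmin_set(flat, actions))
print(argmin_set(strict, actions))
```

In the first case the minimizers fill a whole interval (a convex set), so no single-valued optimal feedback is singled out; in the second case the minimizer is unique. Convexity of the minimizer set is the property required by (C1).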

3 Existence of solutions of the mean-field game

Throughout this section Assumptions (H1)-(H8) are in force. Under these and the additional convexity Assumptions (C1) and (C2) we show that both a relaxed and a strict feedback solution of the MFG exist; see Theorem 3.1 below together with Proposition 3.4 and Proposition 3.5. In addition, we guarantee the existence of a feedback solution of the MFG with Markovian feedback strategy up to the exit time; see Proposition 3.6. Our main existence result can be stated as follows.

Theorem 3.1 (Existence of relaxed and strict feedback MFG solutions).

Under Assumptions (H1)-(H8) and (C1), there exists a relaxed feedback MFG solution $(\lambda,\mu)$ . Moreover, under the additional Assumption (C2) , there exists a strict feedback MFG solution $(u,\mu)$ .

To prove Theorem 3.1, we proceed by approximation in the sense that, first, we prove that each MFG( $n$ ) introduced in the previous section has a feedback (strict) solution by extending the results in [9]; see Subsection 3.1. Then, we prove the convergence of such approximating solutions to a feedback (relaxed) solution of the original MFG by passing to the limit with the truncation thresholds; see Subsection 3.2.

Before proceeding, we ensure the well-posedness of the game in the sense that we show that the private state $X$ of the representative agent remains in $\mathcal{O}$ up to time $T$ with some positive probability. This is the content of the following lemma.

Lemma 3.1.

Grant Assumptions (H1)-(H8). Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,W)$ be a weak solution of Eq.(2.3). Then $\mathbb{P}(\tau^{X}>t)>0$ for all $t\in[0,T]$ .

Proof.

Set $b_{t}\doteq\bar{b}(t,X_{t},\mu_{t},u(t,X))$ for $t\in[0,T]$ , and define $Z\doteq(Z_{t})_{t\in[0,T]}$ as

[TABLE]

where $\mathcal{E}_{t}(\cdot)$ denotes the Doléans-Dade stochastic exponential. By Lemma A.1, $Z$ is a true martingale. Define $\mathbb{Q}$ by $\frac{d\mathbb{Q}}{d\mathbb{P}}\doteq Z_{T}$ . By Girsanov’s theorem $\widetilde{W}_{t}\doteq W_{t}+\int_{0}^{t}\sigma^{-1}b_{s}ds$ , $t\in[0,T]$ , is a $\mathbb{Q}$ -Wiener process, and under $\mathbb{Q}$ the process $X$ has law $\mathcal{W}^{\nu}$ . As a consequence of the law of the iterated logarithm, any Wiener process remains in an open set, hence in $\mathcal{O}\subset\mathbb{R}^{d}$ , for a finite time with strictly positive probability. Therefore $\mathbb{Q}(\tau^{X}>T)>0$ and, since $\mathbb{P}$ and $\mathbb{Q}$ are equivalent, $\mathbb{P}(\tau^{X}>T)>0$ . ∎
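The change of measure used in this proof can be checked numerically in the simplest possible case. The sketch below assumes $d=1$ , $\sigma=1$ , $X_{0}=0$ and a constant drift $c$ (a hypothetical special case, far simpler than the drift of Eq.(2.3)), and takes $Z_{T}=\exp(-\int_{0}^{T}c\,dW_{s}-\frac{1}{2}\int_{0}^{T}c^{2}ds)$ , consistently with $\widetilde{W}=W+\int\sigma^{-1}b\,ds$ being a $\mathbb{Q}$ -Wiener process.

```python
import math
import random

def girsanov_check(c=0.5, T=1.0, n_steps=50, n_paths=20000, seed=1):
    """Monte Carlo estimates of E_P[Z_T] and E_P[Z_T * X_T] for dX = c dt + dW,
    X_0 = 0, where Z_T is the stochastic exponential of -int c dW."""
    rng = random.Random(seed)
    dt = T / n_steps
    sum_z = 0.0
    sum_zx = 0.0
    for _path in range(n_paths):
        x = 0.0
        log_z = 0.0
        for _step in range(n_steps):
            dw = math.sqrt(dt) * rng.gauss(0.0, 1.0)
            x += c * dt + dw
            log_z += -c * dw - 0.5 * c * c * dt
        z = math.exp(log_z)
        sum_z += z
        sum_zx += z * x
    return sum_z / n_paths, sum_zx / n_paths

mean_z, mean_zx = girsanov_check()
print(mean_z)    # close to 1, as expected for a true martingale started at 1
print(mean_zx)   # close to 0, the mean of X_T when X is a Brownian motion under Q
```

Both estimates carry Monte Carlo error, so only approximate agreement with the values 1 and 0 should be expected.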

3.1 Approximating MFGs

In this subsection we prove existence of solutions of the approximating MFG( $n$ )s.

Theorem 3.2 (Existence of solutions of MFG( $n$ )).

Let $n\in\mathbb{N}$ . Under Assumptions (H1)-(H8) and (C1) there exists a feedback solution $(u^{n},\mu^{n})$ of MFG( $n$ ).

Proof.

The proof follows similar steps to those in Section 6 of [9]: we only sketch here the main steps. The main difference with [9] is that, due to Assumption (C1), we have to deal with set-valued maps, hence to apply a version of Kakutani’s fixed point theorem instead of Brouwer’s. We use the version proposed by [14], Proposition 7.4, which is in turn based on the results of [15]. Other adjustments are due to the fact that $\mu$ is a flow of sub-probability measures (instead of probability measures) and that $\mathcal{O}$ can be unbounded.

Fix $n\in\mathbb{N}$ . The proof is based on the construction of a suitable map $\Psi:\mathcal{P}(\mathcal{X})\times\mathbb{U}\rightarrow\mathcal{P}(\mathcal{X})$ on an appropriate compact and convex subset of $\mathcal{P}(\mathcal{X})$ , where $\mathbb{U}$ is the space of progressively measurable $\Gamma$ -valued stochastic processes. The fixed points of $\Psi$ will provide MFG( $n$ ) solutions. More in detail, define $\mathcal{Q}_{\nu,K}$ as the set of laws $\theta\in\mathcal{P}(\mathcal{X})$ of any process of the type

[TABLE]

defined on some filtered probability space with a Wiener process $W$ , $\xi\overset{d}{\sim}\nu$ , drift $(b_{t})_{t\in[0,T]}$ adapted and bounded by $K>0$ . Let us consider

[TABLE]

where $X$ is the canonical process on $\mathcal{X}$ and the probability measure $\mathbb{P}^{\theta,u}$ is defined as follows. Let $(\theta,u)\in\mathcal{Q}_{\nu,K_{n}}\times\mathbb{U}$ and let $\mu^{\theta}\in\Upsilon_{\leq 1}^{T}$ be defined as $\mu_{t}^{\theta}(\cdot)\doteq\theta(\{X_{t}\in\cdot\}\cap\{\tau^{X}>t\})$ for all $t\in[0,T]$ . Let $(\Omega,\mathcal{F}^{u},(\mathcal{F}^{u}_{t})_{t\in[0,T]},\mathbb{P}^{\theta,u},X,W^{u})$ be the weak solution of

[TABLE]

on the canonical space $(\Omega\doteq\mathcal{X},\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P})$ . Moreover, for $\theta\in\mathcal{Q}_{\nu,K_{n}}$ we call $u^{\theta}$ an optimal control for the cost

[TABLE]

Such optimal controls $u^{\theta}$ can be constructed by standard BSDE techniques as in [9], Section 6.1, by means of [18], Theorem 3.4, due to the random terminal times. Under Assumption (C1) optimal controls $u^{\theta}$ are in general not unique. Indeed

[TABLE]

provides an entire set of optimal controls, where $Z^{\theta}$ is part of the solution of the associated adjoint BSDE and $\mathcal{L}_{T}$ denotes the Lebesgue measure on $[0,T]$ . Moreover, by measurable selection there exists a measurable function $\hat{u}^{n,\theta}:[0,T]\times\mathbb{R}^{d}\times\mathcal{Q}_{\nu,K_{n}}\times\mathbb{R}^{d}\rightarrow\Gamma$ such that

[TABLE]

Additionally, $\hat{u}^{n,\theta}(t,X_{t},\theta,Z^{\theta}_{t})$ , for $t\in[0,T]$ , is a progressively measurable control process that can be written in feedback form. Indeed, since $Z^{\theta}$ is progressively measurable for the canonical filtration, it can be expressed as $Z^{\theta}_{t}=\zeta^{\theta}(t,X)$ for some progressively measurable functional $\zeta^{\theta}:[0,T]\times\mathcal{X}\rightarrow\mathbb{R}^{d}$ and for any $t\in[0,T]$ .

Now, a fixed point for the map $\Psi$ is a probability measure $\theta\in\mathcal{Q}_{\nu,K_{n}}$ such that $\theta\in\Psi(\theta,A(\theta))$ . Existence is provided by Proposition 7.4 in [14], so to conclude the proof it suffices to check that all the required assumptions are satisfied in our case. The set $\mathcal{Q}_{\nu,K_{n}}\subset\mathcal{P}(\mathcal{X})$ is a (weakly) compact, convex and metrizable subset of $C_{b}^{*}(\mathcal{X})$ , the dual of the space of bounded and continuous functions on $\mathcal{X}$ , which is a locally convex topological vector space with the weak* topology (that induces the weak convergence of measures on $\mathcal{P}(\mathcal{X})$ ). We endow the vector space $\mathbb{U}$ with the norm $\left\|\cdot\right\|_{\mathbb{U}}$ defined as $\left\|u\right\|_{\mathbb{U}}\doteq\mathbb{E}[\int_{0}^{T}|u_{t}|dt]$ . As a consequence of Berge’s maximum theorem [1, Theorem 17.31] and of Assumption (C1) the set-valued map $A^{n}:\mathcal{Q}_{\nu,K_{n}}\rightarrow\mathbb{U}$ is upper hemicontinuous and has non-empty convex and closed values (see the proof of Lemma 7.11 in [14]). Therefore, Proposition 7.4 in [14] applies, yielding the existence of a feedback solution of MFG( $n$ ). ∎

A-priori estimates. Here, we show that the moments up to any order $\alpha\geq 1$ of the state process remain bounded uniformly in $n$ . Such estimates will be very useful when we relax the truncation in the next section.

Lemma 3.2 (A-priori estimates).

Grant Assumptions (H1)-(H8) and (C1). Consider feedback solutions $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ and $(u,\mu)$ of the MFG( $n$ )s and of the MFG, respectively. Let $(\Omega^{n},\mathcal{F}^{n},(\mathcal{F}^{n}_{t})_{t\in[0,T]},\mathbb{P}^{n},X^{n},W^{n})_{n\in\mathbb{N}}$ be a sequence of weak solutions of the SDEs in Eq.(2.8) and $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,W)$ a weak solution of the SDE in Eq.(2.3). Then for any $\alpha\geq 1$

[TABLE]

where $K(\alpha)<\infty$ is a positive constant independent of $n$ .

Proof.

This follows from standard estimates that rely on the drift’s sub-linear growth and on Grönwall’s lemma. ∎

3.2 Convergence of the approximating MFGs

Let $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ be a sequence of feedback solutions of the approximating MFGs introduced in the previous Subsection 3.1, whose existence is guaranteed by Theorem 3.2. In addition, let $(\Omega^{n},\mathcal{F}^{n},(\mathcal{F}^{n}_{t})_{t\in[0,T]},\mathbb{P}^{n},X^{n},W^{n})_{n\in\mathbb{N}}$ be a sequence of weak solutions of the SDEs in Eq.(2.8) associated to $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ . Let $\theta^{n}$ be defined as $\theta^{n}\doteq\mathbb{P}^{n}\circ(X^{n})^{-1}$ for each $n\in\mathbb{N}$ .

To prove the convergence of the approximating MFGs we proceed in the following way. First, we show that there exists a subsequence of $(\theta^{n})_{n\in\mathbb{N}}$ , say $(\theta^{n_{k}})_{n_{k}\in\mathbb{N}}$ , that converges in $\mathcal{P}_{1}(\mathcal{X})$ to some limit $\theta\in\mathcal{P}_{1}(\mathcal{X})$ . To prove this, we interpret $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ as relaxed feedback solutions, $(\lambda^{n},\mu^{n})_{n\in\mathbb{N}}$ . Second, we show that also the sequence of the corresponding extended laws $(\Theta^{n})_{n\in\mathbb{N}}\subset\mathcal{P}(\mathcal{X}\times\mathcal{V})$ converges in $\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ to some limit $\Theta\in\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ . Finally, we characterize the limit points by means of the martingale problem of Stroock and Varadhan (see Stroock and Varadhan [55, 56]).

Lemma 3.3 (Relative compactness).

$(\theta^{n})_{n\in\mathbb{N}}$ is relatively compact in $\mathcal{P}(\mathcal{X})$ .

Proof.

First, we prove tightness by applying Aldous’ criterion (see, e.g., [37], Condition VI.4.4), that is

[TABLE]

for all $r>0$ and where $\tau$ and $\sigma$ are stopping times bounded by $T$ . Indeed, we have

[TABLE]

and

[TABLE]

for some constants $C^{W}_{T},K>0$ independent of $n\in\mathbb{N}$ . Then we conclude by Lemma 3.2. Relative compactness then follows from Prohorov’s Theorem. ∎

Now, let $\theta\in\mathcal{P}(\mathcal{X})$ be a limit point for $(\theta^{n})_{n\in\mathbb{N}}$ and let $(\theta^{n_{k}})_{n_{k}\in\mathbb{N}}$ be a subsequence of $(\theta^{n})_{n\in\mathbb{N}}$ such that $\theta^{n_{k}}\overset{w}{\rightharpoonup}\theta$ as $n_{k}\rightarrow\infty$ . With a slight abuse of notation, in what follows we identify $(\theta^{n_{k}})_{n_{k}\in\mathbb{N}}$ with $(\theta^{n})_{n\in\mathbb{N}}$ . We now show that the latter convergence is actually stronger by proving that $(\theta^{n})_{n\in\mathbb{N}}$ converges to $\theta$ in the 1-Wasserstein distance.

Lemma 3.4 (Convergence in the 1-Wasserstein distance).

Let $(\theta^{n})_{n\in\mathbb{N}}$ be as above. Then $W_{1}(\theta^{n},\theta)\rightarrow 0$ and $\theta\in\mathcal{P}_{1}(\mathcal{X})$ .

Proof.

Notice that by Lemma 3.2 we have $(\theta^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(\mathcal{X})$ . To prove convergence in the 1-Wasserstein distance, we have to show that (see, for instance, Theorem 7.12.ii in Villani [58])

[TABLE]

Set $\alpha,\beta>1$ such that $\frac{1}{\alpha}+\frac{1}{\beta}=1$ . Then, for any $\epsilon>0$ by Young’s and Markov’s inequalities, and by Lemma 3.2 we have

[TABLE]

for some positive constants $K(\alpha)$ and $K$ independent of $n\in\mathbb{N}$ . The conclusion immediately follows thanks to the fact that convergence in the 1-Wasserstein distance preserves the finiteness of the first moment. ∎
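The role of the uniform control on the tails of the first moments can be illustrated by a classical counterexample on the real line, where weak convergence holds but convergence in the 1-Wasserstein distance fails. The sketch below is only an illustration on $\mathbb{R}$ (not on the path space $\mathcal{X}$ ); the measures $\theta^{n}=(1-\frac{1}{n})\delta_{0}+\frac{1}{n}\delta_{n}$ and the helper names are hypothetical. It uses the fact that the 1-Wasserstein distance to $\delta_{0}$ equals the first absolute moment.

```python
def first_moment(measure):
    """Integral of |x| against a discrete measure given as (atom, weight) pairs;
    this equals the 1-Wasserstein distance to the Dirac mass at 0."""
    return sum(w * abs(x) for x, w in measure)

def tail_moment(measure, R):
    """Integral of |x| over the set {|x| > R}."""
    return sum(w * abs(x) for x, w in measure if abs(x) > R)

def escaping_mass(n):
    # theta^n = (1 - 1/n) * delta_0 + (1/n) * delta_n, weakly convergent to delta_0.
    return [(0.0, 1.0 - 1.0 / n), (float(n), 1.0 / n)]

for n in (10, 100, 1000):
    theta_n = escaping_mass(n)
    # The mass away from 0 vanishes, but distance and tail moment do not.
    print(n, theta_n[1][1], first_moment(theta_n), tail_moment(theta_n, R=5.0))
```

Here the tail moments do not vanish uniformly in $n$ , and accordingly the distance to the weak limit stays equal to one. The moment bounds of Lemma 3.2 rule out this kind of behaviour for the sequence $(\theta^{n})_{n\in\mathbb{N}}$ .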

Proposition 3.1 (Absolute continuity of limit measures).

Let $\theta,(\theta^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(\mathcal{X})$ be as in Lemma 3.4. Then $\theta\ll\mathcal{W}^{\nu}$ , i.e. $\theta$ is absolutely continuous with respect to $\mathcal{W}^{\nu}$ .

Proof.

By construction $\theta^{n}\ll\mathcal{W}^{\nu}$ for all $n\in\mathbb{N}$ , hence we have to make sure that absolute continuity is also preserved in the limit. To do so, we apply Theorem X.3.3 in [37]. In particular, we have to verify that all assumptions therein are fulfilled, which in our setting reduce to the following properties:

(i)

The contiguity of the sequence of $\theta^{n}$ with respect to the Wiener measure $\mathcal{W}^{\nu}$ , i.e. for any sequence of measurable sets $B_{n}$ with $\mathcal{W}^{\nu}(B_{n})\to 0$ we have $\theta^{n}(B_{n})\to 0$ as $n\to\infty$ (see, e.g., Definition V.1.1 in Jacod and Shiryaev [37]).

(ii)

The tightness of the sequence of $\mathcal{W}^{\nu}$ -martingales $(M^{n})_{n\in\mathbb{N}}$ , where each $M^{n}=(M^{n}_{t})_{t\in[0,T]}$ is defined as

[TABLE]

In order to check property (i), we first show that the sequence of Radon-Nikodym derivatives $(\frac{d\theta^{n}}{d\mathcal{W}^{\nu}})_{n\in\mathbb{N}}$ is uniformly integrable under $\mathcal{W}^{\nu}$ . This is a consequence of the following bound:

[TABLE]

which follows from Corollary A.1 and from the fact that, by inspection of the proofs of Lemma A.1 and Corollary A.1, all bounds are uniform in $n\in\mathbb{N}$ .

Now, property (i) can be obtained as follows: for all sequences of measurable sets $B_{n}$ with $\mathcal{W}^{\nu}(B_{n})\to 0$ , we have

[TABLE]

by an application of the dominated convergence theorem due to the bound in Eq.(3.1). Hence the sequence of measures $\theta^{n}$ is contiguous to $\mathcal{W}^{\nu}$ .

Property (ii) follows from Aldous’ criterion [37, Condition VI.4.4], that is

[TABLE]

for all $r>0$ and where $\tau$ and $\sigma$ are stopping times bounded by $T$ . As a consequence, we will also have the tightness property for the pair $(X,M^{n})_{n\in\mathbb{N}}$ under the measure $\mathcal{W}^{\nu}$ . By Theorem VI.4.13 in [37] it is sufficient to check the tightness property for the corresponding quadratic variation processes

[TABLE]

First, by Markov’s inequality $\mathcal{W}^{\nu}(|\langle M^{n}\rangle_{\sigma}-\langle M^{n}\rangle_{\tau}|\geq r)\leq\frac{1}{r}\mathbb{E}^{\mathcal{W}^{\nu}}[|\langle M^{n}\rangle_{\sigma}-\langle M^{n}\rangle_{\tau}|]$ . Then, by Young’s inequality for all $p,q>1$ such that $\frac{1}{p}+\frac{1}{q}=1$ we have

[TABLE]

for some positive constants $K(p)$ and $K(q)>0$ independent of $n\in\mathbb{N}$ . Notice that the last inequality is a consequence of Lemma 3.2 and Property (i). Therefore, Aldous’ criterion in Eq.(3.2) is satisfied.

After checking properties (i) and (ii) above, we can at last apply Theorem X.3.3 in [37], yielding that the tightness of $(\mathcal{W}^{\nu}\circ(X,M^{n})^{-1})_{n\in\mathbb{N}}$ implies the tightness of $(\theta^{n}\circ(X,M^{n})^{-1})_{n\in\mathbb{N}}$ . In particular, if $(\mathcal{W}^{\nu}\circ(X,M^{n})^{-1})_{n\in\mathbb{N}}$ weakly converges to some $\Theta^{\prime}$ in $\mathcal{P}(\mathcal{X}\times\mathcal{X})$ then $(\theta^{n}\circ(X,M^{n})^{-1})_{n\in\mathbb{N}}$ weakly converges to some other $\Theta^{\prime\prime}\ll\Theta^{\prime}$ in $\mathcal{P}(\mathcal{X}\times\mathcal{X})$ , and the same holds true for their first marginals on $\mathcal{X}$ . Therefore, we can conclude that $\theta\ll\mathcal{W}^{\nu}$ . ∎

Compactification method. So far we have established the convergence of the laws $(\theta^{n})_{n\in\mathbb{N}}$ to some limit law $\theta$ in the 1-Wasserstein distance. Now, in order to prove the convergence of the approximating feedback solutions $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ to some feedback MFG solution $(u,\mu)$ , we need to show that the sequence of optimal controls $(u^{n})_{n\in\mathbb{N}}$ converges to a control $u$ , which is optimal for the limit game.

To do this, we interpret the sequence of strict feedback solutions $(u^{n},\mu^{n})_{n\in\mathbb{N}}$ as a sequence of relaxed feedback solutions $(\lambda^{n},\mu^{n})_{n\in\mathbb{N}}$ , by defining $\lambda^{n}:[0,T]\times\mathcal{X}\rightarrow\mathcal{P}(\Gamma)$ as $\lambda^{n}(t,\varphi)\doteq\delta_{u^{n}(t,\varphi)}$ for all $(t,\varphi)\in[0,T]\times\mathcal{X}$ and for all $n\in\mathbb{N}$ . Furthermore, we identify each $\lambda^{n}$ with a stochastic relaxed control $\Lambda^{n}$ . We then fix a sequence of associated weak solutions $(\tilde{\Omega}^{n},\tilde{\mathcal{F}}^{n},(\tilde{\mathcal{F}}^{n}_{t})_{t\in[0,T]},\mathbb{Q}^{n},X^{n},W^{n})$ of Eq.(2.5) and set $\Theta^{n}\doteq\mathbb{Q}^{n}\circ(X^{n},\Lambda^{n})^{-1}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ for all $n\in\mathbb{N}$ . Finally, we associate to each MFG( $n$ ) and to the limit MFG a martingale problem (Stroock and Varadhan [55, 56]) and show that the limit points $\Theta\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ of $(\Theta^{n})_{n\in\mathbb{N}}$ solve the limit relaxed martingale problem. We start with the following lemma.

Lemma 3.5 (Tightness in the 1-Wasserstein distance and absolute continuity).

Let $(\Theta^{n})_{n\in\mathbb{N}}$ be as above. Then the following two properties hold:

(i)

$(\Theta^{n})_{n\in\mathbb{N}}$ is tight in $\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ ;

(ii)

Any limit point $\Theta$ of the sequence $(\Theta^{n})_{n\in\mathbb{N}}$ in $\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ satisfies $\Theta\circ X^{-1}\ll\mathcal{W}^{\nu}$ .

Proof.

(i). It follows from Lemma 3.4 and the compactness of $\Gamma$ .

(ii). This is a consequence of Proposition 3.1, the fact that by construction $\theta^{n}=\Theta^{n}\circ X^{-1}$ for all $n\in\mathbb{N}$ , and the fact that weak convergence of the joint laws implies weak convergence of the marginals. ∎

By the previous lemma, we can assume without loss of generality that the original sequence $(\Theta^{n})_{n\in\mathbb{N}}$ converges to some limit measure $\Theta$ in $\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ . In order to characterize the limit point $\Theta$ , we associate to each approximating MFG( $n$ ) and to the limit MFG a (relaxed) martingale problem, henceforth RM( $n$ ) and RM, respectively. Then, we show that $\Theta$ is also a solution of RM. We will use the notation $Dg$ and $D^{2}g$ for the gradient and the Hessian of a smooth function $g:\mathbb{R}^{d}\to\mathbb{R}$ , while $\textrm{Tr}[A]$ denotes the trace of a square matrix $A$ . Notice that in the following definition we have used the reparametrization $b$ of the drift $\bar{b}$ .

Definition 3.1.

The approximating martingale problems (RM( $n$ )). We say that $\widehat{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ is a solution of RM( $n$ ) if for all $g\in\mathcal{C}^{2}_{c}(\mathbb{R}^{d})$ the process

[TABLE]

is a $\widehat{\Theta}$ -martingale, where $\hat{\theta}\doteq\widehat{\Theta}\circ X^{-1}$ and $X$ is the canonical process on $\mathcal{X}$ .

Observe that, by construction, each $\Theta^{n}$ solves RM( $n$ ). In Proposition 3.2 below we will characterize the limit points as solutions of the following (relaxed) martingale problem.

Definition 3.2.

The limit martingale problem (RM). We say that $\widehat{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ is a solution of RM if for all $g\in\mathcal{C}^{2}_{c}(\mathbb{R}^{d})$ the process

[TABLE]

is a $\widehat{\Theta}$ -martingale, where $\hat{\theta}\doteq\widehat{\Theta}\circ X^{-1}$ .

Remark 3.1.

The martingale property in both RM( $n$ ) and in RM is understood to hold on $(\mathcal{X}\times\mathcal{V},\mathcal{B}(\mathcal{X}\times\mathcal{V}))$ with respect to the $\Theta$ -augmentation of the canonical filtration made right continuous by a standard procedure. Nonetheless, to conclude it is sufficient to check that the martingale property holds with respect to the canonical filtration on $\mathcal{X}\times\mathcal{V}$ (see, for instance, Problem 5.4.13 in Karatzas and Shreve [39]).

Now, we can characterize the limit points via the martingale problems.

Proposition 3.2 (Characterization of limit points via martingale problems).

$\Theta$ solves RM as in Definition 3.2.

Proof.

Fix $t_{1},t_{2}\in[0,T]$ , $t_{1}<t_{2}$ , $g\in\mathcal{C}^{2}_{c}(\mathbb{R}^{d})$ and $\psi\in\mathcal{C}_{b}(\mathcal{X}\times\mathcal{V})$ measurable with respect to $\mathcal{B}_{t_{1}}(\mathcal{X}\times\mathcal{V})$ . Define $\Psi,\Psi^{n}:\mathcal{P}(\mathcal{X}\times\mathcal{V})\rightarrow\mathbb{R}$ as

[TABLE]

for $\Theta^{\prime},\Theta\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ and for all $n\in\mathbb{N}$ . Since $\Psi^{n}(\Theta^{n};\Theta^{n})=0$ for all $n\in\mathbb{N}$ , it suffices to prove that $\Psi^{n}(\Theta^{n};\Theta^{n})\rightarrow\Psi(\Theta;\Theta)$ as $n\rightarrow\infty$ .

First, we observe that $\Psi^{n}(\Theta^{n};\Theta^{n})$ and $\Psi(\Theta;\Theta)$ can be written as

[TABLE]

and

[TABLE]

The convergence of the diffusion terms is a straightforward consequence of the weak convergence $\Theta^{n}\overset{w}{\rightharpoonup}\Theta$ and the fact that the map

[TABLE]

is in $C_{b}(\mathcal{X}\times\mathcal{V})$ , leading to

[TABLE]

Hence, we only need to study the convergence of the drift terms. We split the rest of the proof in two steps.

Step 1. We prove that

[TABLE]

Indeed,

[TABLE]

for all $\epsilon>0$ , where $C_{Dg}$ and $C_{\psi}$ are uniform bounds on $Dg$ and $\psi$ , respectively. We applied Young’s inequality with exponents $\alpha,\beta>1$ , $\frac{1}{\alpha}+\frac{1}{\beta}=1$ for the third inequality, while for the last one we used Markov’s inequality with respect to the measure $\pi(ds,du,d\varphi,dq)\doteq q(ds,du)\Theta^{n}(d\varphi,dq)$ on $\mathcal{X}\times\mathcal{V}\times[0,T]\times\Gamma$ :

[TABLE]

The suprema over $n\in\mathbb{N}$ are bounded due to Lemma 3.2. We conclude this step by letting first $n\rightarrow\infty$ (so that $K_{n}\nearrow\infty$ ) then $\epsilon\rightarrow 0$ .

Step 2. We prove that

[TABLE]

To this aim we show that:

[TABLE]

is continuous on $\mathcal{P}_{1}(\mathcal{X})\times\mathcal{X}\times\mathcal{V}$ at points such that $\theta\ll\mathcal{W}^{\nu}$ and that it has sub-linear growth in $(\varphi,q)\in\mathcal{X}\times\mathcal{V}$ so that we can conclude by using the property $W_{1}(\Theta^{n},\Theta)\rightarrow 0$ together with Theorem 7.12.iv in [58]. Since $\psi\in\mathcal{C}(\mathcal{X}\times\mathcal{V})$ , we only need to show the continuity of the second (integral) term. Let $(\theta^{n},\varphi^{n},q^{n},u^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(\mathcal{X})\times\mathcal{X}\times\mathcal{V}\times\Gamma$ converge to some point $(\theta,\varphi,q,u)\in\mathcal{P}_{1}(\mathcal{X})\times\mathcal{X}\times\mathcal{V}\times\Gamma$ where $\theta\ll\mathcal{W}^{\nu}$ . Then

[TABLE]

for all $t\in[t_{1},t_{2}]$ by the continuity assumptions on $b$ and $Dg$ , i.e. $b(t,\cdot)^{\top}Dg(\cdot)$ is jointly continuous for each $t\in[t_{1},t_{2}]$ at points $(\theta,\varphi,q,u)$ with $\theta\ll\mathcal{W}^{\nu}$ . Moreover

[TABLE]

for some constants $C_{Dg},C,K>0$ (this replaces Assumption (2) of Corollary A.5 in [41]). We conclude by means of Corollary A.5 in [41].

∎

We conclude this subsection by characterizing any limit measure $\Theta$ as the joint law of state and (relaxed) control for a weak solution of the limit SDE in Eq.(2.7) with drift $\bar{b}$ . The next corollary is a fairly standard result establishing a well-known connection between solutions of RM and weak solutions of SDEs:

Corollary 3.1 (Representation of limit points).

Let $\Theta$ be a solution of RM, as in Definition 3.2. Then there exists a weak solution $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ of

[TABLE]

such that $\Theta=\mathbb{Q}\circ(X,\Lambda)^{-1}$ , $\theta=\Theta\circ X^{-1}$ and $\mu_{t}=g(t,\theta)$ with $g:[0,T]\times\mathcal{P}_{1}(\mathcal{X})\rightarrow\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})$ as in Eq.(2.2).

Proof.

Arguing as in the proofs of Proposition 5.4.6 and Corollary 5.4.8 in [39] gives the existence of a weak solution $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ of the SDE

[TABLE]

such that $\Theta$ is the law of $(X,\Lambda)$ under $\mathbb{Q}$ and $\theta=\Theta\circ X^{-1}$ . The conclusion is obtained by going back to the original drift $\bar{b}$ , which, we recall, is given by

[TABLE]

and $g(t,\theta)=\mu_{t}$ as in Eq.(2.2). ∎

3.3 Optimality of the limit points

In this subsection, we show that any limit point $\Theta\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ of $(\Theta^{n})_{n\in\mathbb{N}}$ is optimal according to the cost functional of the MFG. In order to do that, we will extend the notion of relaxed MFG solution to controls that are not necessarily in feedback form. In this case we evaluate optimality according to the following cost functional:

[TABLE]

where $\Lambda$ is any relaxed stochastic control and $\tau\doteq\tau^{X}\wedge T$ , subject to the dynamics

[TABLE]

We set $V^{\mu}=\inf_{\Lambda}J^{\mu}(\Lambda)$ , where the minimization is actually performed over the set of relaxed stochastic open-loop controls, i.e. over the tuples $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ that are weak solutions of Eq.(3.4) and where $\Lambda$ is a progressively measurable $\mathcal{P}(\Gamma)$ -valued stochastic process. To simplify the notation, we will just write $\Lambda$ to refer to the whole tuple. Moreover, when working on the canonical space $\mathcal{X}\times\mathcal{V}$ , where the canonical process $(X,\Lambda)$ is completely characterized by its law $\Theta$ , we will simply write $J^{\mu}(\Theta)$ in place of $J^{\mu}(\Lambda)$ .

Definition 3.3 (Relaxed MFG solution).

A relaxed solution of the MFG is a pair $(\Lambda,\mu)$ , where $\Lambda$ is a relaxed stochastic control and $\mu\in\Upsilon_{\leq 1,1}^{T}$ , such that:

(i)

$\Lambda$ is optimal, i.e. $V^{\mu}=J^{\mu}(\Lambda)$ .

(ii)

Let $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},X,\Lambda,W)$ be a weak solution of Eq.(3.4) with flow of sub-probability measures $\mu$ , stochastic control $\Lambda$ and initial condition $\nu$ . Then

[TABLE]

Proposition 3.3 (Existence of relaxed MFG solutions).

Grant Assumptions (H1)-(H8) and (C1). Let $\Theta$ be a limit point of $(\Theta^{n})_{n\in\mathbb{N}}$ in $\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ . Set $\mu\in\Upsilon^{T}_{\leq 1,1}$ as

[TABLE]

Then $(\Theta,\mu)$ is a relaxed MFG solution according to Definition 3.3.

Proof.

By construction we immediately have that $\Lambda$ is a relaxed stochastic control and $\mu\in\Upsilon_{\leq 1,1}^{T}$ . Moreover, property (ii) is a consequence of the fact that $\Theta$ is a solution of RM as in Definition 3.2. To prove property (i), we proceed through the following steps:

(j)

Let $\tilde{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ be a solution of RM. Then there exists a sequence of solutions $(\tilde{\Theta}^{n})_{n\in\mathbb{N}}$ of RM( $n$ ) such that $\lim_{n\rightarrow\infty}J^{n,\mu^{n}}(\tilde{\Theta}^{n})=J^{\mu}(\tilde{\Theta})$ .

(jj)

$\lim_{n\rightarrow\infty}J^{n,\mu^{n}}(\Theta^{n})=J^{\mu}(\Theta)$ .

(jjj)

$J^{\mu}(\Theta)\leq J^{\mu}(\tilde{\Theta})$ for any $\tilde{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ solution of RM.

The proof of (j)-(jjj) largely follows that of Theorem 3.6 in [41]. Therefore, we highlight only the main differences with respect to our setting, which are due to the sub-linear growth of the drift and the cost functional and to the path dependency induced by the exit time from $\mathcal{O}$ .

Proof of (j). Let $\tilde{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ be a solution of RM and let $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\tilde{\Theta},X,\Lambda,W)$ be a weak solution of Eq.(3.4) on the canonical space $\tilde{\Omega}=\mathcal{X}\times\mathcal{V}$ . The existence of this solution is guaranteed by Corollary 3.1. Now fix $\Lambda$ and let $X^{n}$ be a sequence of strong solutions of:

[TABLE]

on the filtered probability space $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\tilde{\Theta})$ . Set $\tilde{\Theta}^{n}\doteq\tilde{\Theta}\circ(X^{n},\Lambda)^{-1}$ for each $n\in\mathbb{N}$ . Notice that $(\tilde{\Theta}^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(\mathcal{X}\times\mathcal{V})$ . Moreover each $\tilde{\Theta}^{n}$ solves RM( $n$ ) as in Definition 3.1. We now show that:

[TABLE]

Regarding the first limit, it is sufficient to note that:

[TABLE]

where we set

[TABLE]

The first term can be handled with Grönwall’s Lemma, whereas the second one can be treated by an argument similar to the first step of the proof of Proposition 3.2. Regarding the second limit in Eq.(3.5), we proceed as follows. First, notice that the first limit in Eq.(3.5) implies convergence in probability, hence in law, of $X^{n}$ to $X$ . Thus, by an argument similar to that of Lemma 3.5, we can prove the convergence in the 1-Wasserstein distance. At this point, the convergence of the costs is a consequence of the convergence in the 1-Wasserstein distance and the sub-linear growth of the running cost (combined with Theorem 7.12.iv in [58]), as in the second step of the proof of Proposition 3.2.

Proof of (jj). This follows from an argument similar to the second part of (j).

Proof of (jjj). Let $\tilde{\Theta}\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ be a solution of RM and let $(\tilde{\Theta}^{n})_{n\in\mathbb{N}}\subset\mathcal{P}(\mathcal{X}\times\mathcal{V})$ be an approximating sequence as in (j). By the optimality of $\Theta^{n}$ we have

[TABLE]

for all $n\in\mathbb{N}$ . The optimality of $\Theta$ follows by letting $n\rightarrow\infty$ on both sides of the inequality above and using the previous properties (j) and (jj). ∎

3.4 Existence of solutions

In this subsection we conclude the proof of Theorem 3.1 by proving the existence of a relaxed feedback MFG solution and, under additional convexity assumptions, the existence of a strict feedback MFG solution. We also prove existence of solutions that are Markovian up to the exit time.

Relaxed feedback MFG solutions. The main mathematical tool here is the mimicking result of [8]. We follow the procedure in [41] but with modifications due to the peculiarities of our model induced mainly by the presence of absorptions. We give more details in the proof below.

Proposition 3.4 (Existence of relaxed feedback MFG solutions).

Grant Assumptions (H1)-(H8) and (C1). Let $(\Theta,\mu)$ be a relaxed MFG solution as in Definition 3.3.

Then there exists another relaxed MFG solution $(\Theta^{\prime},\mu)$ and a progressively measurable functional $\lambda:[0,T]\times\mathcal{X}\rightarrow\mathcal{P}(\Gamma)$ such that $\Theta^{\prime}((\varphi,q)\in\mathcal{X}\times\mathcal{V}:q_{t}=\lambda(t,\varphi))=1$ for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ , i.e. $(\lambda,\mu)$ is a relaxed feedback solution of the MFG as in Definition 2.2.

Proof.

We adapt the proof of Theorem 3.7 in [41] to our setting, by exploiting the mimicking result in Corollary 3.11 of [8] instead of Corollary 3.7 as in [41]. As a consequence, the mimicking process that we get is not Markovian, unlike the one in Lacker [41]. However, it has the same law as the original process and not only the same marginals. This is important in our setting due to the path dependency induced by the exit time $\tau$ .

We start with the construction of $\lambda$ by disintegration. Precisely, define $\eta\in\mathcal{P}([0,T]\times\mathcal{X}\times\Gamma)$ as:

[TABLE]

and disintegrate it as $\eta(dt,d\varphi,du)=\tilde{\eta}(dt,d\varphi)\lambda_{t,\varphi}(du)$ . Then:

[TABLE]

for all $I\in\mathcal{B}([0,T])$ , $B\in\mathcal{B}(\mathcal{X})$ and $G\in\mathcal{B}(\Gamma)$ . By the disintegration theorem, $(t,\varphi)\mapsto\lambda_{t,\varphi}(\cdot)\in\mathcal{P}(\Gamma)$ is Borel-measurable. Now set $\tilde{\mathcal{F}}^{X}_{t}\doteq\sigma(X_{s},s\in[0,t])$ for each $t\in[0,T]$ . We claim that:

[TABLE]

which is measurable and adapted, hence it has a progressively measurable modification $\lambda$ . We show that for any bounded measurable functional $g:[0,T]\times\mathcal{X}\times\Gamma\rightarrow\mathbb{R}$ such that $g(t,\cdot,u)$ is $\tilde{\mathcal{F}}^{X}_{t}$ -measurable for all $t\in[0,T]$ and $u\in\Gamma$

[TABLE]

$\Theta$ -a.s. and for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ . Indeed, for any other bounded measurable functional $h:[0,T]\times\mathcal{X}\rightarrow\mathbb{R}$ such that $h(t,\cdot)$ is $\tilde{\mathcal{F}}^{X}_{t}$ -measurable for all $t\in[0,T]$ , we have

[TABLE]

where the first equality comes from the definition of $\tilde{\eta}$ , the second one is due to the disintegration of $\eta$ and the third one holds by definition of $\eta$ .

Now, let $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\mathbb{Q},W,X,\Lambda)$ be a weak solution of Eq.(3.4) with relaxed control $\Theta=\mathbb{Q}\circ(X,\Lambda)^{-1}$ . By Corollary 3.11 in [8] there exists a weak solution $(\tilde{\Omega}^{\prime},\tilde{\mathcal{F}}^{\prime},(\tilde{\mathcal{F}}^{\prime}_{t})_{t\in[0,T]},\mathbb{Q}^{\prime},W^{\prime},X^{\prime})$ of

[TABLE]

such that $\mathbb{Q}^{\prime}\circ(X^{\prime})^{-1}=\mathbb{Q}\circ X^{-1}$ . Define $\Theta^{\prime}\doteq\mathbb{Q}^{\prime}\circ(X^{\prime},\Lambda^{\prime})^{-1}$ where $\Lambda^{\prime}(dt,du)\doteq dt\lambda_{t,X^{\prime}}(du)$ . Notice that if $\mu^{\prime}$ is the flow of sub-probability measures associated to $\Theta^{\prime}$ then $\mu^{\prime}=\mu$ . Finally, $\Theta^{\prime}$ solves the same relaxed martingale problem as $\Theta$ , and it has the same cost as $\Theta$ as required:

[TABLE]

∎
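The disintegration step in the proof above has a simple finite analogue, sketched here in Python. It assumes, purely for illustration, that the pair (time, path) and the control take finitely many values, so that the kernel $\lambda$ is just the conditional distribution of the control given (time, path). The names `disintegrate`, `eta`, `eta_tilde`, `lam` and the toy numbers are hypothetical and not taken from the paper.

```python
from collections import defaultdict

def disintegrate(eta):
    """Split a joint pmf eta[(s, u)] into a marginal on s and a kernel lam[s][u].

    Here s stands for a (time, path) pair and u for a control value; this is a
    finite toy analogue of eta(dt, dphi, du) = eta_tilde(dt, dphi) lam_{t,phi}(du).
    """
    marginal = defaultdict(float)
    for (s, _u), p in eta.items():
        marginal[s] += p
    kernel = defaultdict(dict)
    for (s, u), p in eta.items():
        if marginal[s] > 0:
            kernel[s][u] = p / marginal[s]
    return dict(marginal), dict(kernel)

# Hypothetical joint law of (state, control).
eta = {("a", 0): 0.1, ("a", 1): 0.3, ("b", 0): 0.6}
eta_tilde, lam = disintegrate(eta)

# Recombining marginal and kernel recovers the joint law.
rebuilt = {(s, u): eta_tilde[s] * q for s, row in lam.items() for u, q in row.items()}
```

In the continuous setting the same factorization is provided by the disintegration theorem, and measurability of the kernel has to be argued separately, as done in the proof.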

Remark 3.2.

We observe that, due to the discontinuity induced by the exit time $\tau$ , it is not possible in general to apply Theorem 3.6 of [8] to $Z_{t}=(X_{t},\mathbb{I}_{[0,\tau)}(t))$ , $t\in[0,T]$ , to obtain a control which is Markovian in $Z$ . Moreover the few mimicking results available in the literature for discontinuous processes hold under very restrictive or hardly verifiable assumptions. Nonetheless, Theorem 3.6 of [8] could still be applied in some particular cases when, for instance, $\mathcal{O}=(0,\infty)$ and $Z_{t}=(X_{t},\inf_{s\in[0,t]}X_{s})$ .

Strict feedback MFG solutions. Under additional convexity assumptions (Filippov [24], Haussmann and Lepeltier [35]), we prove existence of feedback MFG solutions in strict form. Let $(\Theta,\mu)$ be a relaxed MFG solution according to Definition 3.3 and for each $(t,\varphi)\in[0,T]\times\mathcal{X}$ define $K(t,\varphi,\mu)$ as:

[TABLE]

Existence of strict MFG solutions is established under the additional Assumption (C2).

Remark 3.3.

Assumption (C2) is equivalent to requiring that the set $K(t,\varphi,\mu)$ is convex. This assumption is crucial to apply the measurable selection arguments in [35, 22].

Proposition 3.5 (Existence of strict feedback MFG solutions).

Grant Assumptions (H1)-(H8), (C1) and Assumption (C2). Let $(\Theta,\mu)$ be a relaxed MFG solution as in Definition 3.3.

Then there exists another relaxed MFG solution $(\Theta^{\prime},\mu)$ and a progressively measurable functional $u\in\mathcal{U}_{fb}$ such that $\Theta^{\prime}((\varphi,q)\in\mathcal{X}\times\mathcal{V}:q_{t}=\delta_{u(t,\varphi)})=1$ for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ , i.e. $(u,\mu)$ is a strict and feedback solution of the MFG as in Definition 2.1.

Proof.

We follow once more the proof of Theorem 3.7 in [41], highlighting the main differences with respect to our setting. The first part of the proof proceeds as in Proposition 3.4. Since for all $(t,\varphi)\in[0,T]\times\mathcal{X}$ the pair $(\bar{b}(t,\varphi(t),\mu_{t},u),\bar{f}(t,\varphi(t),\mu_{t},u))$ belongs to $K(t,\varphi,\mu)$ for all $u\in\Gamma$ and $K(t,\varphi,\mu)$ is convex, we have

[TABLE]

By applying the measurable selection argument in [35, 22] (with respect to the progressive $\sigma$ -algebra, i.e. the $\sigma$ -algebra generated by progressively measurable processes), we find a progressively measurable functional $u:[0,T]\times\mathcal{X}\rightarrow\Gamma$ such that

[TABLE]

and

[TABLE]

for all $(t,\varphi)\in[0,T]\times\mathcal{X}$ . Define $\Theta^{\prime}\doteq\mathbb{Q}^{\prime}\circ(X^{\prime},\Lambda^{\prime})^{-1}$ where $\mathbb{Q}^{\prime}$ is as in the proof of Proposition 3.4 and $\Lambda^{\prime}(\varphi,q)(dt,du)\doteq dt\delta_{u(t,\varphi)}(du)$ . $\Theta^{\prime}$ solves the same relaxed martingale problem as $\Theta$ . As for the costs, we have

[TABLE]

where the inequality above is due to Eq.(3.8). Given the optimality of $(\Theta,\mu)$ we already have the converse inequality, i.e. $J^{\mu}(\Theta)\leq J^{\mu}(\Theta^{\prime})$ . Hence $J^{\mu}(\Theta)=J^{\mu}(\Theta^{\prime})$ . ∎
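To see why convexity allows one to pass from a relaxed control to a strict one at no extra cost, consider the following toy case. It is an assumption made for illustration and not the paper's setting: a scalar control set $\Gamma=[-1,1]$, a drift that is affine in $u$ and a running cost that is convex in $u$. In this case the barycentre of the relaxed control reproduces the averaged drift exactly and, by Jensen's inequality, does not increase the cost. All function names and numbers below are hypothetical.

```python
def drift(u):
    # Hypothetical drift, affine in the control.
    return 2.0 * u + 1.0

def cost(u):
    # Hypothetical running cost, convex in the control.
    return u * u

def average(fn, atoms, weights):
    """Integrate fn against the discrete relaxed control sum_k weights[k] * delta_{atoms[k]}."""
    return sum(w * fn(u) for u, w in zip(atoms, weights))

atoms = [-1.0, 0.0, 1.0]        # support of the relaxed control in Gamma = [-1, 1]
weights = [0.2, 0.3, 0.5]       # probabilities, summing to one

u_strict = average(lambda u: u, atoms, weights)   # barycentre of the relaxed control
averaged_drift = average(drift, atoms, weights)   # drift seen under the relaxed control
averaged_cost = average(cost, atoms, weights)     # cost paid under the relaxed control
```

In the general setting of the proposition no such explicit formula is available, which is why a measurable selection argument is needed to pick $u(t,\varphi)$.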

We can finally give the proof of Theorem 3.1.

Proof of Theorem 3.1.

Grant Assumptions (H1)-(H8) and (C1). Proposition 3.3 guarantees existence of a relaxed MFG solution $(\Theta,\mu)$ as in Definition 3.3. By Proposition 3.4 there exists another relaxed MFG solution $(\Theta^{\prime},\mu)$ together with a progressively measurable functional $\lambda:[0,T]\times\mathcal{X}\rightarrow\mathcal{P}(\Gamma)$ such that $\Theta^{\prime}((\varphi,q)\in\mathcal{X}\times\mathcal{V}:q_{t}=\lambda(t,\varphi))=1$ for $\mathcal{L}_{T}$ -a.e. $t$ and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ . Then $(\lambda,\mu)$ is a relaxed and feedback solution of the MFG as in Definition 2.2.

Additionally grant Assumption (C2). By Proposition 3.5 there exists another relaxed MFG solution $(\Theta^{\prime},\mu)$ and a progressively measurable functional $u\in\mathcal{U}_{fb}$ such that $\Theta^{\prime}((\varphi,q)\in\mathcal{X}\times\mathcal{V}:q_{t}=\delta_{u(t,\varphi)})=1$ for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ , and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ . Then $(u,\mu)$ is a strict and feedback solution of the MFG as in Definition 2.1. ∎

Markovian MFG solutions. We conclude this part by showing that there exist relaxed and strict feedback solutions that are Markovian up to the exit time.

Proposition 3.6 (Markovian MFG solutions).

Grant Assumptions (H1)-(H8) and (C1). Let $(\Theta,\mu)$ be a relaxed MFG solution as in Definition 3.3. Then there exists another relaxed MFG solution $(\Theta^{\prime},\mu)$ and a function $\lambda:[0,T]\times\mathbb{R}^{d}\rightarrow\mathcal{P}(\Gamma)$ such that

[TABLE]

and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ . Additionally, grant Assumption (C2). Then there exists a function $u:[0,T]\times\mathbb{R}^{d}\rightarrow\Gamma$ such that

[TABLE]

and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ .

Proof.

Let us define the following processes

[TABLE]

for $t\in[0,T]$ . If $X$ satisfies Eq.(3.4) with flow of sub-probability measures $\mu$ and relaxed control $\Lambda$ then the SDE satisfied by $X^{\tau^{X}}$ is (on the same probability space)

[TABLE]

for $t\in[0,T]$ . Notice that for $t\leq\tau^{X}$ the stopped process $X^{\tau^{X}}$ coincides pathwise with the original process $X$ . We now apply the mimicking result in Corollary 3.7 of [8] to the stopped process $Y^{\tau^{X}}$ . To this end, we follow the proof of Theorem 3.7 in [41] and the proofs of Propositions 3.4 and 3.5 in the present paper.

First, we claim that there exists a measurable function $\lambda:[0,T]\times\mathbb{R}^{d+1}\rightarrow\mathcal{P}(\Gamma)$ such that

[TABLE]

Such a function can be constructed by disintegration as follows. Let $\eta\in\mathcal{P}([0,T]\times\mathbb{R}^{d+1}\times\Gamma)$ be given by

[TABLE]

We define $\lambda$ through $\eta(dt,dy,du)\doteq\tilde{\eta}(dt,dy)\lambda_{t,y}(du)$ . By Corollary 3.7 in [8] applied to $\lambda_{t,Y_{t}^{\tau^{X}}}$ there exists a weak solution $(\tilde{\Omega}^{\prime},\tilde{\mathcal{F}}^{\prime},(\tilde{\mathcal{F}}^{\prime}_{t})_{t\in[0,T]},\mathbb{Q}^{\prime},W^{\prime},X^{\prime})$ of

[TABLE]

for $t\in[0,T]$ , where $Y_{t}^{\tau^{X^{\prime}}}\doteq(t\wedge\tau^{X^{\prime}},X^{\prime}_{t})$ and $\mathbb{Q}^{\prime}\circ(t\wedge\tau^{X^{\prime}},X^{\prime}_{t})^{-1}=\mathbb{Q}\circ(t\wedge\tau^{X},X^{\tau^{X}}_{t})^{-1}$ for all $t\in[0,T]$ , i.e. $Y^{\tau^{X^{\prime}}}$ and $Y^{\tau^{X}}$ have the same time marginals. Now set $\tau^{\prime}\doteq\tau^{X^{\prime}}\wedge T$ . Recall that $\Theta=\mathbb{Q}\circ(X,\Lambda)^{-1}$ and define $\Theta^{\prime}\doteq\mathbb{Q}^{\prime}\circ(X^{\prime},\Lambda^{\prime})^{-1}$ where $\Lambda^{\prime}(dt,du)\doteq dt\lambda_{t,Y_{t}^{\tau^{X^{\prime}}}}(du)$ . Equality of the costs can be shown just as in the proof of Proposition 3.4:

[TABLE]

Therefore, $\lambda:[0,T]\times[0,T]\times\mathbb{R}^{d}\rightarrow\mathcal{P}(\Gamma)$ satisfies $\Theta^{\prime}(q\in\mathcal{V}:q_{t}=\lambda(t,t\wedge\tau^{\hat{X}},\hat{X}^{\tau^{\hat{X}}}_{t}))=1$ for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ and $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta)=V^{\mu}$ .

Consider now a weak solution $(\tilde{\Omega}^{\prime\prime},\tilde{\mathcal{F}}^{\prime\prime},(\tilde{\mathcal{F}}^{\prime\prime}_{t})_{t\in[0,T]},\mathbb{Q}^{\prime\prime},W^{\prime\prime},X^{\prime\prime})$ of

[TABLE]

where $Y_{t}^{\tau^{X^{\prime\prime}}}=(t\wedge\tau^{X^{\prime\prime}},X^{\prime\prime}_{t})$ . Set $\Theta^{\prime\prime}\doteq\mathbb{Q}^{\prime\prime}\circ(X^{\prime\prime},\Lambda^{\prime\prime})^{-1}$ where $\Lambda^{\prime\prime}(dt,du)\doteq dt\lambda_{t,Y_{t}^{\tau^{X^{\prime\prime}}}}(du)$ . To avoid confusion between specific solutions, here $(\hat{X},\hat{\Lambda})$ denotes the canonical process on $\mathcal{X}\times\mathcal{V}$ . First, $\Theta^{\prime}$ solves the martingale problem associated to

[TABLE]

as well as the one associated to

[TABLE]

up to time $\tau^{\hat{X}}\wedge T$ , i.e. the martingale property is satisfied by the processes above stopped at time $\tau^{\hat{X}}\wedge T$ . Second, $\Theta^{\prime\prime}$ solves the latter martingale problem up to time $T$ . Then $\Theta^{\prime}$ and $\Theta^{\prime\prime}$ solve the same martingale problem up to time $\tau^{\hat{X}}\wedge T$ . Moreover, we have $\Theta^{\prime\prime}(q\in\mathcal{V}:q_{t}=\lambda(t,t\wedge\tau^{\hat{X}},\hat{X}_{t}))=1$ for $\mathcal{L}_{T}$ -a.e. $t\in[0,T]$ . If we set $\Theta_{t}\doteq\Theta\circ(\hat{X},\hat{\Lambda})_{\cdot\wedge t}^{-1}$ for all $\Theta\in\mathcal{P}(\mathcal{X}\times\mathcal{V})$ and $t\in[0,T]$ , then by uniqueness of the solution of the martingale problem up to time $\tau^{\hat{X}}\wedge T$ we have

[TABLE]

Hence $J^{\mu}(\Theta^{\prime})=J^{\mu}(\Theta^{\prime\prime})$ . Now $\Theta^{\prime\prime}$ satisfies item (ii) of Definition 3.3.

To conclude, notice that the process $Y^{\tau^{X^{\prime\prime}}}_{t}=(t\wedge\tau^{X^{\prime\prime}},X^{\prime\prime}_{t})$ reduces to $(t,X^{\prime\prime}_{t})$ before time $\tau^{X^{\prime\prime}}\wedge T$ . Hence, also $\lambda_{t,Y_{t}^{\tau^{X^{\prime\prime}}}}$ , with a slight abuse of notation, reduces to $\lambda_{t,X^{\prime\prime}_{t}}$ . With the additional Assumption (C2), the second part of this proposition follows from the proof of Proposition 3.5 applied to the stopped process $Y^{\tau^{X}}$ . ∎
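The role of the pair (stopped time, stopped state) in the proof above can be illustrated on a discrete time grid. The following Python sketch assumes a path sampled at finitely many times and a domain given by an indicator function. The helper names `exit_index` and `stopped_pair` and the sample path are hypothetical.

```python
def exit_index(path, in_domain):
    """Index of the first grid point at which the path is outside the domain, or None."""
    for k, x in enumerate(path):
        if not in_domain(x):
            return k
    return None

def stopped_pair(times, path, in_domain):
    """Discrete analogue of Y_t = (min(t, tau), X_{min(t, tau)}) on the time grid."""
    k_exit = exit_index(path, in_domain)
    out = []
    for k in range(len(times)):
        # Freeze both the clock and the state once the exit has occurred.
        j = k if k_exit is None or k < k_exit else k_exit
        out.append((times[j], path[j]))
    return out

times = [0.0, 0.25, 0.5, 0.75, 1.0]
path = [1.0, 0.4, -0.1, 0.3, 0.8]                     # hypothetical sample path
Y = stopped_pair(times, path, lambda x: x > 0.0)      # domain O = (0, infinity)
```

Before the exit the pair equals $(t,X_{t})$, so a control that is a function of the pair is a function of time and current state only; after the exit both components are frozen.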

4 Uniqueness of solutions of the mean-field game

In this section we address the problem of uniqueness of MFG solutions. Precisely, under Assumptions (H1)-(H8) and with the additional Assumptions (U1)-(U4) given below, where the second one guarantees monotonicity of the running cost in the same spirit as [47] (see also Theorem 3.29 in [11]), we show uniqueness of the MFG solution also in the presence of smooth dependence on past absorptions. The extra assumptions can be formulated as follows.

(U1)

The running cost can be split in two terms:

[TABLE]

for some measurable functions $\bar{f}_{0}:[0,T]\times\mathbb{R}^{d}\times\Gamma\rightarrow[0,\infty)$ and $\bar{f}_{1}:[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\rightarrow[0,\infty)$ .

(U2)

Lasry-Lions monotonicity assumption: Let $\mu,\tilde{\mu}\in\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})$ , $\mu\neq\tilde{\mu}$ . Then

[TABLE]

(U3)

The drift $b$ does not depend on the measure variable.

(U4)

Let $\bar{\mu}\in\Upsilon^{T}_{\leq 1,1}$ be fixed. Then the following optimization problem

[TABLE]

has a unique solution $\Lambda^{\bar{\mu}}$ , where $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},W,X)$ is a solution of Eq.(2.7) under $\Lambda^{\bar{\mu}}$ with initial distribution $\nu$ and drift $b$ satisfying (U3).

Theorem 4.1 (Uniqueness).

Under Assumptions (H1)-(H8) and (U1)-(U4), if there exists a feedback solution of the MFG $(\lambda,\mu)$ (as in Definition 2.2) then it is unique.

Proof.

By contradiction, let $(\lambda,\mu)$ and $(\tilde{\lambda},\tilde{\mu})$ be two different feedback MFG solutions (as in Definition 2.2). Then

[TABLE]

where the inequality is strict by uniqueness of the minimizer in Assumption (U4), and in particular

[TABLE]

However, thanks to Assumption (U3) that grants independence of the dynamics of the state processes from the flows of measures $\mu$ and $\tilde{\mu}$

[TABLE]

where $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},W,X)$ and $(\tilde{\Omega},\tilde{\mathcal{F}},(\tilde{\mathcal{F}}_{t})_{t\in[0,T]},\tilde{\mathbb{P}},\tilde{W},\tilde{X})$ are weak solutions of Eq.(2.5) respectively with controls $\lambda$ and $\tilde{\lambda}$ . Set $\theta\doteq\mathbb{P}\circ X^{-1}$ and $\tilde{\theta}\doteq\tilde{\mathbb{P}}\circ\tilde{X}^{-1}$ . Then

[TABLE]

which is less than or equal to zero by Assumption (U2). In the second equality we have used the Fubini-Tonelli theorem, while the third one comes from the definitions of $\mu$ and $\tilde{\mu}$ , i.e.

[TABLE]

for all $B\in\mathcal{B}(\mathbb{R}^{d})$ and similarly for $\tilde{\mu}$ . ∎

Example 4.1 (Non-local dependence on the measure through a weighted average).

We provide an example of running cost $\bar{f}$ satisfying the monotonicity condition (U2), which is an assumption on the measure-dependent term $\bar{f}_{1}$ only. Let $w:\mathbb{R}^{d}\to[0,\infty)$ be some measurable function with sub-linear growth so that

[TABLE]

and set

[TABLE]

Since

[TABLE]

we obtain

[TABLE]
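The mechanism behind weighted-average costs can be checked numerically. The sketch below assumes a product form $f_{1}(x,\mu)=w(x)\int w\,d\mu$ and finitely supported sub-probability measures. Both choices are illustrative assumptions and may differ from the exact formula of Example 4.1. Under them, the pairing $\int(f_{1}(x,\mu)-f_{1}(x,\tilde{\mu}))(\mu-\tilde{\mu})(dx)$ collapses to the square of $\int w\,d(\mu-\tilde{\mu})$.

```python
def w(x):
    # Hypothetical nonnegative weight with sub-linear growth.
    return 1.0 + abs(x)

def integral(fn, measure):
    """Integrate fn against a finitely supported measure given as {point: mass}."""
    return sum(mass * fn(x) for x, mass in measure.items())

def f1(x, measure):
    # Assumed product form: w(x) times the w-weighted mass of the measure.
    return w(x) * integral(w, measure)

def pairing(mu, mu_tilde):
    """Integral of f1(x, mu) - f1(x, mu_tilde) against the signed measure mu - mu_tilde."""
    diff = lambda x: f1(x, mu) - f1(x, mu_tilde)
    return integral(diff, mu) - integral(diff, mu_tilde)

# Sub-probability measures: total mass at most one (some players were absorbed).
mu = {-1.0: 0.2, 0.5: 0.5}
mu_tilde = {0.0: 0.3, 2.0: 0.4}

gap = integral(w, mu) - integral(w, mu_tilde)   # difference of the weighted averages
value = pairing(mu, mu_tilde)                   # equals gap squared under the assumed form
```

The sign convention of (U2) should be read off the assumption itself; the sketch only shows that the pairing is a perfect square for this product form.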

5 Approximate Nash equilibria for the $N$ -player game with finite-dimensional interaction

In this section, we consider an important particular case of our MFG with absorption, where the mean-field interaction is finite-dimensional. This is inspired by the original model of [9]. We show that any feedback solution of the MFG can be used to construct a sequence of approximate Nash equilibria for the corresponding $N$ -player game. To this end, we will need two additional assumptions (Assumptions (N1) and (N2) below). We focus on the finite-dimensional case for two reasons. First, for technical reasons: this setting is well suited to the propagation of chaos result that we use in the proofs, without being too technical. Second, we think that this case is also particularly relevant for applications, as mentioned in the introduction. Overall, we believe that the finite-dimensional setting enables us to keep a good balance between abstract technicalities and modelling needs.

The approximation result is the content of Theorem 5.1 and Corollary 5.2. In order to prove this, we interpret the $N$ -player system as a system of $N$ interacting diffusions (as in, e.g., [49, 57, 28]). While the usual mode of convergence of an $N$ -particle system is the convergence in law of the empirical measures, here we obtain a stronger form of propagation of chaos as in [42] but with possibly unbounded drift in the state variable. We prove that the empirical measures converge in the stronger $\tau$ -topology, which is widely used in the large deviations literature (see, for instance, Chapter 6.2 in Dembo and Zeitouni [21]); see Subsection 5.3.

5.1 The setting with finite-dimensional interaction

Here, we describe the MFG and the corresponding $N$ -player game with smooth dependence on past absorptions, specializing them to the finite-dimensional interaction setting. In particular, we give the definition of $\epsilon$ -Nash equilibrium for the $N$ -player game. Then, we give the assumptions that are specific to this model. We conclude by checking that the MFG with finite-dimensional interactions satisfies the hypotheses of Theorem 3.1, granting the existence of relaxed and strict solutions of the MFG.

The mean-field dynamics. Given a feedback control $u\in\mathcal{U}_{fb}$ and a flow of sub-probability measures $\mu\in\Upsilon^{T}_{\leq 1,1}$ , the representative player’s state evolves according to the equation

[TABLE]

where $X$ is a $d$ -dimensional stochastic process starting at $X_{0}\overset{d}{\sim}\nu\in\mathcal{P}(\mathbb{R}^{d})$ , $W$ is a $d$ -dimensional Wiener process on some filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P})$ , $\tilde{b}$ and $\sigma$ are as in the assumptions below. In addition, $m_{w}\left(\mu\right)$ and $L\left(\mu\right)$ are functions $m_{w}:\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\rightarrow\mathbb{R}^{d_{0}}$ and $L:\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\rightarrow[0,1]$ defined as

[TABLE]

where $w:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d_{0}}$ , $d_{0}\in\mathbb{N}$ , is a fixed weight function with sub-linear growth. Again, solutions of Eq.(5.1) are understood in the weak sense (see Remark 2.5). The cost associated to a strategy $u\in\mathcal{U}_{fb}$ and a flow of sub-probability measures $\mu\in\Upsilon^{T}_{\leq 1,1}$ is given by

[TABLE]

where $\tau\doteq\tau^{X}\wedge T$ is the random time horizon as in the previous sections.

The $N$ -player dynamics. Let $N\in\mathbb{N}$ be the number of players. We assume that the players’ private states evolve according to the following system of $N$ $d$ -dimensional SDEs: for $i\in\left\{1,\ldots,N\right\}$ ,

[TABLE]

for $t\in[0,T]$ , where $X^{N,i}_{0}\overset{d}{\sim}\nu$ i.i.d., $W^{N,1},\ldots,W^{N,N}$ is an $N$ -dimensional vector of independent $d$ -dimensional Wiener processes, $\textbf{X}^{N}$ denotes the vector of all players’ private states, $\textbf{{u}}^{N}$ the vector of feedback strategies, $\tilde{b}$ and $\sigma$ are as in the assumptions below. We recall that $\mu^{N}\in\Upsilon^{T}_{\leq 1,1}$ is the flow of random empirical sub-probability measures defined as

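$$\mu^{N}_{t}(\cdot)\doteq\frac{1}{N}\sum_{i=1}^{N}\delta_{X^{N,i}_{t}}(\cdot)\,\mathbb{I}_{[0,\tau^{X^{N,i}})}(t).$$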

Solutions of the SDEs in Eq.(5.3) are understood to be in the weak sense on some filtered probability space $(\Omega^{N},\mathcal{F}^{N},(\mathcal{F}^{N}_{t})_{t\in[0,T]},\mathbb{P}^{N})$ satisfying the usual conditions (see Remark 2.5).
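The finite-dimensional interaction can be made concrete with a short Python sketch that computes, at a fixed time, two summaries of the empirical sub-probability measure of the survivors. It assumes that $L(\mu)$ is the fraction of absorbed players, $1-\mu(\mathbb{R}^{d})$, and that $m_{w}(\mu)=\int w\,d\mu$. The first is an assumption about the map $L$, and the function name `empirical_interaction` and the sample data are hypothetical.

```python
def empirical_interaction(states, alive, w):
    """Summaries of mu^N_t = (1/N) * sum_i delta_{X_i} 1{player i not yet absorbed}.

    Returns (loss, m_w), with loss = 1 - mu^N_t(R^d) and m_w = integral of w d mu^N_t.
    The form of `loss` is an assumption about the map L.
    """
    n = len(states)
    survivors = [x for x, a in zip(states, alive) if a]
    mass = len(survivors) / n                     # total mass of the sub-probability measure
    m_w = sum(w(x) for x in survivors) / n        # normalised by N, not by the survivor count
    return 1.0 - mass, m_w

states = [0.5, 1.5, -0.2, 2.0]           # hypothetical private states, d = 1
alive = [True, True, False, True]        # the third player has been absorbed
loss, m_w = empirical_interaction(states, alive, w=lambda x: x)
```

Note that the weighted average is divided by the total number of players, which is what makes the measure non-normalized: absorbed players lower the mass rather than being dropped from the average.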

Let $\mathcal{U}_{1}^{N}$ be the set of all progressively measurable functionals $u:[0,T]\times\mathcal{X}^{N}\rightarrow\Gamma$ , and let $\mathcal{U}_{N}^{N}$ be the set of all vectors $\textbf{{u}}^{N}$ such that $u^{N,i}\in\mathcal{U}_{1}^{N}$ , $i\in\left\{1,\ldots,N\right\}$ . Each element of $\mathcal{U}_{N}^{N}$ is called a feedback strategy vector. In this game, player $i$ evaluates a strategy vector $\textbf{{u}}^{N}\in\mathcal{U}^{N}_{N}$ according to his/her expected cost

[TABLE]

over a random time horizon, where $\textbf{X}^{N}$ is the $N$ -player dynamics under $\textbf{{u}}^{N}$ and $\tau^{N,i}\doteq\tau^{X^{N,i}}\wedge T$ . Our aim is the construction of approximate Nash equilibria for the $N$ -player game from a solution of the limit problem. In the next definition, we use the standard notation $[u^{N,-i},v]$ to indicate a strategy vector equal to $\textbf{{u}}^{N}$ for all players but the $i$ -th, who deviates by playing $v\in\mathcal{U}^{N}_{1}$ instead.

Definition 5.1 ( $\epsilon$ -Nash equilibrium).

Let $\epsilon\geq 0$ . A strategy vector $\textbf{{u}}^{N}\in\mathcal{U}_{N}^{N}$ is called an $\epsilon$ -Nash equilibrium for the $N$ -player game if for every $i\in\{1,\ldots,N\}$ and for any deviation $v\in\mathcal{U}^{N}_{1}$ we have:

[TABLE]
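As a minimal sketch of how the definition can be tested numerically, the Python function below assumes the usual form of the $\epsilon$ -Nash condition: no player can lower his/her cost by more than $\epsilon$ through a unilateral deviation. It only checks a finite family of deviations, whereas the definition quantifies over all of $\mathcal{U}^{N}_{1}$ , so a positive answer is necessary but not sufficient. The function name and the cost values are hypothetical.

```python
def is_eps_nash(equilibrium_costs, deviation_costs, eps):
    """Check J_i(u) <= J_i([u^{-i}, v]) + eps for every player i and every tested deviation v.

    equilibrium_costs[i] is player i's cost under the candidate strategy vector;
    deviation_costs[i] lists player i's costs under a finite set of unilateral deviations.
    """
    return all(
        j_i <= j_dev + eps
        for j_i, devs in zip(equilibrium_costs, deviation_costs)
        for j_dev in devs
    )

# Hypothetical cost estimates for a two-player game.
costs = [1.00, 1.02]
deviations = [[0.97, 1.10], [1.05]]

loose = is_eps_nash(costs, deviations, eps=0.05)   # best deviation gains 0.03 <= 0.05
tight = is_eps_nash(costs, deviations, eps=0.01)   # gain of 0.03 exceeds 0.01
```

In practice the costs would come from Monte Carlo estimates of the expected cost over the random time horizon, so the comparison should also account for estimation error.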

Relaxed controls. It will be very convenient to use relaxed controls also in the $N$ -player case. Let $\widetilde{\mathcal{U}}^{N}_{1}$ be the set of all single-player relaxed strategies for the $N$ -player game, and let $\widetilde{\mathcal{U}}^{N}_{N}$ be the set of $N$ -player relaxed strategy vectors, i.e. vectors $\boldsymbol{\lambda}^{N}=(\lambda^{N,1},\ldots,\lambda^{N,N})$ with $\lambda^{N,i}\in\widetilde{\mathcal{U}}^{N}_{1}$ , $i\in\{1,\ldots,N\}$ . At this point, we can rewrite the dynamics and the cost functional of the $N$ -player game (Eq.(5.3) and Eq.(5.5)) by using relaxed controls as

[TABLE]

with associated cost

[TABLE]

for $t\in[0,T]$ , $i\in\{1,\ldots,N\}$ , $\boldsymbol{\lambda}^{N}\in\widetilde{\mathcal{U}}^{N}_{N}$ and $\lambda^{N,i}\in\widetilde{\mathcal{U}}^{N}_{1}$ for all $i\in\{1,\ldots,N\}$ . Moreover, we extend accordingly the notion of $\epsilon$ -Nash equilibrium.

Definition 5.2 (Relaxed $\epsilon$ -Nash equilibrium).

A strategy vector $\boldsymbol{\lambda}^{N}\in\widetilde{\mathcal{U}}_{N}^{N}$ is an $\epsilon$ -Nash equilibrium for the $N$ -player game if for every $i\in\{1,\ldots,N\}$ and for any single-player strategy $\beta\in\widetilde{\mathcal{U}}^{N}_{1}$

$$J^{N,i}(\boldsymbol{\lambda}^{N})\leq J^{N,i}([\boldsymbol{\lambda}^{N,-i},\beta])+\epsilon.$$

The drift $\tilde{b}$ , the function $w$ , the running cost $\tilde{f}$ and the terminal cost $F$ now satisfy the following assumptions, replacing Assumptions (H1)-(H3):

(H1’)

The drift $\tilde{b}:[0,T]\times\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma\rightarrow\mathbb{R}^{d}$ is jointly continuous and satisfies the following uniform Lipschitz continuity: there exists $L>0$ such that

[TABLE]

for all $x,x^{\prime}\in\mathbb{R}^{d}$ and all $(t,\ell,m,u)\in[0,T]\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma$ . Moreover it has sub-linear growth in $(x,m)$ uniformly in the other variables, i.e. there exists a constant $C>0$ such that

[TABLE]

for all $(t,x,\ell,m,u)\in[0,T]\times\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma$ .

(H2’)

$w:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d_{0}}$ is continuous and has sub-linear growth: $|w(x)|\leq C(1+|x|)$ for all $x\in\mathbb{R}^{d}$ .

(H3’)

The costs $\tilde{f}:\left[0,T\right]\times\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma\rightarrow[0,\infty)$ and $F:\left[0,T\right]\times\mathbb{R}^{d}\rightarrow[0,\infty)$ are jointly continuous. Moreover, they have sub-linear growth:

[TABLE]

for all $(t,x,\ell,m,u)\in[0,T]\times\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma$ .

We conclude the presentation of the finite-dimensional model by introducing the coefficients’ reparametrization on $\mathcal{P}_{1}(\mathcal{X})$ and by checking their joint continuity (as in Assumption (H3)), where continuity in the measure variable is in the 1-Wasserstein distance and at points $\theta\ll\mathcal{W}^{\nu}$ . We set $(\bar{b},\bar{f})(t,x,\mu,u)\doteq(\tilde{b},\tilde{f})(t,x,L(\mu),m_{w}(\mu),u)$ for all $(t,x,\mu,u)\in[0,T]\times\mathbb{R}^{d}\times\mathcal{M}_{\leq 1,1}(\mathbb{R}^{d})\times\Gamma$ and define the reparametrization $(b,f)$ as in Section 2. Then

[TABLE]

where

[TABLE]

are called the average and loss processes; they equal $m_{w}(\mu_{t})$ and $L(\mu_{t})$ , respectively, when $\mu_{t}=g(t,\theta)$ , where $g$ is defined as in Eq.(2.2).
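To illustrate the two processes, here is a minimal sketch that evaluates them on the empirical measure of simulated paths. It assumes that the loss process is the fraction of paths absorbed by time $t$, i.e. $L(t;\theta)=1-\theta(\tau>t)$, and that the average process is $m_{w}(t;\theta)=\int w(\varphi(t))\mathbf{1}_{[0,\tau(\varphi))}(t)\theta(d\varphi)$; the one-dimensional driftless dynamics, the interval playing the role of the non-absorbing set and the names `simulate_paths` and `loss_and_average` are hypothetical choices.

```python
import math
import random
from typing import Callable, List, Tuple


def simulate_paths(n_players: int, n_steps: int, horizon: float,
                   sigma: float, seed: int) -> List[List[float]]:
    """Toy dynamics (assumption): driftless 1-d paths X = sigma * W started at 0."""
    rng = random.Random(seed)
    step_std = sigma * math.sqrt(horizon / n_steps)
    paths = []
    for _ in range(n_players):
        path = [0.0]
        for _ in range(n_steps):
            path.append(path[-1] + step_std * rng.gauss(0.0, 1.0))
        paths.append(path)
    return paths


def loss_and_average(paths: List[List[float]], w: Callable[[float], float],
                     lower: float, upper: float) -> Tuple[List[float], List[float]]:
    """Return (L, m_w) on the time grid.

    L[k]   = fraction of paths that left (lower, upper) at or before step k,
    m_w[k] = (1/N) * sum of w(X_k) over paths that are still inside at step k.
    """
    n = len(paths)
    alive = [True] * n
    loss, avg = [], []
    for k in range(len(paths[0])):
        total, survivors = 0.0, 0
        for i, path in enumerate(paths):
            if alive[i] and not (lower < path[k] < upper):
                alive[i] = False  # absorbed: never re-enters the measure
            if alive[i]:
                survivors += 1
                total += w(path[k])
        loss.append(1.0 - survivors / n)
        avg.append(total / n)
    return loss, avg
```

The normalisation by $N$ rather than by the number of survivors reflects the non-normalized empirical sub-probability measure: an absorbed path contributes zero mass at all later times, even if the underlying path re-enters the interval.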

Joint continuity of $b$ and $f$ follows from joint continuity of $\tilde{b}$ and $\tilde{f}$ and from the following lemma.

Lemma 5.1 (Continuity of the average and loss processes).

Grant Assumptions (H1’)-(H3’) and (H4)-(H8). Let $(\theta^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(\mathcal{X})$ converge in the 1-Wasserstein distance to $\theta\in\mathcal{P}_{1}(\mathcal{X})$ with $\theta\ll\mathcal{W}^{\nu}$ . Then, for all $t\in[0,T]$ :

(i)

$L(t;\theta^{n})\rightarrow L(t;\theta)$ as $n\rightarrow\infty$ .

(ii)

$m_{w}(t;\theta^{n})\rightarrow m_{w}(t;\theta)$ as $n\rightarrow\infty$ .

Proof.

(i). For $t\in[0,T]$ , denote by $\mathbb{D}_{\tau}(t)$ the set of discontinuity points of the map $\varphi\mapsto\mathbf{1}_{[0,\tau(\varphi))}(t)$ . Convergence in the 1-Wasserstein distance implies, in particular, the weak convergence $\theta^{n}\overset{w}{\rightharpoonup}\theta$ . Then:

[TABLE]

for all $t\in[0,T]$ . This follows from the definition of weak convergence of measures, the fact that $\theta(\mathbb{D}_{\tau}(t))=0$ for all $t\in[0,T]$ (due to $\theta\ll\mathcal{W}^{\nu}$ ) and by Lemma A.4.(d).

(ii). Now we have:

[TABLE]

for all $t\in[0,T]$ as a consequence of the convergence in the 1-Wasserstein distance, the fact that $\theta(\mathbb{D}_{\tau}(t))=0$ for all $t\in[0,T]$ and by Lemma A.4.(d) together with Lemma A.5. ∎

We conclude by showing that Theorem 3.1 applies, which yields existence of relaxed and strict feedback solutions of the MFG with smooth dependence on past absorptions and finite-dimensional dependence on the measure.

Corollary 5.1 (Existence of relaxed and strict feedback MFG solutions).

Under Assumptions (H1’)-(H3’), (H4)-(H8) and (C1) , there exists a relaxed feedback solution $(\lambda,\mu)$ of the MFG with finite dimensional interaction. Moreover, under the additional Assumption (C2) , there exists a strict feedback MFG solution $(u,\mu)$ .

Proof.

Assumptions (H1’)-(H3’) imply Assumptions (H1)-(H3) of Theorem 3.1. Indeed, (H1)-(H2) follow from the definition of the coefficients $\tilde{b}$ and $\tilde{f}$ . Assumption (H3), i.e. joint continuity of the reparametrized coefficients, is a consequence of joint continuity of $\tilde{b}$ and $\tilde{f}$ and Lemma 5.1. ∎

5.2 The $N$ -player approximation theorem

In order to state the $N$ -player approximation results, we need the following two additional assumptions (N1)-(N2), whose formulation requires some more terminology.

We set

[TABLE]

for all $\theta,\,\tilde{\theta}\in\mathcal{P}(\mathcal{X})$ and we note that for $t\in[0,T)$ , $d^{TV}_{t}$ is only a pseudo-metric, whereas for $t=T$ it is a proper metric; $d_{T}^{TV}$ is called the total variation distance. However, with a slight abuse of terminology, we will often refer to $d^{TV}_{t}$ as the total variation distance for each $t\in\left[0,T\right]$ .
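A short example may help to see why the distance at time $t<T$ fails to separate points. The sketch below assumes that $d^{TV}_{t}$ compares the two laws only through the behaviour of the paths on $[0,t]$; the processes $X$ and $\tilde{X}$ are hypothetical.

```latex
% Assumption: d^{TV}_t(\theta,\tilde\theta) depends only on the laws of the paths restricted to [0,t].
% Let X be a Wiener process and double its increments after time t:
\[
  \theta \doteq \mathbb{P}\circ X^{-1}, \qquad
  \tilde\theta \doteq \mathbb{P}\circ \tilde X^{-1}, \qquad
  \tilde X_s \doteq X_{s\wedge t} + 2\,(X_{s} - X_{s\wedge t}), \quad s\in[0,T].
\]
% X and \tilde X coincide on [0,t], hence
\[
  d^{TV}_t(\theta,\tilde\theta) = 0 ,
  \qquad\text{while}\qquad
  \theta \neq \tilde\theta \quad \text{for } t<T,
\]
% since the increment X_T - X_t has variance (T-t) under \theta and 4(T-t) under \tilde\theta.
```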

(N1)

The function $w:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d_{0}}$ is bounded.

(N2)

The drift $\tilde{b}$ satisfies the following Lipschitz continuity:

[TABLE]

for all $(x,\ell,m),(x^{\prime},\ell^{\prime},m^{\prime})\in\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}$ and all $(t,u)\in[0,T]\times\Gamma$ , with Lipschitz constant $L>0$ . The running cost $\tilde{f}$ can be decomposed as

[TABLE]

where

[TABLE]

for all $(t,x,\ell,m,u)\in[0,T]\times\mathbb{R}^{d}\times[0,1]\times\mathbb{R}^{d_{0}}\times\Gamma$ and some constants $C,K>0$ .

From Assumptions (N1)-(N2), the reparametrizations $b$ and $f$ inherit a series of properties that are fundamental in the proof of the approximation result. First, since $w:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d_{0}}$ is bounded, the drift $b$ is Lipschitz continuous with respect to the total variation distance, which is a key assumption in Lemma 5.2. Indeed

[TABLE]

because

[TABLE]

Second, the sub-linear growth property

[TABLE]

is uniform in $\theta\in\mathcal{P}(\mathcal{X})$ and in $u\in\Gamma$ , implying that $b$ is bounded in the measure and control variables (and analogously $f$ ). This means that $b$ and $f$ are well defined on all of $\mathcal{P}(\mathcal{X})$ , not only on $\mathcal{P}_{1}(\mathcal{X})$ , which is fundamental to apply the fixed point theorem in Lemma 5.2. Finally, the running cost $f$ can be decomposed as

[TABLE]

where its components are

[TABLE]

which inherit from $\tilde{f}_{0}$ and $\tilde{f}_{1}$ the properties

[TABLE]

for all $(t,\varphi,\theta,u)\in[0,T]\times\mathcal{X}\times\mathcal{P}(\mathcal{X})\times\Gamma$ . This is a key assumption to perform the passage to the many-player limit in Theorem 5.1. Indeed, boundedness in the control of $f_{0}$ enables us to exploit convergence in the $\tau$ -topology while sub-linearity in the state variable $\varphi$ uniformly in the measure variable $\theta$ makes $f_{1}$ a good test function for the convergence in the 1-Wasserstein distance.
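The first of the inherited properties, Lipschitz continuity of $b$ in total variation, can be traced through the following sketch. It assumes the forms $m_{w}(t;\theta)=\int w(\varphi(t))\mathbf{1}_{[0,\tau(\varphi))}(t)\theta(d\varphi)$ and $L(t;\theta)=1-\int\mathbf{1}_{[0,\tau(\varphi))}(t)\theta(d\varphi)$ , a total variation distance normalised as a supremum over test functions bounded by one, and a Lipschitz bound in (N2) that is additive in $(\ell,m)$ ; the constants change with a different normalisation.

```latex
% Assumption: d^{TV}_t(\theta,\theta') = \sup_{|g|\le 1} |\int g\, d(\theta-\theta')|,
% the supremum running over test functions g that depend on the path up to time t only.
\begin{align*}
|m_w(t;\theta)-m_w(t;\theta')|
 &= \Big|\int_{\mathcal X} w(\varphi(t))\,\mathbf 1_{[0,\tau(\varphi))}(t)\,(\theta-\theta')(d\varphi)\Big|
 \le \|w\|_\infty\, d^{TV}_t(\theta,\theta'),\\
|L(t;\theta)-L(t;\theta')|
 &= \Big|\int_{\mathcal X} \mathbf 1_{[0,\tau(\varphi))}(t)\,(\theta-\theta')(d\varphi)\Big|
 \le d^{TV}_t(\theta,\theta'),\\
|b(t,\varphi,\theta,u)-b(t,\varphi,\theta',u)|
 &\le L\big(|L(t;\theta)-L(t;\theta')| + |m_w(t;\theta)-m_w(t;\theta')|\big)
 \le L\,(1+\|w\|_\infty)\, d^{TV}_t(\theta,\theta').
\end{align*}
```

Both integrands depend on $\varphi$ only through its behaviour on $[0,t]$ , which is why the distance at time $t$ suffices, and boundedness of $w$ from (N1) is exactly what makes the first bound finite.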

Theorem 5.1 (Approximate Nash equilibria - relaxed).

Let $(\lambda,\mu)$ be a relaxed feedback MFG solution. For all $N\geq 2$ , define $\boldsymbol{\lambda}^{N}=(\lambda^{N,1},\ldots,\lambda^{N,N})\in\tilde{\mathcal{U}}^{N}_{N}$ where $\lambda^{N,i}(t,\varphi^{N})\doteq\lambda(t,\varphi^{N,i})$ for all $i\in\{1,\ldots,N\}$ , $t\in[0,T]$ and $\varphi^{N}\in\mathcal{X}^{N}$ .

Then under Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2), for every $\epsilon>0$ there exists $N^{\epsilon}\in\mathbb{N}$ such that $\boldsymbol{\lambda}^{N}$ is an $\epsilon$ -Nash equilibrium for the $N$ -player game whenever $N\geq N^{\epsilon}$ , i.e. for every $i\in\{1,\ldots,N\}$ and for any deviation $\beta\in\tilde{\mathcal{U}}^{N}_{1}$

$$J^{N,i}(\boldsymbol{\lambda}^{N})\leq J^{N,i}([\boldsymbol{\lambda}^{N,-i},\beta])+\epsilon$$

for all $N\geq N^{\epsilon}$ .

Corollary 5.2 (Approximate Nash equilibria - strict).

Let $(u,\mu)$ be a strict feedback MFG solution. For all $N\geq 2$ , define $\textbf{{u}}^{N}=(u^{N,1},\ldots,u^{N,N})\in\mathcal{U}^{N}_{N}$ where $u^{N,i}(t,\varphi^{N})\doteq u(t,\varphi^{N,i})$ for all $i\in\{1,\ldots,N\}$ , $t\in[0,T]$ and $\varphi^{N}\in\mathcal{X}^{N}$ .

Then under Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2), for every $\epsilon>0$ there exists $N^{\epsilon}\in\mathbb{N}$ such that $\textbf{{u}}^{N}$ is an $\epsilon$ -Nash equilibrium for the $N$ -player game whenever $N\geq N^{\epsilon}$ , i.e. for every $i\in\{1,\ldots,N\}$ and for any deviation $v\in\mathcal{U}^{N}_{1}$

$$J^{N,i}(\textbf{{u}}^{N})\leq J^{N,i}([u^{N,-i},v])+\epsilon$$

for all $N\geq N^{\epsilon}$ .

Before proceeding, we define the empirical measure $\zeta^{N}$ of the $N$ -player system (Eq.(5.6)) as

[TABLE]

which is a $\mathcal{P}(\mathcal{X})$ -valued random variable. Moreover, we fix a relaxed feedback MFG solution $(\lambda,\mu)$ and define (cf. Theorem 5.1 and Corollary 5.2) $\boldsymbol{\lambda}^{N}\in\tilde{\mathcal{U}}^{N}_{N}$ as $\boldsymbol{\lambda}^{N}\doteq(\lambda^{N,i})_{i=1,\ldots,N}$ where $\lambda^{N,i}(t,\varphi^{N})\doteq\lambda(t,\varphi^{N,i})$ for all $i=1,\ldots,N$ , $t\in[0,T]$ and $\varphi^{N}\in\mathcal{X}^{N}$ . In the next two subsections we consider the following $N$ -particle system:

[TABLE]

for $i=2,\ldots,N$ and $t\in[0,T]$ , where $\beta\in\tilde{\mathcal{U}}^{N}_{1}$ is a generic single-player control. Precisely, in Subsection 5.3 we set $\beta(t,\varphi^{N})\doteq\lambda(t,\varphi^{N,1})$ for $t\in[0,T]$ and $\varphi^{N}\in\mathcal{X}^{N}$ (we write $\beta=\lambda$ for short), whereas in Subsection 5.4 we let $\beta$ be generic (unless otherwise specified), which means that we allow the first player to deviate from the MFG solution $\lambda$ .

5.3 Propagation of chaos

In this subsection we consider the system of $N$ interacting symmetric diffusions given by Eq.s (5.9) and (5.10) with $\beta=\lambda$ . We associate to this system a suitable McKean-Vlasov equation (Eq.(5.11) below) and show a propagation of chaos result, that we will need in the proofs of Theorem 5.1 and Corollary 5.2.

Definition 5.3 (McKean-Vlasov solution).

A law $\theta^{*}\in\mathcal{P}(\mathcal{X})$ is a McKean-Vlasov solution of equation

[TABLE]

if there exists a weak solution $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,W)$ with $\mathbb{P}\circ X^{-1}=\theta^{*}$ and $\mathbb{P}\circ X^{-1}_{0}=\nu$ .

The following lemma ensures the well-posedness of Eq.(5.11).

Lemma 5.2 (Existence and uniqueness of McKean-Vlasov solutions).

Grant Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2). Then, there exists a unique McKean-Vlasov solution for Eq.(5.11).

Proof.

We follow [42], proof of Theorem 2.4. Precisely, we apply the Banach fixed point theorem on the complete metric space $(\mathcal{P}(\mathcal{X}),d^{TV}_{T})$ together with Picard iterations. To this end, we start by defining, for any $\alpha>0$ , the following distance:

[TABLE]

We note that $d^{\alpha}(\cdot,\cdot)$ is a complete metric on $\mathcal{P}(\mathcal{X})$ . We now define $\Psi:\mathcal{P}\left(\mathcal{X}\right)\rightarrow\mathcal{P}(\mathcal{X})\subset\mathcal{P}\left(\mathcal{X}\right)$ as the map $\theta\mapsto\Psi(\theta)\doteq\mathbb{P}^{\theta}\circ(X^{\theta})^{-1}$ where $(\Omega^{\theta},\mathcal{F}^{\theta},\mathbb{P}^{\theta},X^{\theta},W^{\theta})$ is a weak solution of Eq.(5.11) with $\theta$ in the drift, which is well defined (see Remark 2.5).

We show that $\Psi$ is a contraction on $\mathcal{P}\left(\mathcal{X}\right)$ with respect to the distance $d^{\alpha}$ for a sufficiently large $\alpha>0$ . Let $\mathcal{H}(\theta|\theta^{\prime})$ denote the relative entropy of $\theta$ with respect to $\theta^{\prime}$ for $\theta,\theta^{\prime}\in\mathcal{P}(\mathcal{X})$ , and let $\mathcal{H}_{t}(\theta|\theta^{\prime})=\mathcal{H}(\theta_{t}|\theta^{\prime}_{t})$ , $\theta_{t}\doteq\mathbb{P}^{\theta}\circ(X^{\theta}_{\cdot\wedge t})^{-1}$ . By Pinsker’s inequality, there exists a constant $C_{H}>0$ such that

[TABLE]

where we set $\tilde{L}\doteq L^{TV}_{b}$ . Therefore, we have

[TABLE]

which shows that $\Psi$ is a contraction whenever $\frac{1}{2}\frac{C_{H}}{\alpha}|\sigma^{-1}|^{2}\tilde{L}^{2}<1$ . Thanks to the arbitrariness of $\alpha>0$ , we conclude that $\Psi$ has a unique fixed-point in $\mathcal{P}(\mathcal{X})$ . ∎
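The role of the weight $\alpha$ in the contraction argument can be seen in the following sketch. It assumes that $d^{\alpha}$ is an exponentially weighted supremum of the distances $d^{TV}_{t}$ (the exact normalisation used in the proof may differ) and that the relative entropy is controlled through Girsanov’s theorem and the Lipschitz continuity of $b$ in total variation.

```latex
% Assumed form of the weighted distance:
\[
  d^{\alpha}(\theta,\theta')^{2} \doteq \sup_{t\in[0,T]} e^{-\alpha t}\, d^{TV}_t(\theta,\theta')^{2}.
\]
% Pinsker's inequality, the Girsanov entropy identity and the TV-Lipschitz drift give,
% with K \doteq \tfrac12 C_H |\sigma^{-1}|^{2}\tilde L^{2},
\[
  d^{TV}_t(\Psi(\theta),\Psi(\theta'))^{2}
  \le C_H\,\mathcal H_t(\Psi(\theta)|\Psi(\theta'))
  \le K \int_0^t d^{TV}_s(\theta,\theta')^{2}\,ds .
\]
% Multiply by e^{-\alpha t} and write d_s^2 = e^{\alpha s}\, e^{-\alpha s} d_s^2 under the integral:
\[
  e^{-\alpha t}\, d^{TV}_t(\Psi(\theta),\Psi(\theta'))^{2}
  \le K\, d^{\alpha}(\theta,\theta')^{2} \int_0^t e^{-\alpha (t-s)}\,ds
  \le \frac{K}{\alpha}\, d^{\alpha}(\theta,\theta')^{2} .
\]
```

Taking the supremum over $t$ shows that any $\alpha>K$ makes $\Psi$ a strict contraction, which matches the condition $\frac{1}{2}\frac{C_{H}}{\alpha}|\sigma^{-1}|^{2}\tilde{L}^{2}<1$ .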

We consider the sequence of empirical measures $(\zeta^{N})_{N\in\mathbb{N}}$ in Eq.(5.8) associated to the $N$ -particle systems in Eq.s (5.9) and (5.10) (with $\beta=\lambda$ ). We follow [42] and we prove the convergence, both in law and in probability in the $\tau$ -topology, of $(\zeta^{N})_{N\in\mathbb{N}}$ to the McKean-Vlasov solution $\theta^{*}\in\mathcal{P}(\mathcal{X})$ of Eq.(5.11). We recall that the $\tau$ -topology on $\mathcal{P}(\mathcal{X})$ , denoted by $\tau(\mathcal{P}(\mathcal{X}))$ , is the topology generated by the sets

[TABLE]

where $f:\mathcal{X}\rightarrow\mathbb{R}$ is any measurable bounded function, $x\in\mathbb{R}$ and $\delta$ is any strictly positive constant. In particular, the $\tau$ -topology is the coarsest topology that makes the maps $\pi\mapsto\int_{\mathcal{X}}f(y)\pi(dy)$ continuous for all measurable bounded functions $f:\mathcal{X}\rightarrow\mathbb{R}$ (see, for instance, Chapter 6.2 in Dembo and Zeitouni [21]).
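As an illustration of why the $\tau$ -topology is the convenient one in the presence of absorption, consider the survival indicator, which is bounded and measurable but not continuous on path space. The notation $f_{t}$ below is introduced only for this example.

```latex
% A bounded measurable, but discontinuous, test function on path space:
\[
  f_t(\varphi) \doteq \mathbf 1_{[0,\tau(\varphi))}(t), \qquad \varphi\in\mathcal X,\ t\in[0,T].
\]
% By definition of the \tau-topology, the map \pi \mapsto \int f_t\, d\pi is \tau-continuous, so
\[
  \pi^{n} \to \pi \ \text{ in } \tau(\mathcal P(\mathcal X))
  \quad\Longrightarrow\quad
  \int_{\mathcal X} f_t \, d\pi^{n} \longrightarrow \int_{\mathcal X} f_t\, d\pi ,
\]
% i.e. the surviving mass at time t converges with no condition on the discontinuity set of f_t.
% Under weak convergence the same conclusion requires \pi(\mathbb D_\tau(t)) = 0.
```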

Moreover, we denote by $w(\mathcal{P}(\mathcal{X}))$ the weak topology on $\mathcal{P}(\mathcal{X})$ and by $\mathcal{B}(\mathcal{P}(\mathcal{X}))$ the Borel $\sigma$ -algebra on $\mathcal{P}(\mathcal{X})$ generated by the open sets of the weak topology. The following lemma adapts Theorem 2.6.1-2 in [42] to our framework, in particular to the case of diffusions with possibly unbounded drift.

Lemma 5.3 (Propagation of chaos).

Grant Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2). Let $\theta^{*}\in\mathcal{P}(\mathcal{X})$ be the unique McKean-Vlasov solution of Eq.(5.11). Then the sequence $(\zeta^{N})_{N\in\mathbb{N}}$ converges in law to $\theta^{*}$ , i.e. $\zeta^{N}\overset{\mathcal{L}}{\longrightarrow}\theta^{*}$ , as $N\rightarrow\infty$ . Moreover

[TABLE]

for all open neighbourhoods $B$ of $\theta^{*}$ in the $\tau$ -topology that are in $\mathcal{B}(\mathcal{P}(\mathcal{X}))$ .

Proof.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space that supports an i.i.d. sequence $(X^{i})_{i\in\mathbb{N}}$ of $\mathcal{X}$ -valued random variables with law $\theta^{*}$ . For each $N\in\mathbb{N}$ , let $(\mathcal{F}^{N}_{t})_{t\in[0,T]}$ be the filtration generated by $X^{1},\ldots,X^{N}$ . Define

[TABLE]

In particular, $W^{1},\ldots,W^{N}$ are independent Wiener processes on $(\Omega,\mathcal{F},(\mathcal{F}^{N}_{t})_{t\in[0,T]},\mathbb{P})$ . Fix $N\in\mathbb{N}$ , and consider the tuple $(\Omega,\mathcal{F},(\mathcal{F}^{N}_{t})_{t\in[0,T]},\mathbb{P},(X^{N,1},\ldots,X^{N,N}),(W^{1},\ldots,W^{N}))$ , with $X^{N,i}\doteq X^{i}$ , for all $i\in\{1,\ldots,N\}$ . This is a weak solution of

[TABLE]

Now, define the probability $\mathbb{P}^{N}$ via its density with respect to $\mathbb{P}$ , $\frac{d\mathbb{P}^{N}}{d\mathbb{P}}\doteq Z^{N}_{T}$ , where, for all $t\in[0,T]$

[TABLE]

A standard application of Girsanov’s theorem gives

[TABLE]

for some $\mathbb{P}^{N}$ -Wiener process $W^{N}$ . Notice that $(\Omega,\mathcal{F},(\mathcal{F}^{N}_{t})_{t\in[0,T]},\mathbb{P}^{N},X^{N},W^{N})$ is a weak solution of the $N$ -particle system in Eq.s (5.9) and (5.10), with $\beta(t,\varphi^{N})\doteq\lambda(t,\varphi^{N,1})$ for $t\in[0,T]$ and $\varphi^{N}\in\mathcal{X}^{N}$ .

At this point, the rest of the proof can be performed as in [42], Theorem 2.6.1-2, along the following steps:

(i)

Show that $F_{t_{1},t_{2}}:\mathcal{P}(\mathcal{X})\rightarrow\mathbb{R}$ defined as

[TABLE]

is $\tau$ -continuous and $\mathcal{B}(\mathcal{P}(\mathcal{X}))$ -measurable for all $t_{1},t_{2}\in[0,T]$ , $t_{1}<t_{2}$ , which is proved separately at the end of this proof. Moreover, $F_{t_{1},t_{2}}(\theta)\leq\tilde{L}(t_{2}-t_{1})\mathcal{H}(\theta|\theta^{*})$ for all $t_{1},t_{2}\in[0,T]$ , $t_{1}<t_{2}$ and for all $\theta\in\mathcal{P}(\mathcal{X})$ , which is a straightforward consequence of the Lipschitz continuity in the total variation distance.

(ii)

Since $X^{N,1},X^{N,2},\ldots X^{N,N}$ are i.i.d. under $\mathbb{P}$ , Sanov’s Theorem (e.g. Theorem 6.2.10 in Dembo and Zeitouni [21]) can be applied to $\mathbb{P}\circ(\zeta^{N})^{-1}$ .

(iii)

Derive a large deviation principle for $\mathbb{P}^{N}\circ(\zeta^{N})^{-1}$ , precisely

[TABLE]

for all open neighbourhoods $B$ of $\theta^{*}$ in the $\tau$ -topology that are in $\mathcal{B}(\mathcal{P}(\mathcal{X}))$ , for some constant $\tilde{L}>0$ .

To this aim, we stress that we can proceed just as in [42]: precisely, we can show by induction that Eq.(4.1) in [42] holds also in this case, and then conclude by observing that $\mathbb{P}^{N}$ and $\mathbb{P}$ agree on $\mathcal{F}_{0}$ . Indeed, regardless of the sub-linear growth of the drift, we can adapt Lacker’s estimates thanks to

[TABLE]

Moreover we can apply Varadhan’s integral lemma [21, Theorem 4.3.1] thanks to the continuity of $F_{t_{1},t_{2}}$ .

(iv)

Conclude by showing that $\inf_{\theta\not\in B}\mathcal{H}(\theta|\theta^{*})>0$ so that

[TABLE]

which can be performed as in [42].

Proof of the continuity of $F_{t_{1},t_{2}}$ in the $\tau$ -topology. We actually prove the stronger claim that the functional $F_{t_{1},t_{2}}$ in Eq.(5.12) is continuous in the weak topology ( $w$ -topology for short). First, we can write $F_{t_{1},t_{2}}(\theta)=\int_{\mathcal{X}}f_{t_{1},t_{2}}(\varphi,\theta)\theta(d\varphi)$ for $\theta\in\mathcal{P}(\mathcal{X})$ , where

[TABLE]

which is a real-valued bounded measurable function defined on $\mathcal{X}\times\mathcal{P}(\mathcal{X})$ . Let $(\theta^{n})_{n\in\mathbb{N}},\theta\in\mathcal{P}(\mathcal{X})$ be such that $\theta^{n}\overset{w}{\rightharpoonup}\theta$ . We want to show that $F_{t_{1},t_{2}}(\theta^{n})\rightarrow F_{t_{1},t_{2}}(\theta)$ as $n\to\infty$ .

Set $f_{n}(\varphi)\doteq f_{t_{1},t_{2}}(\varphi,\theta^{n})$ and $f(\varphi)\doteq f_{t_{1},t_{2}}(\varphi,\theta)$ . They are all in $C_{b}(\mathcal{X})$ with uniform bound in $n\in\mathbb{N}$ . Moreover, $f_{n}\rightarrow f$ in the sup-norm. Indeed

[TABLE]

which vanishes in the limit for $n\rightarrow\infty$ due to Lemma 5.1. As a consequence, we obtain

[TABLE]

∎

5.4 Proof of the $N$ -player approximation theorem

This section is devoted to the construction of approximate Nash equilibria for the $N$ -player game from a solution of the limit problem, in the particular case of finite-dimensional interaction as described before. The results of Subsection 5.3 allow us to pass to the many-player limit even if feedback MFG strategies are discontinuous in the state variable. We have observed in the introduction that the construction of approximate Nash equilibria for the $N$ -player games in [9] was crucially based on the continuity of the limit optimal control for almost every path of the state variable with respect to the Wiener measure. In our setting, such a regularity property is no longer available due to the possible unboundedness of the coefficients, which makes it difficult to apply PDE-based estimates as in [9] to get the needed continuity. Therefore, in order to overcome this obstacle, we will use the strong form of propagation of chaos in Lemma 5.3, which allows us to pass to the limit even through possibly discontinuous MFG optimal controls.

In this part, we consider the dynamics in Eq.(5.9) and Eq.(5.10) without necessarily taking $\beta=\lambda$ , unless otherwise specified. We start with some preliminary estimates ensuring that the costs remain bounded in the mean-field limit despite the sub-linear growth.

Lemma 5.4 (A-priori estimates).

Grant Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2). Consider the dynamics in Eq.s (5.9) and (5.10). Then for any $\alpha\geq 1$

[TABLE]

for $i\in\{1,\ldots,N\}$ and where $K(\alpha)<\infty$ is a positive constant independent of $N$ .

Proof.

This is a consequence of Grönwall’s lemma together with uniform boundedness of the drift in the measure and control variables. ∎
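The Grönwall step can be sketched as follows. The sketch assumes dynamics of the form $X_{t}=X_{0}+\int_{0}^{t}b\,ds+\sigma W_{t}$ with a drift bounded by $C(1+\sup_{r\leq s}|\varphi(r)|)$ uniformly in the measure and control variables, and an initial law with finite moments of order $\alpha$ ; these are stated here as assumptions, since they are not displayed in this section.

```latex
% Assumptions: X_t = X_0 + \int_0^t b\,ds + \sigma W_t, \quad |b(s,\varphi,\theta,u)| \le C\big(1+\sup_{r\le s}|\varphi(r)|\big).
\begin{align*}
\sup_{r\le t}|X_r|
 &\le |X_0| + CT + |\sigma|\,\|W\|_\infty + C\int_0^t \sup_{r\le s}|X_r|\,ds, \qquad t\in[0,T],\\
\|X\|_\infty
 &\le \big(|X_0| + CT + |\sigma|\,\|W\|_\infty\big)\,e^{CT}
 \qquad \text{(Gr\"onwall's lemma)},\\
\mathbb E\big[\|X\|_\infty^{\alpha}\big]
 &\le 3^{\alpha-1}e^{\alpha CT}\Big(\mathbb E\big[|X_0|^{\alpha}\big] + (CT)^{\alpha} + |\sigma|^{\alpha}\,\mathbb E\big[\|W\|_\infty^{\alpha}\big]\Big).
\end{align*}
```

The last line uses $(a+b+c)^{\alpha}\leq 3^{\alpha-1}(a^{\alpha}+b^{\alpha}+c^{\alpha})$ for $\alpha\geq 1$ . Since the constant $C$ does not depend on the empirical measure or on the control, the right-hand side is independent of $N$ and of the player index.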

Now, we prove the tightness of the sequence of laws $(\mathbb{P}^{N}\circ(\zeta^{N})^{-1})_{N\in\mathbb{N}}$ when $\beta=\lambda$ in Eq.(5.9), i.e. when the dynamics are symmetric. Then, thanks to Lemma 5.3, we characterize the limit points of $(\mathbb{P}^{N}\circ(\zeta^{N})^{-1})_{N\in\mathbb{N}}$ as McKean-Vlasov solutions of Eq.(5.11); see Lemma 5.6.

Lemma 5.5 (Tightness).

Grant Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2). Let $\zeta^{N}$ be the empirical measure of the system given by Eq.s (5.9) and (5.10) with $\beta=\lambda$ . Then the sequence $(\mathbb{P}^{N}\circ(\zeta^{N})^{-1})_{N\in\mathbb{N}}$ is tight in $\mathcal{P}(\mathcal{P}(\mathcal{X}))$ .

Proof.

The tightness of such a sequence follows from [57], Proposition 2.2, combined with Kolmogorov-Chentsov criterion (see, for instance, Corollary 14.9 in Kallenberg [38]). ∎

Lemma 5.6 (Characterization of limit points).

Grant Assumptions (H1’)-(H3’), (H4)-(H8) and (N1)-(N2). Let $\zeta^{N}$ be the empirical measure of the system given by Eq.s (5.9) and (5.10) with $\beta=\lambda$ . Let $(\mathbb{P}^{N_{k}}\circ(\zeta^{N_{k}})^{-1})_{k\in\mathbb{N}}$ be a convergent subsequence of $(\mathbb{P}^{N}\circ(\zeta^{N})^{-1})_{N\in\mathbb{N}}$ . Let $\zeta$ be a random variable defined on some probability space $(\Omega,\mathcal{F},\mathbb{P})$ with values in $\mathcal{P}(\mathcal{X})$ such that $\zeta^{N_{k}}\overset{\mathcal{L}}{\longrightarrow}\zeta$ . Then

(i)

$\zeta$ coincides $\mathbb{P}$ -a.s. with the unique McKean-Vlasov solution $\theta^{*}$ of Eq.(5.11).

(ii)

The sequence $(\zeta^{N})_{N\in\mathbb{N}}$ converges in probability (hence also in law) to $\theta^{*}$ when $\mathcal{P}(\mathcal{X})$ is equipped with the $\tau$ -topology.

Proof.

By Lemma 5.5 there exists a subsequence $(\mathbb{P}^{N_{k}}\circ(\zeta^{N_{k}})^{-1})_{k\in\mathbb{N}}\subset\mathcal{P}(\mathcal{P}(\mathcal{X}))$ converging to $\mathbb{P}\circ\zeta^{-1}\in\mathcal{P}(\mathcal{P}(\mathcal{X}))$ . Lemma 5.3 guarantees the convergence in law of the whole sequence $(\zeta^{N})_{N\in\mathbb{N}}$ to the deterministic limit $\theta^{*}$ , which is the unique McKean-Vlasov solution of Eq.(5.11). By uniqueness in law of the weak limit we have $\mathbb{P}\circ\zeta^{-1}=\delta_{\theta^{*}}$ , yielding $\zeta=\theta^{*}$ $\mathbb{P}$ -a.s. Lemma 5.3 also gives convergence in probability in the $\tau$ -topology of $(\zeta^{N})_{N\in\mathbb{N}}$ to $\theta^{*}$ . ∎

Corollary 5.3 (Characterization of the convergence).

Under the assumptions of Lemma 5.6, the following properties hold:

(i)

For every Borel-measurable bounded function $f:\mathcal{X}\rightarrow\mathbb{R}$ such that $\theta\mapsto\int_{\mathcal{X}}f(\varphi)\theta(d\varphi)$ is $\tau(\mathcal{P}(\mathcal{X}))$ -continuous

[TABLE]

(ii)

$\mathbb{P}^{N}\circ(X^{N,1},\zeta^{N})^{-1}\overset{w}{\rightharpoonup}\theta^{*}\otimes\delta_{\theta^{*}}$ . Moreover, $\mathbb{P}^{N}\circ(X^{N,1})^{-1}\overset{w}{\rightharpoonup}\theta^{*}$ and $\mathbb{P}^{N}\circ(\zeta^{N})^{-1}\overset{w}{\rightharpoonup}\delta_{\theta^{*}}$ .

(iii)

For all $f\in C(\mathcal{X})$ with sub-linear growth, i.e. $|f(\varphi)|\leq C_{f}(1+\|\varphi\|_{\infty})$ for some $C_{f}>0$ and all $\varphi\in\mathcal{X}$ , we have

[TABLE]

Proof.

(i) This is a consequence of Lemma 5.3, Lemma 5.6 and of the almost sure equality $\zeta=\theta^{*}$ .

(ii) We already know that $\mathbb{P}^{N}\circ(\zeta^{N})^{-1}\overset{w}{\rightharpoonup}\delta_{\theta^{*}}$ from Lemma 5.6. Therefore, the convergence of $\mathbb{P}^{N}\circ(X^{N,1})^{-1}$ to $\theta^{*}$ follows from [57], Proposition 2.2, and the symmetry of the system.

(iii) Let $f\in C(\mathcal{X})$ with sub-linear growth. It is enough to show that

[TABLE]

To this aim, for fixed $R>0$ , we consider the decomposition

[TABLE]

By property (i), for any fixed $R>0$ , we have

[TABLE]

so that

[TABLE]

Now, we let $R\rightarrow\infty$ and we show that the RHS vanishes in the limit. To do so, recall that, due to Lemma 5.4, there exist constants $K(\alpha),K>0$ such that

[TABLE]

independently of $i\in\{1,\ldots,N\}$ . Then, set $\alpha,\beta>1$ such that $\frac{1}{\alpha}+\frac{1}{\beta}=1$ and let $\epsilon>0$ . By definition of $\zeta^{N}$ and by Young’s and Markov’s inequalities, we have

[TABLE]

which converges to zero by letting $R\to\infty$ and then $\epsilon\to 0$ . A similar reasoning applies to the same expectation with $\theta^{*}$ instead of $\zeta^{N}$ . ∎
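The tail estimate behind part (iii) can be sketched as follows, in a variant that uses Hölder’s inequality in place of Young’s; this substitution and the exact constants are choices made for this illustration, while the uniform moment bound is the one provided by Lemma 5.4.

```latex
% Conjugate exponents 1/\alpha + 1/\beta = 1; sub-linear growth |f(\varphi)| \le C_f(1+\|\varphi\|_\infty).
\begin{align*}
\mathbb E^{\mathbb P^{N}}\Big[\int_{\mathcal X} |f(\varphi)|\,\mathbf 1_{\{\|\varphi\|_\infty>R\}}\,\zeta^{N}(d\varphi)\Big]
 &= \frac1N\sum_{i=1}^{N}\mathbb E^{\mathbb P^{N}}\big[|f(X^{N,i})|\,\mathbf 1_{\{\|X^{N,i}\|_\infty>R\}}\big],\\
\mathbb E\big[|f(X)|\,\mathbf 1_{\{\|X\|_\infty > R\}}\big]
 &\le C_f\,\mathbb E\big[(1+\|X\|_\infty)^{\alpha}\big]^{1/\alpha}\,\mathbb P\big(\|X\|_\infty > R\big)^{1/\beta}
 && \text{(H\"older)}\\
 &\le C_f\,\mathbb E\big[(1+\|X\|_\infty)^{\alpha}\big]^{1/\alpha}\,\Big(\frac{\mathbb E\big[\|X\|_\infty^{\alpha}\big]}{R^{\alpha}}\Big)^{1/\beta}
 && \text{(Markov)}.
\end{align*}
```

Because the moments of order $\alpha$ are bounded uniformly in $N$ and in the player index, the right-hand side tends to zero as $R\rightarrow\infty$ uniformly in $N$ , which is what the truncation argument requires.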

Remark 5.1.

Let $\mathbb{D}\doteq\{\varphi\in\mathcal{X}:\tau\text{ is discontinuous at }\varphi\}$ . Since $\zeta\overset{a.s.}{=}\theta^{*}\in\mathcal{Q}$ , Lemma A.4 implies $\theta^{*}(\mathbb{D})=0$ and the statement of Corollary 5.3 holds for $f=\mathbf{1}_{\mathbb{D}}$ as well.

Finally, we conclude this section with the proof of Theorem 5.1, which leads immediately to Corollary 5.2.

Proof of Theorem 5.1.

The proof is structured in three steps.

(j)

$\lim_{N\rightarrow\infty}J^{N,1}(\boldsymbol{\lambda}^{N})=J^{\mu}(\lambda)$ .

(jj)

Let $\beta^{N,1}\in\mathcal{U}^{N}_{1}$ be such that

[TABLE]

Then

[TABLE]

(jjj)

$J^{N,1}(\boldsymbol{\lambda}^{N})\leq\inf_{\beta\in\mathcal{U}^{N}_{1}}J^{N,1}([\boldsymbol{\lambda}^{N,-1},\beta])+\epsilon$ .

We consider the dynamics in Eq.(5.6). In (j) we set $\lambda^{N,1}(t,\varphi^{N})=\lambda(t,\varphi^{N,1})$ for all $(t,\varphi^{N})\in[0,T]\times\mathcal{X}^{N}$ and prove convergence of the first-player cost functional to the cost functional of the MFG. In (jj) instead we allow the first player to deviate and choose $\lambda^{N,1}(t,\varphi^{N})=\beta^{N,1}(t,\varphi^{N})$ for all $(t,\varphi^{N})\in[0,T]\times\mathcal{X}^{N}$ where $\beta^{N,1}\in\tilde{\mathcal{U}}^{N}_{1}$ is a generic single-player relaxed control. We conclude the proof in (jjj) by combining the results in (j) and (jj).

Proof of (j). To prove that $J^{N,1}(\boldsymbol{\lambda}^{N})\rightarrow J^{\mu}(\lambda)$ , as $N\rightarrow\infty$ , we split each cost functional in the sum of two terms:

[TABLE]

and

[TABLE]

Since $f_{0}$ is bounded, the convergence of the first summand in the decomposition of $J^{N,1}(\boldsymbol{\lambda}^{N})$ to the corresponding term in $J^{\mu}(\lambda)$ is a consequence of Corollary 5.3(i) and of Lemma 5.6. On the other hand, since both $f_{1}$ and $F$ have sub-linear growth, the convergence of the second summand in $J^{N,1}(\boldsymbol{\lambda}^{N})$ follows from Corollary 5.3(iii), Lemma 5.6 and the fact that $\theta^{*}\in\mathcal{Q}$ together with Lemma A.5.

Proof of (jj). We follow the proof of Theorem 3.10 in [43] with suitable modifications due to the possibly unbounded drift and the dependence on the first exit time from the set $\mathcal{O}$ .

Let $(\Omega^{N},\mathcal{F}^{N},(\mathcal{F}^{N}_{t})_{t\in[0,T]},\mathbb{Q}^{N},Y^{N},W^{N})_{N\in\mathbb{N}}$ be weak solutions of the $N$ -player system. Let $(\zeta^{N})_{N\in\mathbb{N}}$ be the associated empirical measures. Under $\mathbb{Q}^{N}$ the first player’s dynamics is

[TABLE]

Now, let $\mathbb{P}^{N}$ be the probability measure under which the first player’s dynamics becomes

[TABLE]

where $\tilde{W}^{N,1}$ is a $\mathbb{P}^{N}$ -Wiener process. In other terms, $\mathbb{P}^{N}$ satisfies $\frac{d\mathbb{Q}^{N}}{d\mathbb{P}^{N}}=Z^{N}_{T}$ where

[TABLE]

By inspection of the proofs of Lemma A.1 and Corollary A.1, all bounds are uniform in $N\in\mathbb{N}$ , hence Corollary A.1 gives the uniform integrability of the sequence of exponential martingales $(Z^{N})_{N\in\mathbb{N}}$ . More in detail, we apply Corollary A.1 to the drift

[TABLE]

for $(t,\varphi^{N})\in[0,T]\times\mathcal{X}^{N}$ . Notice that this drift is sublinear in $\varphi^{N}$ . Therefore convergence of the empirical measures to $\theta^{*}$ in probability in the $\tau$ -topology under $\mathbb{P}^{N}$ implies convergence of the empirical measures to the same limit in probability in the $\tau$ -topology under $\mathbb{Q}^{N}$ . Hence $\zeta_{Y}^{N}\overset{\mathcal{L}}{\longrightarrow}\theta^{*}$ under $\mathbb{Q}^{N}$ and

[TABLE]

for all neighbourhoods $B$ of $\theta^{*}$ in the $\tau$ -topology which belong to $\mathcal{B}(\mathcal{P}(\mathcal{X}))$ . The tightness of $(Y^{N,1})_{N\in\mathbb{N}}$ under $\mathbb{Q}^{N}$ still follows from their tightness under $\mathbb{P}^{N}$ . Consider $(\beta^{N,1}(t,\textbf{Y}^{N}))_{t\in[0,T]}$ as a single-player relaxed stochastic open-loop control and denote it simply by $(\beta^{N,1}_{t})_{t\in[0,T]}$ . Interpret $(Y^{N,1},\beta^{N,1},\zeta^{N}_{Y})_{N\in\mathbb{N}}$ as a sequence of random variables with values in $\mathcal{X}\times\mathcal{V}\times\mathcal{P}(\mathcal{X})$ . Compactness of $\mathcal{V}$ and tightness of $(Y^{N,1},\zeta^{N}_{Y})_{N\in\mathbb{N}}$ imply the tightness of $(Y^{N,1},\beta^{N,1},\zeta^{N}_{Y})_{N\in\mathbb{N}}$ under $\mathbb{Q}^{N}$ .

Let $(Y,\beta,\theta^{*})$ be a limit point of the sequence $(Y^{N,1},\beta^{N,1},\zeta^{N}_{Y})_{N\in\mathbb{N}}$ , defined on some probability space with probability measure $\mathbb{Q}$ . Then by a standard martingale argument it can be shown to satisfy

[TABLE]

where $W$ is a $\mathbb{Q}$ -Wiener process. As in (j) we split $J^{N,1}([\lambda^{N,-1},\beta^{N,1}])$ in two terms as

[TABLE]

We move along a weakly converging subsequence of $(Y^{N,1},\beta^{N,1},W^{N,1})_{N\in\mathbb{N}}$ under $\mathbb{Q}^{N}$ to the limit point $(Y,\beta,W)$ in Eq.(5.14). Convergence of the first and second summands above now works as in the proof of (j). Considering again the whole sequence, we obtain

[TABLE]

where the infimum on the RHS above is taken over all relaxed stochastic open-loop controls and the last equality follows from embedding the set of strict controls into the set of relaxed controls combined with the chattering lemma [23, 25, 3].

Proof of (jjj). This is a consequence of steps (j) and (jj). Indeed

[TABLE]

Now by steps (j) and (jj) there exists $N^{\epsilon}\in\mathbb{N}$ such that for all $N\geq N^{\epsilon}$

[TABLE]

Therefore, we can conclude that $J^{N,1}(\boldsymbol{\lambda}^{N})\leq\inf_{\beta\in\mathcal{U}^{N}_{1}}J^{N,1}([\boldsymbol{\lambda}^{N,-1},\beta])+\epsilon$ for all $N\geq N^{\epsilon}$ , which establishes the statement of Theorem 5.1. ∎
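The way steps (j) and (jj) combine can be summarised by the chain below. The split of $\epsilon$ into three equal parts and the labels (a)-(c) are assumptions made for this sketch; the proof may distribute the error differently.

```latex
% Assumed quantitative consequences, valid for all N \ge N^{\epsilon}:
%  (a) J^{N,1}(\lambda^N) \le J^{\mu}(\lambda) + \epsilon/3                                   [step (j)]
%  (b) J^{\mu}(\lambda) \le J^{N,1}([\lambda^{N,-1},\beta^{N,1}]) + \epsilon/3                [step (jj)]
%  (c) J^{N,1}([\lambda^{N,-1},\beta^{N,1}]) \le \inf_{\beta} J^{N,1}([\lambda^{N,-1},\beta]) + \epsilon/3
%                                                                [near-optimal choice of \beta^{N,1}]
\begin{align*}
J^{N,1}(\boldsymbol\lambda^{N})
 \;\le\; J^{\mu}(\lambda) + \tfrac{\epsilon}{3}
 \;\le\; J^{N,1}([\boldsymbol\lambda^{N,-1},\beta^{N,1}]) + \tfrac{2\epsilon}{3}
 \;\le\; \inf_{\beta} J^{N,1}([\boldsymbol\lambda^{N,-1},\beta]) + \epsilon .
\end{align*}
```

By symmetry of the system the same bound holds for every player $i\in\{1,\ldots,N\}$ , not only for the first one.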

Appendix A

This appendix provides some of the technical results used in the paper. In more detail, we state existence and uniqueness of weak solutions of SDEs with sub-linear drift. We characterize the space of laws of processes with sub-linear drift and initial condition $\nu$ ( $\mathcal{Q}$ defined below). We prove some regularity results on the exit time $\tau^{X}$ with respect to measures in $\mathcal{Q}$ . Finally, we discuss the convergence of measures in the 1-Wasserstein distance along test functions with sub-linear growth that are possibly discontinuous on a set of measure zero under the limit measure.

A.1 Existence and uniqueness of solution of SDEs with sub-linear drift

In this subsection we prove a slight variation of the well-known Beneš’ condition (Beneš [4]), leading to an existence and uniqueness result for weak solutions of SDEs with a sub-linear drift. More precisely, we allow the drift to depend on a rescaled Wiener process with an independent random initial condition. We recall that $\mathcal{E}_{t}(\cdot)$ denotes the Doléans-Dade stochastic exponential. Moreover, given a function $f:E\rightarrow\mathbb{R}$ where $E$ is a Polish space, we denote by $\mathbb{D}_{f}$ the set of its discontinuity points.

As a preliminary, we introduce the set $\mathcal{Q}$ of laws of stochastic processes with sub-linear drift in the sense of Beneš to which these results apply.

Laws of processes with sub-linear drift. Let $\beta:[0,T]\times\mathcal{X}\rightarrow\mathbb{R}^{d}$ be a progressively measurable functional such that

[TABLE]

for some constant $C>0$ . Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X)$ be a weak solution of the following SDE

[TABLE]

where $W$ is a Wiener process independent of $\xi$. Existence and uniqueness of a weak solution follow from an application of Girsanov’s theorem and Beneš’ condition (see Lemma A.1 and Lemma A.2). Moreover, the laws of such solutions turn out to be absolutely continuous with respect to the Wiener measure $\mathcal{W}^{\nu}$ (Lemma A.3). We denote by $\mathcal{Q}$ the set of laws $\theta\in\mathcal{P}(\mathcal{X})$ of all continuous processes $X$ solving the SDE above.

Lemma A.1 (Beneš’ condition).

Let $b:[0,T]\times\mathcal{X}\rightarrow\mathbb{R}^{d}$ be a progressively measurable functional such that

[TABLE]

Let $\sigma\in\mathbb{R}^{d\times d}$ be a full rank matrix. Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P})$ be a filtered probability space satisfying usual conditions, supporting a random variable $\xi\overset{d}{\sim}\nu$ and a Wiener process $W$ independent of $\xi$ . Set

[TABLE]

Then

[TABLE]

is a martingale.

Proof.

We follow the proof of Corollary 3.5.16 in [39]. Precisely, let $t_{0}=0<t_{1}<\ldots<t_{n-1}<t_{n}=T$ be a partition of the interval $\left[0,T\right]$. Then, thanks to the sub-linearity of the drift,

[TABLE]

Let $Y^{n}\doteq(Y^{n}_{t})_{t\in[0,T]}$ be defined by

[TABLE]

Notice that $Y^{n}$ is a sub-martingale and that by Doob’s maximal inequality [39, Theorem 1.3.8.iv] we have $\mathbb{E}[\|Y^{n}\|_{\infty}^{2}]\leq 4\mathbb{E}[(Y^{n}_{T})^{2}]$ . Moreover

[TABLE]

where in the equality we have used the independence between $\xi$ and $W$ . To conclude, it is sufficient to choose $(t_{k}-t_{k-1})$ , $k=1,\ldots,n$ , sufficiently small, for instance $(t_{k}-t_{k-1})<\min\{\frac{1}{2C^{2}|\sigma|^{2}},\frac{\lambda}{C^{2}}\}$ , and to apply Corollary 3.5.14 in [39]. ∎
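The mesh condition at the end of the proof can be met by a uniform partition. The following is a minimal sketch, not part of the original argument: the function name `uniform_partition` and the numerical values of $T$, $C$, $|\sigma|$ and $\lambda$ are hypothetical, and $\lambda$ is taken as a given positive constant since its definition sits in a display not reproduced here.

```python
import math

def uniform_partition(T, C, sigma_norm, lam):
    """Return grid points 0 = t_0 < ... < t_n = T whose mesh is strictly
    below min{1 / (2 C^2 |sigma|^2), lam / C^2}."""
    bound = min(1.0 / (2.0 * C**2 * sigma_norm**2), lam / C**2)
    # n > T / bound, hence the uniform mesh T / n is strictly below the bound
    n = math.floor(T / bound) + 1
    return [k * T / n for k in range(n + 1)]

# Hypothetical constants, chosen only to exercise the helper.
grid = uniform_partition(T=1.0, C=2.0, sigma_norm=1.0, lam=0.5)
mesh = max(b - a for a, b in zip(grid, grid[1:]))
```

Any finer partition works equally well; the uniform choice is only the simplest one that satisfies the strict inequality.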

Corollary A.1 (Moments of the stochastic exponential).

Under the assumptions of Lemma A.1, the process $Z=(Z_{t})_{t\in[0,T]}$ has finite moments of any order $p\in[1,\infty)$ , i.e. $\mathbb{E}\left[Z_{T}^{p}\right]<\infty$ for all $p\in[1,\infty)$ .

Proof.

The proof follows directly from Lemma A.1 combined with Corollary 2 in [31]. ∎

Lemma A.2 (Existence and uniqueness of weak solutions).

Let $b:[0,T]\times\mathcal{X}\rightarrow\mathbb{R}^{d}$ be a progressively measurable functional such that

[TABLE]

Let $\sigma\in\mathbb{R}^{d\times d}$ be a full rank matrix. Then there exists a weak solution $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},\mathbb{P},X,W)$ of

[TABLE]

Additionally, this solution is unique in law.

Proof.

The proof follows directly from Lemma A.1 and Girsanov’s theorem [see 39, Propositions 5.3.6 and 5.3.10]. ∎
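Lemma A.2 is obtained by a change of measure and does not provide a simulation scheme. Purely as an illustration of what a path with path-dependent, sub-linear drift looks like, the sketch below uses an Euler-Maruyama discretization, which is a different technique from the Girsanov construction of the proof. It assumes $d=1$ and an equation of the form $dX_{t}=b(t,X)\,dt+\sigma\,dW_{t}$, $X_{0}=\xi\sim\nu$; the drift `running_max_drift`, the Gaussian initial law and all numerical values are hypothetical.

```python
import math
import random

def euler_maruyama(b, sigma, x0, T, n, rng):
    """Simulate one path of dX = b(t, X) dt + sigma dW on a uniform grid.
    The drift b receives the current time and the path simulated so far."""
    dt = T / n
    path = [x0]
    for k in range(n):
        t = k * dt
        dw = rng.gauss(0.0, math.sqrt(dt))  # Wiener increment over [t, t + dt]
        path.append(path[-1] + b(t, path) * dt + sigma * dw)
    return path

def running_max_drift(t, path):
    # Hypothetical drift with |b(t, phi)| <= 1 + sup_{s <= t} |phi(s)|
    return -max(abs(x) for x in path)

rng = random.Random(0)
x0 = rng.gauss(1.0, 0.5)  # hypothetical initial law: Gaussian, mean 1, std 0.5
path = euler_maruyama(running_max_drift, sigma=0.3, x0=x0, T=1.0, n=200, rng=rng)
```

The drift depends on the whole past of the path through its running maximum, which is the kind of progressively measurable functional the lemma allows.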

A.2 Characterization of the set $\mathcal{Q}$

Lemma A.3 (Laws of processes with sub-linear drift).

Let $\theta\in\mathcal{Q}$ . Then $\theta\sim\mathcal{W}^{\nu}$ , i.e. $\theta$ is equivalent to the Wiener measure $\mathcal{W}^{\nu}$ .

Proof.

The proof follows directly from Lemma A.1 and Girsanov’s theorem, together with Bayes’ rule, which ensures that $Z^{-1}$, with $Z$ given by Lemma A.1, is still a martingale. ∎

Before proceeding further, we recall that $\tau^{X}$ is the first exit time from $\mathcal{O}$ in the path space, i.e.

[TABLE]

where $\mathcal{O}\subset\mathbb{R}^{d}$ satisfies Assumption (H4).

Lemma A.4 (Regularity results).

Let $\theta\in\mathcal{Q}$ . Let $\mathcal{O}\subset\mathbb{R}^{d}$ satisfy Assumption (H4) and let $X$ be the identity process on $\mathcal{X}$ . Then

(a)

$\tau^{X}<\infty$ , $\theta$ -almost surely.

(b)

The mapping $\varphi\mapsto\tau^{X}(\varphi)$ , from $\mathcal{X}$ to $[0,\infty]$ , is $\theta$ -a.s. continuous.

(c)

$\theta(\tau^{X}=t)=0$ for all $t\in[0,T]$.

(d)

The mapping $\varphi\mapsto\mathbf{1}_{[0,\tau^{X}(\varphi))}(t)$ , from $\mathcal{X}$ to $\mathbb{R}$ , is $\theta$ -a.s. continuous for all $t\in[0,T]$ .

(e)

Properties (a)-(d) hold for $\mathcal{O}=(0,\infty)^{\times d}$ as well.

Proof.

The proof is similar to the one of Lemma D.3 in [9]. Notice that by Lemma A.3 each $\theta\in\mathcal{Q}$ is equivalent to $\mathcal{W}^{\nu}$ . So, it is sufficient to check properties (a)-(d) for $\mathcal{W}^{\nu}$ .

(a) This is a consequence of the law of the iterated logarithm (as time tends to infinity) and the fact that $\mathcal{O}$ is strictly included in $\mathbb{R}^{d}$.

(b) This, again, is a consequence of the law of the iterated logarithm (as time tends to zero), the smoothness of the boundary of $\mathcal{O}$, the non-degeneracy of $\sigma$ and the fact that $\mathcal{O}$ is strictly included in $\mathbb{R}^{d}$ (Kushner and Dupuis [40], pp. 260-261).

(c) This is a consequence of the following relations

[TABLE]

where in the last equality we use the fact that the boundary of a convex subset of $\mathbb{R}^{d}$ has Lebesgue measure zero (Lang [44]), and that $\mathcal{W}^{\nu}\circ X_{t}^{-1}$ is absolutely continuous with respect to the Lebesgue measure for all $t\in[0,T]$.

(d) This is a consequence of properties (b) and (c) above.

(e) When $\mathcal{O}=(0,\infty)^{\times d}$ it turns out that

[TABLE]

where $\tau^{i}(\varphi)\doteq\inf\{t\in[0,T]:\varphi_{i}(t)\leq 0\}$ , for $i\in\{1,\ldots,d\}$ and $\varphi\in\mathcal{X}$ . Then the conclusion follows from the continuity result in dimension $d=1$ (Kushner and Dupuis [40], pp. 260-261) applied to each $\tau^{i}$ . ∎
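The reduction in part (e) rests on the fact that a path leaves the open orthant $(0,\infty)^{\times d}$ exactly when one of its coordinates first becomes non-positive, so the exit time is the minimum of the coordinate-wise times $\tau^{i}$. The sketch below checks this on a discretized path; it is only an illustration on a finite time grid, the function names and the sample path are hypothetical, and the case of a path that never exits is represented by `None` rather than by a convention for the infimum of the empty set.

```python
def first_exit_time(times, values):
    """First grid time at which a scalar path is <= 0, or None if it never is."""
    for t, v in zip(times, values):
        if v <= 0.0:
            return t
    return None

def orthant_exit_time(times, path):
    """First grid time at which some coordinate of the path is <= 0."""
    for t, point in zip(times, path):
        if any(v <= 0.0 for v in point):
            return t
    return None

def min_coordinate_exit_time(times, path):
    """Minimum over coordinates i of the scalar exit times tau^i."""
    d = len(path[0])
    exits = [first_exit_time(times, [p[i] for p in path]) for i in range(d)]
    finite = [t for t in exits if t is not None]
    return min(finite) if finite else None

# Hypothetical two-dimensional path sampled on a grid of [0, 1].
times = [0.0, 0.25, 0.5, 0.75, 1.0]
path = [(1.0, 2.0), (0.5, 1.0), (0.2, -0.1), (-0.3, 0.4), (0.1, 0.2)]
```

On this sample path the second coordinate becomes non-positive at time 0.5 and the first at time 0.75, so both computations return 0.5.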

A.3 Additional convergence results

Lemma A.5 (Convergence in the 1-Wasserstein distance).

Let $E$ be a Polish space with a complete metric $d_{E}$. Let $\theta,(\theta^{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{1}(E)$ be such that $W_{1}(\theta^{n},\theta)\rightarrow 0$ as $n\rightarrow\infty$. Let $f:E\rightarrow\mathbb{R}$ be a measurable function such that $|f(x)|\leq C(1+d_{E}(x,x_{0}))$ for all $x\in E$, for some $x_{0}\in E$ and for some constant $C>0$. Let $\mathbb{D}_{f}$ be the set of its discontinuity points and assume $\theta(\mathbb{D}_{f})=0$. Then

[TABLE]

Proof.

The proof works as in [58], proof of Theorem 7.12.iv, the only difference being that here $f$ can have discontinuities with $\theta(\mathbb{D}_{f})=0$ . In particular, we perform the same decomposition as in [58], i.e. $f(x)=f_{R}^{1}(x)+f_{R}^{2}(x)$ with $f_{R}^{1}(x)\doteq f(x)\wedge(C(1+R))$ and $f_{R}^{2}(x)\doteq f(x)-f_{R}^{1}(x)$ for all $x\in E$ and for some $R>0$ . We have that $|f^{1}_{R}|$ is bounded by $C(1+R)$ and $\theta(\mathbb{D}_{f^{1}_{R}})=0$ since $\mathbb{D}_{f^{1}_{R}}\subset\mathbb{D}_{f}$ . Then all limits can be performed just as in [58], proof of Theorem 7.12.iv. ∎
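The truncation used in the proof can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the original argument: it takes $E=\mathbb{R}$ with $x_{0}=0$, a hypothetical test function $f$ with linear growth and one jump, and hypothetical constants $C$ and $R$. It verifies that $f=f_{R}^{1}+f_{R}^{2}$, that $f_{R}^{1}\leq C(1+R)$, and that the remainder satisfies $0\leq f_{R}^{2}(x)\leq C\,(d_{E}(x,x_{0})-R)^{+}$, which follows from the growth bound on $f$ and shows that $f_{R}^{2}$ vanishes on the ball of radius $R$.

```python
def truncate(f, C, R):
    """Split f into f1 = min(f, C(1 + R)) and the remainder f2 = f - f1."""
    cap = C * (1.0 + R)
    f1 = lambda x: min(f(x), cap)
    f2 = lambda x: f(x) - f1(x)
    return f1, f2

def f(x):
    # Hypothetical test function on E = R with x0 = 0:
    # |f(x)| <= 2 (1 + |x|), with a single discontinuity at x = 1.
    return abs(x) + (1.0 if x >= 1.0 else 0.0)

C, R = 2.0, 3.0
f1, f2 = truncate(f, C, R)
grid = [k / 10.0 for k in range(-200, 201)]  # sample points in [-20, 20]
```

Since truncating from above creates no new discontinuities, the jump of $f_{R}^{1}$ in this example is still located at $x=1$ only, in line with the inclusion $\mathbb{D}_{f^{1}_{R}}\subset\mathbb{D}_{f}$ used in the proof.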

Bibliography

[1] Aliprantis, C. and K. Border (1994). Infinite Dimensional Analysis. Springer-Verlag, Berlin.
[2] Ambrosio, L., N. Gigli, and G. Savaré (2008). Gradient flows: in metric spaces and in the space of probability measures. Springer Science & Business Media, Basel.
[3] Bahlali, S., B. Mezerdi, and B. Djehiche (2006). Approximation and optimality necessary conditions in relaxed stochastic control problems. International Journal of Stochastic Analysis 2006.
[4] Beneš, V. (1971). Existence of optimal stochastic control laws. SIAM Journal on Control 9 (3), 446–472.
[5] Bertucci, C. (2018). Optimal stopping in mean field games, an obstacle problem approach. Journal de Mathématiques Pures et Appliquées 120, 165–194.
[6] Billingsley, P. (1999). Convergence of probability measures. John Wiley & Sons, New York.
[7] Bouveret, G., R. Dumitrescu, and P. Tankov (2020). Mean-field games of optimal stopping: a relaxed solution approach. SIAM Journal on Control and Optimization 58 (4), 1795–1821.
[8] Brunick, G. and S. Shreve (2013). Mimicking an Itô process by a solution of a stochastic differential equation. The Annals of Applied Probability 23 (4), 1584–1628.
