On Classical Solutions to the Mean Field Game System of Controls

Z Kobeissi (UPD7)

arXiv:1904.11292·math.AP·July 13, 2020

On Classical Solutions to the Mean Field Game System of Controls

Z Kobeissi (UPD7)

PDF

TL;DR

This paper investigates classical solutions to the mean field game system of controls, establishing existence and uniqueness results for the associated PDEs under natural assumptions, with applications illustrated through examples.

Contribution

It provides new existence and uniqueness theorems for mean field game systems involving controls, using Bernstein's method for a priori estimates.

Findings

01

Existence of solutions under natural assumptions

02

Uniqueness under more restrictive conditions

03

Application to specific examples

Abstract

In this paper, we consider a class of mean field games in which the optimal strategy of a representative agent depends on the statistical distribution of the states and controls. We prove some existence results for the forward-backward system of PDEs under rather natural assumptions. The main step of the proof consists of obtaining a priori estimates on the gradient of the cost function by Bernstein's method. Uniqueness is also proved under more restrictive assumptions. The last section contains some examples to which the previously mentioned existence (and possibly uniqueness) results apply.

Equations410

- \partial_{t} u (t, x) - ν Δ u (t, x) + H (t, x, \nabla_{x} u (t, x)) = f (x, m (t))

- \partial_{t} u (t, x) - ν Δ u (t, x) + H (t, x, \nabla_{x} u (t, x)) = f (x, m (t))

\partial_{t} m (t, x) - ν Δ m (t, x) - div (H_{p} (t, x, \nabla_{x} u (t, x)) m) = 0

u (T, x) = g (x, m (T))

m (0, x) = m_{0} (x)

- \partial_{t} u (t, x) - ν Δ u (t, x) + H (x, \nabla_{x} u (t, x), μ (t)) = 0

- \partial_{t} u (t, x) - ν Δ u (t, x) + H (x, \nabla_{x} u (t, x), μ (t)) = 0

\partial_{t} m (t, x) - ν Δ m (t, x) - div (H_{p} (x, \nabla_{x} u (t, x), μ (t)) m) = 0

\displaystyle\mu(t)=\Bigl{(}I_{d},-H_{p}\left(\cdot,\nabla_{x}u(t,\cdot),\mu(t)\right)\Bigr{)}{\#}m(t)

u (T, x) = g (x, m (T))

m (0, x) = m_{0} (x)

\mu\mapsto{\widetilde{\mu}}=\Bigl{(}I_{d},-H_{p}\left(\cdot,\nabla_{x}u(t,\cdot),\mu\right)\Bigr{)}{\#}m,

\mu\mapsto{\widetilde{\mu}}=\Bigl{(}I_{d},-H_{p}\left(\cdot,\nabla_{x}u(t,\cdot),\mu\right)\Bigr{)}{\#}m,

Λ_{q} (μ)

Λ_{q} (μ)

Λ_{\infty} (μ)

Λ_{q_{1}} (μ) \leq Λ_{q_{2}} (μ),

Λ_{q_{1}} (μ) \leq Λ_{q_{2}} (μ),

Λ_{q} (μ)

Λ_{q} (μ)

Λ_{\infty} (μ)

C^{\frac{β}{2}, β} ([0, T] \times T^{d}; R^{n}) = {v \in C^{0} ([0, T] \times T^{d}; R^{n}), \exists C > 0 s.t. \forall (t_{1}, x_{1}), (t_{2}, x_{2}) \in [0, T] \times T^{d}, ∣ v (t_{1}, x_{1}) - v (t_{2}, x_{2}) ∣ \leq C (∣ x_{1} - x_{2} ∣^{2} + ∣ t_{1} - t_{2} ∣)^{\frac{β}{2}}} .

C^{\frac{β}{2}, β} ([0, T] \times T^{d}; R^{n}) = {v \in C^{0} ([0, T] \times T^{d}; R^{n}), \exists C > 0 s.t. \forall (t_{1}, x_{1}), (t_{2}, x_{2}) \in [0, T] \times T^{d}, ∣ v (t_{1}, x_{1}) - v (t_{2}, x_{2}) ∣ \leq C (∣ x_{1} - x_{2} ∣^{2} + ∣ t_{1} - t_{2} ∣)^{\frac{β}{2}}} .

∥ v ∥_{C^{\frac{β}{2}, β}} = ∥ v ∥_{\infty} + (t_{1}, x_{1}) \neq = (t_{2}, x_{2}) sup \frac{∣ v ( t _{1} , x _{1} ) - v ( t _{2} , x _{2} ) ∣}{( ∣ x _{1} - x _{2} ∣ ^{2} + ∣ t _{1} - t _{2} ∣ ) ^{\frac{β}{2}}} .

∥ v ∥_{C^{\frac{β}{2}, β}} = ∥ v ∥_{\infty} + (t_{1}, x_{1}) \neq = (t_{2}, x_{2}) sup \frac{∣ v ( t _{1} , x _{1} ) - v ( t _{2} , x _{2} ) ∣}{( ∣ x _{1} - x _{2} ∣ ^{2} + ∣ t _{1} - t _{2} ∣ ) ^{\frac{β}{2}}} .

∥ v ∥_{C^{\frac{1 + β}{2}, 1 + β}} = ∥ v ∥_{\infty} + ∥ \nabla_{x} v ∥_{C^{\frac{β}{2}, β}} + (t_{1}, x) \neq = (t_{2}, x) \in [0, T] \times T^{d} sup \frac{∣ v ( t _{1} , x ) - v ( t _{2} , x ) ∣}{∣ t _{1} - t _{2} ∣ ^{\frac{1 + β}{2}}} .

∥ v ∥_{C^{\frac{1 + β}{2}, 1 + β}} = ∥ v ∥_{\infty} + ∥ \nabla_{x} v ∥_{C^{\frac{β}{2}, β}} + (t_{1}, x) \neq = (t_{2}, x) \in [0, T] \times T^{d} sup \frac{∣ v ( t _{1} , x ) - v ( t _{2} , x ) ∣}{∣ t _{1} - t _{2} ∣ ^{\frac{1 + β}{2}}} .

- \partial_{t} u^{M} (t, x) - ν Δ u^{M} (t, x) + H (x, \nabla_{x} u^{M} (t, x), μ^{M} (t)) = 0

- \partial_{t} u^{M} (t, x) - ν Δ u^{M} (t, x) + H (x, \nabla_{x} u^{M} (t, x), μ^{M} (t)) = 0

\partial_{t} m_{t}^{M} (t, x) - ν Δ m^{M} (t, x) - div (H_{p} (x, \nabla_{x} u^{M} (t, x), μ^{M} (t)) m^{M}) = 0

μ^{M} (t) = [I_{d}, T_{M} (- H_{p} (\cdot, \nabla_{x} u^{M} (t, \cdot), μ^{M} (t)))] # m^{M} (t)

u^{M} (T, x) = g (x, m^{M} (T))

m^{M} (0) = m_{0},

T_{M} (v) = ⎩ ⎨ ⎧ v if ∣ v ∣ \leq M, \frac{M}{∣ v ∣} v otherwise.

T_{M} (v) = ⎩ ⎨ ⎧ v if ∣ v ∣ \leq M, \frac{M}{∣ v ∣} v otherwise.

∥ g (\cdot, m) ∥_{C^{2 + β_{0}}} \leq C_{0}, \forall m \in P (T^{d}) .

∥ g (\cdot, m) ∥_{C^{2 + β_{0}}} \leq C_{0}, \forall m \in P (T^{d}) .

H (x, p, μ^{1}) - H (x, p, μ^{2})

H (x, p, μ^{1}) - H (x, p, μ^{2})

H_{p} (x, p, μ^{1}) - H_{p} (x, p, μ^{2})

μ^{M} (t) = [I_{d}, T_{M} (- H_{p} (\cdot, p (t, \cdot), μ^{M} (t)))] # m (t),

μ^{M} (t) = [I_{d}, T_{M} (- H_{p} (\cdot, p (t, \cdot), μ^{M} (t)))] # m (t),

g (\cdot, m^{1}) - g (\cdot, m^{2})_{C^{1 + β}} \leq C_{0} W_{q_{1}} (m^{1}, m^{2}),

g (\cdot, m^{1}) - g (\cdot, m^{2})_{C^{1 + β}} \leq C_{0} W_{q_{1}} (m^{1}, m^{2}),

μ^{M} = [I_{d}, T_{M} (- H_{p} (\cdot, p (\cdot), μ^{M}))] # m .

μ^{M} = [I_{d}, T_{M} (- H_{p} (\cdot, p (\cdot), μ^{M}))] # m .

Λ_{q} (μ^{M}) \leq \frac{C _{0}}{1 - λ _{0}} (1 + ∣ p ∣^{q - 1}_{L^{m a x (q_{0}, q)} (m)}) .

Λ_{q} (μ^{M}) \leq \frac{C _{0}}{1 - λ _{0}} (1 + ∣ p ∣^{q - 1}_{L^{m a x (q_{0}, q)} (m)}) .

Φ_{(p, m)}^{M} : C^{0} (T^{d}; R^{d}) \to C^{0} (T^{d}; R^{d}) α \mapsto {T^{d} \to R^{d} x \mapsto T_{M} (- H_{p} (x, p (x), (I_{d}, α) # m)) .

Φ_{(p, m)}^{M} : C^{0} (T^{d}; R^{d}) \to C^{0} (T^{d}; R^{d}) α \mapsto {T^{d} \to R^{d} x \mapsto T_{M} (- H_{p} (x, p (x), (I_{d}, α) # m)) .

Λ_{q} (μ^{M})

Λ_{q} (μ^{M})

\leq C_{0} (1 + ∣ p ∣^{q - 1}) + λ_{0} Λ_{q_{0}} (μ^{M})_{L^{q} (m)}

\leq C_{0} (1 + ∣ p ∣^{q - 1}_{L^{q} (m)}) + λ_{0} Λ_{q} (μ^{M}),

μ^{N, n} = [I_{d}, T_{M} (- H_{p} (\cdot, p^{n} (\cdot), μ^{N, n}))] # m^{n}

μ^{N, n} = [I_{d}, T_{M} (- H_{p} (\cdot, p^{n} (\cdot), μ^{N, n}))] # m^{n}

Λ_{q_{0}} (μ) \leq Λ_{q^{'}} (μ) \leq \frac{C _{0}}{1 - λ _{0}} (1 + ∥ p ∥_{L^{q} (m)}^{q - 1}),

Λ_{q_{0}} (μ) \leq Λ_{q^{'}} (μ) \leq \frac{C _{0}}{1 - λ _{0}} (1 + ∥ p ∥_{L^{q} (m)}^{q - 1}),

Λ_{q_{0}} (μ)^{q^{'}} \leq \frac{C _{0}^{q^{'}}}{( 1 - λ _{0} ) ^{q^{'}}} (θ^{1 - q^{'}} + (1 - θ)^{1 - q^{'}} ∥ p ∥_{L^{q} (m)}^{q}) .

Λ_{q_{0}} (μ)^{q^{'}} \leq \frac{C _{0}^{q^{'}}}{( 1 - λ _{0} ) ^{q^{'}}} (θ^{1 - q^{'}} + (1 - θ)^{1 - q^{'}} ∥ p ∥_{L^{q} (m)}^{q}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On Classical Solutions to the Mean Field Game System of Controls

Z. Kobeissi Laboratoire Jacques-Louis Lions, Univ. Paris Diderot, Sorbonne Paris Cité, UMR 7598, UPMC, CNRS, 75205, Paris, France. [email protected]

Abstract

We consider a class of mean field games in which the optimal strategy of a representative agent depends on the statistical distribution of the states and controls.

We prove some existence results for the forward-backward system of PDEs under rather natural assumptions. The main step of the proof consists of obtaining a priori estimates on the gradient of the value function by Bernstein’s method. Uniqueness is also proved under more restrictive assumptions.

Finally, we discuss some examples to which the previously mentioned existence (and possibly uniqueness) results apply.

Introduction

The theory of Mean Field Games (MFG for short) has been introduced in the independent works of J.M. Lasry and P.L. Lions [31, 32, 33], and of M.Y. Huang, P.E. Caines and R.Malhamé [25, 26]. It aims at studying deterministic or stochastic differential games (Nash equilibria) as the number of agents tends to infinity. The agents are supposed to be rational (given a cost to be minimized, they always choose the optimal strategies), and indistinguishable. Furthermore, the agents interact via some empirical averages of quantities which depend on the state variable.

At the limit when $N\rightarrow+\infty$ , the game may be modeled by a system of two coupled partial differential equations (PDEs), which is named the MFG system. On the one hand, there is a Fokker-Planck-Kolmogorov equation describing the evolution of the statistical distribution $m$ of the state variable; this equation is a forward in time parabolic equation, and the initial distribution at time $t=0$ is given. On the other hand, the optimal value of a generic agent at some time $t$ and state $x$ is noted $u(t,x)$ and is defined as the lowest cost that a representative agent can achieve from time $t$ to $T$ if it is at state $x$ at time $t$ . The value function satisfies a Hamilton-Jacobi-Bellman equation posed backward in time with a terminal condition involving a terminal cost. In the present work, we will restrict our attention to the case when the costs and the dynamics are periodic in the state variable, and we will work in the $d$ -dimensional torus ${\mathbb{T}}^{d}$ (as it is often done in the MFG literature for simplicity). We will take a finite horizon time $T>0$ , and will only consider second-order non-degenerate MFG systems. In this case, the MFG system is often written as:

[TABLE]

We refer the reader to [10] for some theoretical results on the convergence of the $N$ -agents Nash equilibrium to the solutions of the MFG system. For a thorough study of the well-posedness of the MFG system, see the videos of P.L.Lions’ lecture at the Collège de France, and some lecture notes [9].

There is also an important literature on the probabilistic aspects of MFGs, see [12, 29] for some examples and [13, 14] for a detailed presentation of the probabilistic viewpoint.

For applications of MFGs, numerical simulations are crucial because it is most often impossible to find explicit or semi-explicit solutions to the MFG system. We refer to [2] for a survey on finite difference methods and to [3] for applications to crowd motion.

Most of the literature on MFGs is focused on the case when the mean field interactions only involves the distributions of states. Here we will consider a more general situation in which the cost of an individual agent depends on the joint distribution $\mu$ of states and optimal strategies. To underline this, we choose to use the terminology Mean Field Games of Controls (MFGCs) for this class of MFGs; the latter terminology was introduced in [11]. Within this framework, the usual MFG system (1.1) is replaced by the following MFGC system,

[TABLE]

We would like to point out two of the main difficulties that one may encounter when studying (1.2) and which are not present in the study of (1.1).

1)

The joint law of states and controls satisfies a fixed point relation described by (1.2c). 2. 2)

The HJB equation (1.2a) is non-local with respect to $\nabla_{x}u$ . Consequently, it is much more difficult to obtain uniform a priori estimates on $u$ and the its derivatives.

Difficulty 1 is in general not straightforward and one needs to make assumptions for the fixed point in $\mu$ to have a unique solution when $\left(\nabla_{x}u,m\right)$ are given. An example in which this fixed point relation does not admit any solution is given in [1] Remark $4.3$ .

Let us provide a simple illustration for describing difficulty 2 by comparing the results obtained when we apply the maximum principle on parabolic equations to (1.1a) and (1.2a) respectively: if $u$ satisfies (1.1a) where $f$ and $g$ are assumed to be uniformly bounded with respect to $m$ , then $u$ is uniformly bounded; under the same assumption on $g$ , if $u$ is a solution to (1.2a) and $H$ is not uniformly bounded with respect to $\mu$ , we can only say that $u$ is bounded in absolute value by a constant depending on $\mu$ . The other estimates used in the usual arguments of existence in MFG sytems suffer the same lack of uniformity with respect to $\mu$ . Conversely, the estimates of $\mu$ depend on $\nabla_{x}u$ . It is not obvious a priori how to combine the estimates on $\mu$ and $(u,m)$ in order to obtain uniform estimates on $u$ . Consequently, compactness results are harder to obtain for (1.2) than for (1.1).

The main assumption of this paper, namely FP1 and FP2 described below, is an original structural assumption designed to address difficulty 1. In particular, it implies that the map

[TABLE]

is a contraction in a convenient metric space, when $(t,u,m)$ are given.

Moreover, we also assume that the Hamiltonian $H(x,p,\mu)$ behaves like a power function when $p$ tends to infinity. See paragraph 2.2 for more details.

The main objective of this work is to discuss existence of the solutions of the MFGC system (1.2) within this framework. We will also give a uniqueness result under a short time horizon assumption. We refer to [1] for a numerical application with multiple solutions. Indeed, uniqueness does not hold in general for arbitrary time horizon. It can be obtained though, under a monotonicity assumption which is investigated in the companion paper [28]. In [28], existence and uniqueness of solutions of the MFGC system are proved under the above-mentioned monotonicity assumption and with Hamitonian having similar growth as in the present paper. This monotonicity condition implies that the agents favor moving in a direction opposite to the mainstream. Such an assumption is adapted to some models coming from finance or economy; and may be unrealistic in several situations, in particular in models of crowd motions. This explains why here we introduce a new structural assumption and refrain from assuming monotonicity or investigating uniqueness in the general case.

Related literature

In the first articles devoted to MFGCs, [20, 21], D. Gomes and his collaborators have given several existence results for MFGCs in various cases, using the terminology extended MFGs instead of MFGCs. For instance, [21] contains existence results for stationary games (infinite horizon) under the assumption that some of the parameters involved in the models are small. We refer to [7, 11, 15, 13, 28] for other existence and uniqueness results for MFGC systems.

Uniqueness is a major issue in MFG theory, it has been proved for (1.1) in [33, 35] under an assumptions called the Lasry-Lions monotonicity on the coupling function $f$ and the terminal cost $g$ in the case of non-local coupling. This assumption has been extended to MFGC and discussed in [20, 13, 28] in which uniqueness is proved. It translates the fact that the agents prefer directions opposite to the mainstream direction; therefore it is not adapted to a large class of MFGC systems like crowd motion models in which an agent is more likely to go in the mainstream direction.

The latter example of population dynamic is the typical application we had in mind when writing the assumptions in the present paper, see paragraphs 6.3 and 6.4. To our knowledge, existence results for such MFGC systems have not been discussed in the literature before. Uniqueness should not hold in general but under a short-time assumption. We refer to [1] in which the MFGC system is discretized using a finite-difference scheme and simulations are provided where the approximating discrete MFGC system admits several different solutions.

For other applications of MFGCs we refer to [11] for an model of optimal trading, [8, 16, 22, 24, 27] in the case of competition between firms producing the same goods, or [4] for energy storage.

Organization of the paper

Section 2 describes the notations, assumptions and main results in this paper. In Section 3, we address difficulty 1 which consists of inverting the fixed point relation in $\mu$ (1.2c) and providing estimates on the resulting flow of measures. Section 4 is devoted to proving a priori estimates on the solutions to (1.2) and addresses difficulty 2. Section 5 contains the proofs of the main results. Finally, we discuss several applications in Section 6. Namely, we study

•

the Bertrand and Cournot competition for exhaustible ressources and introduce an extension to negatively correlated ressources (for instance gold and other raw materials);

•

a model of price impact for high-frequency trading by Almgren and Chriss in which we discuss the possibility for the bid and ask prices to be different;

•

a first-order flocking model;

•

a crowd motion model.

Notations and assumptions

2.1 Notations and definitions

The spaces of probability measures are equipped with the weak* topology. We denote by ${\mathcal{P}}_{\infty}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ the subset of measures $\mu$ in ${\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ with a second marginal compactly supported. For $\mu\in{\mathcal{P}}_{\infty}\left(\mathbb{R}^{d}\times\mathbb{R}^{d}\right)$ and ${\widetilde{q}}\in[1,\infty)$ , we define the quantities $\Lambda_{{\widetilde{q}}}(\mu)$ and $\Lambda_{\infty}(\mu)$ by,

[TABLE]

Jensen inequality states that,

[TABLE]

for any $1\leq q_{1}\leq q_{2}\leq\infty$ .

For $R>0$ , we denote by ${\mathcal{P}}_{\infty,R}\left(\mathbb{R}^{d}\times\mathbb{R}^{d}\right)$ the subset of measures $\mu$ in ${\mathcal{P}}_{\infty}\left(\mathbb{R}^{d}\times\mathbb{R}^{d}\right)$ such that $\Lambda_{\infty}\left(\mu\right)\leq R$ . The probability measures $\mu$ involved in (1.2) and (2.4), have a particular form, since they are the images of a measure $m$ on ${\mathbb{T}}^{d}$ by $\left(I_{d},\alpha\right)$ , where $\alpha$ is a bounded measurable functions from ${\mathbb{T}}^{d}$ to $\mathbb{R}^{d}$ ; in particular they are supported on the graph of $\alpha$ . For $m\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ , we call ${\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ the set of such measures. For $\mu\in{\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , we set $\alpha^{\mu}$ to be the unique element of $L^{\infty}\left(m\right)$ such that $\mu=\left(I_{d},\alpha^{\mu}\right)\#m$ . Here, $\Lambda_{{\widetilde{q}}}(\mu)$ and $\Lambda_{\infty}(\mu)$ defined in (2.1) are given by

[TABLE]

If $X$ is a normed space and $|\cdot|_{X}$ is its norm, for $n\geq 1$ we denote by $C^{0}\left(X;\mathbb{R}^{n}\right)$ the set of bounded continuous functions from $X$ to $\mathbb{R}^{n}$ ; it is endowed with the norm ${\left\|v\right\|_{\infty}}=\sup_{x\in X}|v(x)|_{X}$ .

We define $C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ as the set of the functions $v\in C^{0}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ differentiable at any point with respect to the state variable, and such that its gradient satisfies $\nabla_{x}v\in C^{0}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ . This is a Banach space equipped with the norm ${\left\|v\right\|_{C^{0,1}}}={\left\|v\right\|_{\infty}}+{\left\|\nabla_{x}v\right\|_{\infty}}$ .

For $\beta\in(0,1)$ and $n\geq 1$ , we denote by $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{n}\right)$ the parabolic space of Hölder continuous functions which is commonly defined by

[TABLE]

This is a Banach space equipped with the norm,

[TABLE]

The space $C^{\frac{1+\beta}{2},1+\beta}([0,T]\times{\mathbb{T}}^{d};\mathbb{R})$ is defined as the set of the functions $v\in C^{0,1}([0,T]\times{\mathbb{T}}^{d};\mathbb{R})$ such that $\nabla_{x}v\in C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{n}\right)$ and which admits a finite norm defined by,

[TABLE]

We set $C^{1,2}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ to be the set of functions which admit first derivative with resepct to time and second derivatives with respect to the state variables, such that these derivatives are continuous with respect to time and state.

Throughout the paper, what we call a solution to (1.2) is precisely defined by the following definition.

Definition 2.1.

The triple $(u,m,\mu)$ is a solution to (1.2) if $u\in C^{1,2}([0,T]\times{\mathbb{T}}^{d})$ is a pointwise solution to the Hamilton-Jacobi-Bellman equation (1.2a) with terminal condition (1.2d), $m\in C^{0}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ is solution to the Fokker-Planck-Kolmogorov equation (1.2b) in the sense of distribution with initial condition (1.2e), and $\mu\in C^{0}\left([0,T];{\mathcal{P}}_{\infty}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\right)$ satisfies (1.2c) at any $t\in[0,T]$ .

A simple way to overcome difficulty 2 is to assume that the Hamiltonian $H$ and some of its derivatives admit uniform bounds with respect to $\mu$ . In this case, the well-posedness of the MFGC system with a possibly degenerate diffusion is investigated in [11]. Here we avoid such an assumption for (1.2) but we introduce the following approximating system which satisfies it,

[TABLE]

where $M$ is a positive constant and $T_{M}$ is a truncation map defined by

[TABLE]

The latter definition can be naturally extended to the case when $M=\infty$ by taking $T_{\infty}=Id_{\mathbb{R}^{d}}$ . In this case systems (1.2) and (2.4) coincide. A solution to (2.4) is defined by replacing (1.2) by (2.4) in Definition 2.1.

If $M<\infty$ and $\left(u^{M},m^{M},\mu^{M}\right)$ is a solution to (2.4), $\mu^{M}(t)$ is compactly supported in $B_{\mathbb{R}^{d}}(0,M)$ the closed ball in $\mathbb{R}^{d}$ centered at [math] with radius $M$ , for any $t\in[0,T]$ . Consequently, $u^{M}$ and $m^{M}$ should satisfy estimates depending on $M$ and uniform with respect to $\mu^{M}$ . Therefore, compactness results for (2.4) should be less demanding than for (1.2) and difficulty 2 should vanish.

2.2 Assumptions

Let us start with some reasonable assumptions about the regularity and the boundedness of the Hamiltonian, the terminal cost and the inital distribution of agents. We introduce two constants: $C_{0}>0$ and $\beta_{0}\in(0,1)$ .

A1

$H=H(x,p,\mu)$ is convex with respect to $p$ , and differentiable with respect to $(x,p)$ ; $H_{p}$ is locally $\beta_{0}$ -Hölder continuous with respect to $p$ ; $H$ and $H_{p}$ are continuous with respect to $\mu$ on ${\mathcal{P}}_{\infty,R}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ for any $R>0$ , where ${\mathcal{P}}_{\infty,R}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ is defined in paragraph 2.1 and equipped with the weak* topology. 2. A2

$g:{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\rightarrow\mathbb{R}$ is continuous, and we suppose that $x\mapsto g(x,m)$ is in $C^{2+\beta_{0}}\left({\mathbb{T}}^{d}\right)$ , with a norm bounded uniformly with respect to $m$ , i.e.

[TABLE] 3. A3

$m_{0}\in{\mathcal{P}}({\mathbb{T}}^{d})$ is absolutely continuous with respect to the Lebesgue measure on ${\mathbb{T}}^{d}$ and we also name $m_{0}$ its density (abuse of notation). Assume that $m_{0}\in C^{\beta_{0}}({\mathbb{T}}^{d})$ and is positive (see Remark 2.2 below to drop out the positivity assumption).

These assumptions are not restrictive when looking for solutions with the regularity given in Definition 2.1. However, they can be relaxed if we are interested in weaker solutions of systems (1.2).

In this paper we consider nonlocal coupling through the controls. More precisely, we assume that these interactions involve the quantity $\Lambda_{q_{0}}(\mu)$ defined in (2.1).

Let us introduce the assumptions used to address difficulty 1 which consists of solving the fixed point relations in $\mu$ given in (1.2c) and (2.4c), when $(u,m)$ are fixed and have the same regularity as in Definition 2.1. We introduce $\lambda_{0}\in[0,1)$ , for all $\left(x,p,m\right)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ , and $\mu,\mu^{1},\mu^{2}\in{\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , we assume that,

FP1

$|H_{p}(x,p,\mu)|\leq C_{0}(1+|p|^{q-1})+\lambda_{0}\Lambda_{q_{0}}(\mu)$ . 2. FP2

$\left|H_{p}(x,p,\mu^{1})-H_{p}(x,p,\mu^{2})\right|\leq\lambda_{0}{\left\|\alpha^{\mu^{1}}-\alpha^{\mu^{2}}\right\|_{L^{q_{0}}(m)}}$ .

These structural assumptions for MFGC are new in the literature and participate to the originality and novelty of the results presented in this paper. Moreover they do not seem to be restrictive as it is explained in what follows.

We recall that the optimal control of a representative agent is given by $\alpha=-H_{p}\left(x,\nabla_{x}u,\mu\right)$ . Since $\Lambda_{q_{0}}(\mu)$ is homogeneous to the norm of a control, we cannot expect the dependency of $H_{p}$ upon $\mu$ to involve an exponent larger than one. Moreover if $m$ is the first marginal of $\mu$ , taking the $L^{q_{0}}(m)$ -norm in FP1 makes $\Lambda_{q_{0}}(\mu)$ appear in both sides of the resulting inequality; this explains the form of the right-hand side in FP1 and the necessity of choosing $\lambda_{0}$ smaller than $1$ . Similar arguments can provide insights on FP2, by noticing that if $\Lambda_{q_{0}}$ was seen as a norm on ${\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ then ${\left\|\alpha^{\mu^{1}}-\alpha^{\mu^{2}}\right\|_{L^{q_{0}}(m)}}$ would be the associated distance. We refer to Remark $4.3$ in [1] for a concrete example of a MFGC system which does not admit solution if $\lambda_{0}=1$ .

As in a large part of the literature on MFG or HJB equations, we consider Hamiltonians that are power-like functions in $p$ at least asymptotically. Let $q\in(1,\infty)$ be this asymptotic exponent, and $q^{\prime}$ the conjugate exponent of $q$ defined by $q^{\prime}=\frac{q}{q-1}$ . Namely, we assume that $H$ satisfies the following inequalities, for all $x\in{\mathbb{T}}^{d}$ , $p\in\mathbb{R}^{d}$ , $m\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ , and $\mu\in{\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ ,

B1

$|H(x,0,\mu)|\leq C_{0}+\lambda_{2}\Lambda_{q_{0}}(\mu)^{q^{\prime}}$ , with $\lambda_{2}\geq 0$ . 2. B2

$|H_{x}(x,p,\mu)|\leq C_{0}\left(1+|p|^{q}+\Lambda_{q_{0}}(\mu)^{q^{\prime}}\right)$ . 3. B3

$H_{p}(x,p,\mu)\cdot p-H(x,p,\mu)\geq C_{0}^{-1}\left(|p|^{q}-\lambda_{1}\Lambda_{q_{0}}\left(\mu\right)^{q^{\prime}}\right)-C_{0}$ , where $\lambda_{1}$ is a nonnegative constant satisfying $0\leq\lambda_{1}<\frac{(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ .

One may notice that the dependencies of $H$ upon $p$ and $\mu$ involve different exponents (which happen to be equal when $q=2$ ). Indeed the Legendre transform applied to a power-like function make the exponent change into its conjugate. Since $H$ is defined as the Legendre transform of the Lagrangian $L$ , the exponent in the dependency of $L$ upon $\alpha$ should be $q^{\prime}$ . Moreover, $\Lambda_{q_{0}}(\mu)$ is homogeneous to the norm of a control, therefore $L$ should at most involve $\Lambda_{q_{0}}(\mu)^{q^{\prime}}$ . Going back to the Hamiltonian by the Legendre transform, the exponent on $\Lambda_{q_{0}}(\mu)$ stays the same which explains the right-hand side in B1-B3. One may find the abovementionned growth conditions on $L$ in [28].

Assumption B3 is a convexity property of $H$ with respect to $p$ . In MFG without coupling through the controls, such an assumption is common, the only difference is that the term in $\Lambda_{q_{0}}(\mu)$ does not appear. This assumption will be particularly useful to obtain energy integral estimates by taking advantage of the duality properties of the forward-backward systems (1.2a), (1.2b) and (2.4a), (2.4b). The inequality satisfied by $\lambda_{1}$ is needed in the calculation for getting these estimates. Let us mention that the right-hand side in this inequality comes from the estimates in Lemma 3.1, and that the constant $C_{0}$ can be identified with the one in FP1.

In order to obtain classical solutions of the HJB equations (1.2a) and (2.4a), we need Hölder continuity of $(t,x)\mapsto H\left(x,\nabla_{x}u(t,x),\mu(t)\right)$ . While the space regularity of the latter map is straightforward here, its time regularity may be more demanding and we need assumptions which allow one to compare $H$ at different measures $\mu^{1},\mu^{2}\in{\mathcal{P}}_{\infty}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ . Assumption FP2 is not enough since it requires $\mu^{1}$ and $\mu^{2}$ to share the same marginal with respect to ${\mathbb{T}}^{d}$ .

T

For $R>0$ , there exists a constant $C_{R}>0$ such that

[TABLE]

for $\left(x,p,m^{i},\mu^{i}\right)$ such that $\left(x,p\right)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}$ with $|p|\leq R$ , $m^{i}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\cap C^{0}\left({\mathbb{T}}^{d}\right)$ with $m^{i}\geq R^{-1}$ , $\mu^{i}\in{\mathcal{P}}_{m^{i}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ with $\alpha^{\mu^{i}}\in C^{0}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ and ${\left\|\alpha^{\mu^{i}}\right\|_{\infty}}\leq R$ , $i=1,2$ .

One may notice that when $\mu^{1}$ and $\mu^{2}$ have the same first marginal with respect to ${\mathbb{T}}^{d}$ the second inequality in T is implied by FP2. If one is only interested in weak solution to (1.2), T can be removed.

Remark 2.2.

Letting $C_{R}$ depends on ${\left\|\left(m^{i}\right)^{-1}\right\|_{\infty}}$ was motivated by models of population dynamics which are discussed in paragraphs 6.3 and 6.4. The drawback of this assumption is that we have to assume that the initial distribution of agents $m_{0}$ is positive.

All the results in this paper hold if we do not assume $m_{0}$ to be positive in A3, and we remove the condition $m^{i}\geq R^{-1}$ in T.

2.3 Main results

We recall that assumptions FP1 and FP2 are designed to address difficulty 1, and T to obtain time regularity of the fixed point $\mu$ in 1.2c or 2.4c. More precisely, we state the following lemma that will be proved in Section 3.

Lemma 2.3.

Assume A1, FP1, FP2 and T. Take $p\in C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ and $m\in C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ such that $m\geq R^{-1}$ and $m(t)\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ for $t\in[0,T]$ , where $\beta\in(0,1)$ and $R>0$ are constants. For any $t\in[0,T]$ , there exists a unique $\mu(t)\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ satisfying

[TABLE]

where $M\in(0,\infty]$ . Moreover, the map $(t,x)\mapsto\alpha^{\mu^{M}(t)}(x)$ is in $C^{\frac{\beta\beta_{0}}{2},\beta\beta_{0}}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ , and its associated norm can be estimated from above by a constant which depends on ${\left\|p\right\|_{C^{\frac{\beta}{2},\beta}}}$ , ${\left\|m\right\|_{C^{\frac{\beta}{2},\beta}}}$ , and the constants in the assumptions.

In Section 4, we prove the a priori estimates stated in the following lemma.

Lemma 2.4.

Assume A1-A3, B2, B3, FP1, FP2 and T. If $(u,m,\mu)$ is a solution to (2.4) for $M\in(0,\infty]$ , then

•

${\left\|\nabla_{x}u\right\|_{\infty}}\leq C\left(1+{\left\|u\right\|_{\infty}}\right)$ * and ${\left\|u\right\|_{\infty}}\leq C\left(1+{\left\|\nabla_{x}u\right\|_{\infty}^{q}}\right)$ , where $C$ is independent of $M$ and depends only on the constants in the assumptions,*

•

$m$ * is positive,*

•

$u\in C^{1+\frac{\beta}{2},2+\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ ,

•

$m\in C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ ,

•

$(t,x)\mapsto\alpha^{\mu(t)}(x)$ * is in $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ ,*

where $\beta\in\left(0,\beta_{0}^{2}\right)$ . Moreover, ${\left\|m^{-1}\right\|_{\infty}}$ and the norms associated with the last three items above depend only on ${\left\|u\right\|_{\infty}}$ , ${\left\|m_{0}^{-1}\right\|_{\infty}}$ , $\beta$ and the constants in the assumptions.

These estimates are weaker than their equivalents for MFG systems without interaction through controls. In particular, $u$ is not uniformly bounded in ${\left\|\cdot\right\|_{\infty}}$ -norm. However, we believe that our estimate of ${\left\|\nabla_{x}u\right\|_{\infty}}$ is the best that we can achieve in our framework since its right-hand side should be at least linear with respect to ${\left\|u\right\|_{\infty}}$ . To our knowledge, such an estimate for systems of MFG with nonlocal dependency on $\nabla_{x}u$ (or more generally for MFG systems in which we do not have a uniform a priori estimate on $u$ ) is new in the literature.

Here, these a priori estimates are not sufficient to address the difficulty 2 and to obtain existence of solutions. However, existence can be obtained under several different kinds of assumptions; below, we supply a list of existence results under various assumptions:

Theorem 2.5.

Assume A1-A3, B1-B3, FP1, FP2, T. There exists a solution to (1.2) if one of the following assertions is satisfied

a)

$q_{0}\leq q^{\prime}$ * and $\left|H(x,0,\mu)\right|\leq C_{0}\left(1+\Lambda_{q_{0}}\left(\mu\right)^{{\widetilde{q}}}\right)$ , where ${\widetilde{q}}$ is a constant satisfying ${\widetilde{q}}<q^{\prime}$ (Proposition 5.4),* 2. b)

$q_{0}\leq q^{\prime}$ * and $\lambda_{1}+C_{0}\lambda_{2}<\frac{(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ where $\lambda_{1}$ and $\lambda_{2}$ are respectively defined in B3 and B1, the $C_{0}$ on the left-hand side comes from $C_{0}^{-1}$ in B3 and the $C_{0}$ on the right-hand side comes from FP1 (Proposition 5.3),* 3. c)

$\left|H(x,0,\mu)\right|\leq C_{0}\left(1+\Lambda_{q_{0}}\left(\mu\right)^{q^{\prime}-1}\right)$ , for any $\left(x,\mu\right)\in{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ (Proposition 5.5), 4. d)

$\left|H_{x}(x,p,\mu)\right|\leq C_{0}\left(1+|p|+\Lambda_{q_{0}}\left(\mu\right)^{q^{\prime}-1}\right)$ , for any $\left(x,p,\mu\right)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ (Proposition 5.6), 5. e)

$T\leq T_{0}$ , where $T_{0}$ is a constant depending on the constants in the assumptions (Proposition 5.8).

An other additional assumption under which existence holds is the monotonicity condition addressed in [28].

We also give a uniqueness result under a short time horizon assumption.

Theorem 2.6 (Uniqueness with short time horizon).

Assume A1, A2, A3, B1, B2, B3, FP1, FP2, and that the following three assumptions are satisfied,

•

$H_{p}$ * is locally Lipschitz continuous with respect to $p$ ,*

•

$g$ * satisfies*

[TABLE]

for any $m^{1},m^{2}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ , where $q_{1}\in[1,\infty)$ and $W_{q_{1}}$ is the $q_{1}$ -Wassertein distance on measures,

•

the two inequalities in T hold when we replace ${\left\|m^{1}-m^{2}\right\|_{\infty}^{\beta_{0}}}$ by $W_{q_{1}}\left(m^{1},m^{2}\right)$ .

There exists $T_{1}>0$ such that if $T<T_{1}$ then there is at most one solution to (1.2).

We believe that this uniqueness result can be easily extended to more general Hamiltonians, but that the short-time assumption is essential. Indeed numerical examples in which non-uniqueness occurs are presented in [1]. In these examples, we consider groups of agents who start from some crowded areas at time $t=0$ , and travel through the domain to arrive at some target areas. Imposing a short time assumption in such an example results in the agents not trying to reach the targets at all. Indeed in this case the kinetic cost makes it more expensive for them to cross the domain very quickly before the end of the game than to do nothing and just wait passively at their starting point. For this reason we were not interested in finding less restrictive assumptions in Theorem 2.6. This theorem should be only seen as an example of uniqueness result with a short time horizon assumption. In particular we wanted the proof in paragraph 5.4 to stay simple.

Remark 2.7.

)

In this work, we only consider MFGC systems in the $d$ -dimensional torus ${\mathbb{T}}^{d}$ . However, we believe that our existence results (Theorem 2.5) hold under the same assumptions on the Euclidean space $\mathbb{R}^{d}$ , and that the method introduced in **[28]** to pass from ${\mathbb{T}}^{d}$ to $\mathbb{R}^{d}$ can applied here. 2. )

We did not include the case $q=1$ in this work (i.e. when the Hamiltonian is Lipschitz continuous in $p$ ). In this case, systems (1.2) and (2.4) coincide when $M$ is large enough, therefore there exists a solution to (1.2) under assumptions A1-A3, B1-B3, FP1, FP2 and T, by the same arguments as in Lemma 5.1.

The fixed point relation in $\mu$

and the proof of Lemma 2.3

We recall that (1.2) and (2.4) conincide when $M=\infty$ . Here, we take $M\in(0,\infty]$ .

The following lemma takes advantage of the structural assumptions FP1 and FP2 to solve the fixed point relations (1.2c) and (2.4c) which consists of difficulty 1. It also states a priori estimates on $\mu$ which will be of great use in the next section to obtain a priori estimates on $u$ and its derivatives.

Lemma 3.1.

Assume A1, FP1 and FP2. Take $p\in C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ , and $m\in{\mathcal{P}}({\mathbb{T}}^{d})$ . The following two assertions are satisfied.

(i)

There exists a unique $\mu^{M}\in{\mathcal{P}}({\mathbb{T}}^{d}\times\mathbb{R}^{d})$ such that

[TABLE]

For any ${\widetilde{q}}\in[1,\infty]$ , it satisfies

[TABLE] 2. (ii)

The map $(p,m)\mapsto\mu^{M}$ given by (3.1), is continuous from $C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)\times{\mathcal{P}}({\mathbb{T}}^{d})$ to ${\mathcal{P}}({\mathbb{T}}^{d}\times\mathbb{R}^{d})$ . We recall that the spaces of measures are equipped with the weak- topology.*

Proof.

$(i)$

Let us define the following map,

[TABLE]

This map is well defined by (A1). It is $\lambda_{0}$ -Lipschitz continuous by FP2 and the fact that $T_{M}$ is $1$ -Lipschitz continuous, we recall that $\lambda_{0}<1$ . Therefore it admits a unique fixed point by the Banach fixed point theorem. If $\mu^{M}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ satisfies (3.1) then $\alpha^{\mu^{M}}$ is the only fixed point of $\Phi_{(p,m)}^{M}$ . Conversely, if we denote by $\alpha$ the fixed point of $\Phi_{(p,m)}^{M}$ , then $\mu^{M}$ defined by $\mu^{M}=\left(I_{d},\alpha\right)\#m$ satisfies (3.1). This implies that (3.1) admits a unique fixed point that we name $\mu^{M}$ in what follows. From FP1, $\mu^{M}$ satisfies,

[TABLE]

for ${\widetilde{q}}\geq q_{0}$ , where we obtained the last line by using the triangle inequality for the $L^{{\widetilde{q}}}$ -norm, and (2.2). This implies (3.2) for any ${\widetilde{q}}\geq q_{0}$ . Then we extend this result to $1\leq{\widetilde{q}}<q_{0}$ by combining (2.2) and (3.2) applied to $q_{0}$ . 2. $(ii)$

Let $(p^{n},m^{n})_{n\in\mathbb{N}}\in\left(C^{0}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right);{\mathcal{P}}({\mathbb{T}}^{d})\right)^{\mathbb{N}}$ be a convergent sequence to $(p,m)$ in $C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)\times{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ . We define $\mu^{N}$ as before, and $\left(\mu^{N,n}\right)_{n\in\mathbb{N}}$ the fixed points satisfying

[TABLE]

for $n\in\mathbb{N}$ . The sequence $\left(p^{n}\right)_{n\in\mathbb{N}}$ is bounded in $C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ , thus (3.2) with ${\widetilde{q}}=\infty$ yields that $\left(\mu^{N,n}\right)_{n\in\mathbb{N}}$ are uniformly compactly supported. The sequence $\left(\mu^{N,n}\right)$ is compact in ${\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ endowed with the weak-* topology. Let ${\widetilde{\mu}}$ be the limit of a subsequence $\left(\mu^{N,\varphi(n)}\right)_{n\in\mathbb{N}}$ , for $\varphi:\mathbb{N}\to\mathbb{N}$ an increasing function. By continuity of $H_{p}$ and $T_{M}$ , we can pass to the limit in (3.3) taken at $\varphi(n)$ when $n$ tends to infinity, this gives that ${\widetilde{\mu}}$ satisfies the same fixed point relation as $\mu$ . By uniqueness of this fixed point, we deduce that ${\widetilde{\mu}}=\mu$ . This implies that the entire sequence $\left(\mu^{N,n}\right)$ tends to $\mu$ .

Therefore the map $(p,m)\mapsto\mu^{M}$ is continuous from $C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)\times{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ to ${\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ .

∎

In particular if $q_{0}\leq q^{\prime}$ , (2.2) and (3.2) yield

[TABLE]

and then we use the inequality $(a+b)^{q^{\prime}}\leq\frac{a^{q^{\prime}}}{\theta^{q^{\prime}-1}}+\frac{b^{q^{\prime}}}{(1-\theta)^{q^{\prime}-1}}$ which holds for $a,b>0$ and for any $\theta\in(0,1)$ , to obtain

[TABLE]

If $q\in[1,\infty]$ without restriction, we obtain

[TABLE]

These latter three inequalities will be of great use in Section 4 for getting a priori estimates.

Given $(u,m)$ as regular as in definition 2.1, we can use Lemma 3.1 to prove that the fixed point relations (1.2c) and (2.4c) are well-posed, and that if $(u,m,\mu)$ is a solution to (1.2) or (2.4) then $\mu$ is continuous with respect to time. However, we need a better regularity in time to get classical solution of the HJB equations (1.2a) and (2.4a). In Lemma 3.2, we use T to obtain an estimate of the distance between two fixed points of (3.1) associated with different $(u,m)$ . We will be particularly interested in using this estimate on a solution to (2.4) at different times.

Lemma 3.2.

Assume A1, FP1, FP2 and T. Take $p^{1},p^{2}\in C^{0}\left({\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ , and $m^{1},m^{2}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\cap C^{0}\left({\mathbb{T}}^{d};\mathbb{R}\right)$ some positive probability measures. We define $\mu^{1},\mu^{2}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ as the fixed point in $(i)$ in Lemma 3.1 associated with $\left(p^{1},m^{1}\right)$ and $\left(p^{2},m^{2}\right)$ , respectively. There exists a constant $C$ such that

[TABLE]

where $C$ depends on ${\left\|p^{i}\right\|_{\infty}}$ , ${\left\|\left(m^{i}\right)^{-1}\right\|_{\infty}}$ , for $i=1,2$ , and the constants in the assumptions.

Proof.

We define ${\widetilde{\mu}}$ by ${\widetilde{\mu}}=\left(I_{d},\alpha^{\mu^{1}}\right)\#m^{2}$ . The triangle inequality and the fact that $T_{M}$ is a contraction imply that for any $x\in{\mathbb{T}}^{d}$ ,

[TABLE]

The measures $\mu^{1}$ and ${\widetilde{\mu}}$ are the image measures by the same function $\left(I_{d},\alpha^{\mu^{1}}\right)$ , of $m^{1}$ and $m^{2}$ respectively. From T, we obtain

[TABLE]

where $R=\max\left({\left\|p^{i}\right\|_{\infty}},{\left\|\left(m^{i}\right)^{-1}\right\|_{\infty}}\right)$ and $C_{R}$ is the constant defined in T. We recall that $\Lambda_{\infty}\left(\mu^{i}\right)$ can be estimated from above by a quantity which only depends on ${\left\|p^{i}\right\|_{\infty}}$ and the constants in the assumptions, by (3.2).

Since ${\widetilde{\mu}}$ and $\mu^{2}$ have the same marginal with respect to ${\mathbb{T}}^{d}$ , FP2 yields that,

[TABLE]

Then $H_{p}$ is locally $\beta_{0}$ -Hölder continuous by A1 so,

[TABLE]

for some constant $C$ . Combining the latter four inequalities, we obtain,

[TABLE]

which implies (3.7) up to replacing $C$ with $\left(1-\lambda_{0}\right)^{-1}\max\left(C,C_{R}\right)$ . ∎

Lemma 2.3 is a straightfoward consequence of Lemmas 3.1 and 3.2.

A priori estimates

and the proof of Lemma 2.4

Here we take $M\in(0,\infty]$ , and $(u,m,\mu)$ a solution to (2.4) defined in Definition 2.1. We will look for estimates independent of $M$ which allow us to address difficulty 2. These a priori estimates imply compactness results and play an essential role in the proofs of existence in Section 5.

4.1 A priori estimates on $u$

When we consider MFG without interactions through controls and with bounded coupling function and terminal cost, we can apply the maximum principle on parabolic differential equations to (4.3) below and get an a priori estimates of ${\left\|u\right\|_{\infty}}$ which only depends on the constants in the assumptions. However, for MFGC systems and more generally for HJB equations with non-local interactions in $\nabla_{x}u$ , it is not possible to get such a strong a priori estimate directly from the maximum principle. Instead we get (4.1) and (4.2) which involve non-local quantities depending on $\nabla_{x}u$ .

Lemma 4.1.

Under assumptions A1, A2, B1, FP1, FP2, and $q_{0}\leq q^{\prime}$ , for $\theta\in(0,1)$ $u$ satisfies,

[TABLE]

where $\lambda_{2}$ is defined in B1. More generally, for any $q_{0}\in[1,\infty]$ $u$ satisfies,

[TABLE]

Proof.

Here, we can rewrite (2.4a) in the following way,

[TABLE]

for $\left(t,x\right)\in(0,T)\times{\mathbb{T}}^{d}$ . The maximum principle for parabolic second-order equation applies to $u$ and $-u$ ,

[TABLE]

Moreover, $|H\left(x,0,\mu(t)\right)|\leq C_{0}+\lambda_{2}\Lambda_{q_{0}}\left(\mu(t)\right)^{q^{\prime}}$ and $|u(T,x)|\leq C_{0}$ come from B1 and A2, respectively. We combine the latter inequalities with (3.5) and (3.6) to get (4.1) when $q_{0}\leq q^{\prime}$ , and (4.2) respectively. ∎

The non-local term in (4.1) involving $\nabla_{x}u$ corresponds roughly speaking to an energy. Moreover this is a quantity that naturally appears in MFG literature thanks to duality properties in the forward-backward systems (1.1), (1.2), or (2.4). More precisely, the FPK equations is the dual equation of the linearized HJB equation with respect to $u$ . Lemma 4.2 provides an a priori estimate of this quantity.

Lemma 4.2.

Under assumptions A1, A2, B3, FP1, FP2, and $q_{0}\leq q^{\prime}$ , the following inequality is satisfied,

[TABLE]

for any $\theta\in(0,1)$ such that $\lambda_{1}<\frac{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ .

Proof.

We multiply (2.4a) by $-m$ and (2.4b) by $u$ ; we add up and integrate over $(0,T)\times{\mathbb{T}}^{d}$ the resulting quantities; after performing some integrations by part, we obtain

[TABLE]

that we can combine with B3 and A2 to get,

[TABLE]

We integrate (3.5) over $(0,T)$ ,

[TABLE]

where we can choose $\theta\in(0,1)$ such that $\lambda_{1}<\frac{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ , since $\lambda_{1}$ satisfies the inequality in B3. The latter three inequalities imply (4.5). ∎

Roughly speaking, Lemma 4.1 with $q_{0}\leq q^{\prime}$ and Lemma 4.2 provide opposite inequalities which may become complementary under a smallness condition on the parameters, implying a uniform estimate on ${\left\|u\right\|_{\infty}}$ . This condition is explicitely given in the following corollary.

Corollary 4.3.

Under Assumptions A1, A2, B1, B3, FP1, FP2, $q_{0}\leq q^{\prime}$ , and $\lambda_{1}+C_{0}\lambda_{2}<\frac{(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ , $u$ is bounded by a quantity which only depends on the constants in the assumptions.

Proof.

Combing (4.1) and (4.5) results in,

[TABLE]

where $\theta\in(0,1)$ may be chosen such that $\lambda_{1}+C_{0}\lambda_{2}<\frac{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ , and $C_{\theta}$ is a positive constant depending on the constants in the assumptions and $\theta$ . This implies

[TABLE]

where $C_{0}\lambda_{2}\left(\frac{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}-\lambda_{1}\right)^{-1}<1$ , which concludes the proof. ∎

Let us mention that in the assumption $\lambda_{1}+C_{0}\lambda_{2}<\frac{(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ in Corollary 4.3, the constant $C_{0}$ in the left-hand side comes from the $C_{0}^{-1}$ in B3, and the $C_{0}$ in the right-hand side comes from FP1.

4.2 A priori estimates on $m$

In order for the HJB equations (1.2a) and (2.4a) to admit classical solutions, we want $\mu$ to be regular in time. Since $m$ is the marginal of $\mu$ with respect to ${\mathbb{T}}^{d}$ , we first prove that $m$ is regular in the following lemma. Moreover, we also prove that $m$ stays positive, which is required in Lemma 2.3 to obtain time regularity on $\mu$ .

Lemma 4.4.

Under assumptions A1, A3, FP1, FP2, $m$ is in $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ for $\beta\in(0,\beta_{0})$ and its $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d},\mathbb{R}\right)$ -norm can be estimated from above by a constant which depends on ${\left\|m_{0}\right\|_{C^{\beta_{0}}}}$ , ${\left\|\nabla_{x}u\right\|_{\infty}}$ , $\beta$ and the constants in the assumptions.

Furthermore, $m$ is positive everywhere and admits a positive lower bound which only depends on ${\left\|m_{0}^{-1}\right\|_{\infty}}$ , ${\left\|\nabla_{x}u\right\|_{\infty}}$ and the constants in the assumptions.

Proof.

The distribution of agents $m$ satisfies the second-order parabolic FPK equation (2.4b), which is supplemented with a $\beta_{0}$ -Hölder continuous initial condition. Theorem $2.1$ section $V.2$ in [30] states that $m$ is uniformly bounded by a constant which depends on ${\left\|m_{0}\right\|_{\infty}}$ and ${\left\|H_{p}\left(\cdot,\nabla_{x}u,\mu\right)\right\|_{\infty}}$ . This, (FP1) and (3.4) yield that $mH_{p}\left(\cdot,\nabla_{x}u,\mu,m\right)$ is bounded by a constant which depends on ${\left\|m_{0}\right\|_{\infty}}$ , ${\left\|\nabla_{x}u\right\|_{\infty}}$ and the constant of the assumptions. Finally, Theorem $6.29$ in [34] yields that $m\in C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ for $\beta\in(0,\beta_{0})$ , and its associated norm can be estimated from above by a constant which depends on ${\left\|m_{0}\right\|_{C^{\beta_{0}}}}$ , ${\left\|\nabla_{x}u\right\|_{\infty}}$ , $\beta$ and the constants in the assumptions.

We define $T_{\varepsilon}=\inf\left(\left\{t\in[0,T],{\left\|m(t)^{-1}\right\|_{\infty}}\leq\varepsilon\right\}\cup\left\{T\right\}\right)$ , for $0<\varepsilon<{\left\|m_{0}^{-1}\right\|_{\infty}}$ . In particular $T_{\varepsilon}$ is positive, since we proved in the latter paragraph that $m$ is continuous. On $[0,T_{\varepsilon}]\times{\mathbb{T}}^{d}$ we define the function $n$ by $n=m^{-1}$ , it satisfies the following partial differential equation in the sense of viscosity,

[TABLE]

supplemented with the initial condition $n(0)=m_{0}^{-1}$ , where $\alpha(t,x)=-H_{p}\left(x,\nabla_{x}u(t,x),\mu(t)\right)$ for $(t,x)\in[0,T]\times{\mathbb{T}}^{d}$ . We define ${\widetilde{n}}$ as the unique weak solution of the following partial differential equation defined on $[0,T]\times{\mathbb{T}}^{d}$ ,

[TABLE]

supplemented with the initial condition ${\widetilde{n}}(0)=m_{0}^{-1}$ . Theorem $2.1$ section $V.2$ in [30] states that ${\widetilde{n}}$ is bounded from above by a constant which depends on ${\left\|m_{0}^{-1}\right\|_{\infty}}$ , ${\left\|\alpha\right\|_{\infty}}$ and $T$ . Moreover, $n$ is a subsolution of the restriction of (4.6) to $[0,T_{\varepsilon}]\times{\mathbb{T}}^{d}$ , with the same initial condition as ${\widetilde{n}}$ . Therefore, by a comparison argument for second-order parabolic equations in divergence form (Theorem $9.7$ in [34] for instance), $n$ and ${\widetilde{n}}$ satisfy $n\leq{\widetilde{n}}$ . This implies that there exists $C$ a positive constant independent of $T_{\varepsilon}$ , such that ${\left\|n\right\|_{\infty}}\leq C$ . We conclude the proof by taking $\varepsilon=2^{-1}C^{-1}$ and recalling that ${\left\|\alpha\right\|_{\infty}}$ can be estimated from above using FP1 and (3.4). ∎

4.3 A priori estimates on derivatives of $u$

Bernstein methods are useful tools when studying HJB equations or MFG systems. They allow one to obtain a priori estimates on $\nabla_{x}u$ by considering the partial differential equations satisfied by some well-chosen functions depending on $u$ and $\nabla_{x}u$ . See for example the video of the lecture of P.L. Lions on November the $23$ rd $2018$ [35], in which Bernstein estimates are derived for MFG systems without interactions through controls. More precisely, P.L. Lions used the function defined by $\left|\nabla_{x}u\right|^{2}e^{-\eta u}$ , for small $\eta$ . Here this method might work only if we knew a uniform estimates on ${\left\|u\right\|_{\infty}}$ and if $q=2$ . After significant changes in the latter method, we can derive an estimate on $u$ which is weaker than the one for MFG without interactions through controls. Namely, we state that ${\left\|\nabla_{x}u\right\|_{\infty}}$ is bounded by a quantity that depends linearly on ${\left\|u\right\|_{\infty}}$ by studying the functions $w$ and $\varphi$ defined in (4.11) below. To our knowledge, such estimates for systems of MFG with nonlocal dependency on $\nabla_{x}u$ (or more generally for MFG systems in which we do not have a uniform a priori estimate on $u$ ) are new in the literature. We believe that this result may hold for more general HJB equations with nonlocal dependency on $\nabla_{x}u$ .

Lemma 4.5.

Under assumptions A1, A2, B2, B3, FP1 and FP2, there exists $C>0$ depending only on the constants of the assumptions, such that

[TABLE]

for any $t\in[0,T]$ .

Proof.

In what follows, we only prove that (4.7) holds for $t=0$ , however the proof does not use additional information available at $t=0$ (the initial condition on $m$ for example), so it can be repeated for any $t\in[0,T]$ and the constant $C$ in (4.7) does not depend on $t$ .

Here we wish to differentiate (2.4a) with respect to $x$ ; however we did not assume in Definition 2.1 enough regularity on $u$ for such an operation to have sense pointwisely on $(0,T)\times{\mathbb{T}}^{d}$ . Especially the time derivative of $\nabla_{x}u$ and the third derivatives of $u$ with respect to $x$ are not required to exist. This leads us to introducing $\rho\in C^{\infty}\left([-\frac{1}{2},\frac{1}{2})^{d}\right)$ a non-negative mollifier such that $\rho(x)=0$ if $|x|\geq\frac{1}{4}$ and $\int_{\mathbb{R}^{d}}\rho(x)dx=1$ . We introduce $\rho^{\delta}=\delta^{-d}\rho\left(\frac{\cdot}{\delta}\right)$ and $u^{\delta}(t)=\rho^{\delta}\star u(t)$ , for any $0<\delta<1$ and $t\in[0,T]$ , where $\star$ denotes the convolution operator.

Thus $u^{\delta}$ depends smoothly on the state variable and its partial derivatives in space at any order have the same regularity in time as $u$ , moreover it solves the following partial differential equation with final condition,

[TABLE]

Let us take the gradient with respect to the state variable of the latter equation and the scalar product of the resulting equality with $\nabla_{x}u^{\delta}$ ,

[TABLE]

where $H^{\delta}$ and $R^{\delta}$ are defined by

[TABLE]

By simple calculus, we notice that

[TABLE]

that we can combine with (4.9) and obtain

[TABLE]

We define the functions $\varphi$ and $w^{\delta}$ by

[TABLE]

where $a>1$ and $b>0$ are constants that will be defined below. The derivatives of $\varphi$ are given by

[TABLE]

which implies that $\varphi$ and $\varphi^{\prime}$ satisfy,

[TABLE]

Roughly speaking, we introduced $a$ and $b$ in order to have ${\left\|\varphi\right\|_{\infty}}{\left\|\varphi^{-1}\right\|_{\infty}}$ and ${\left\|\varphi^{\prime}\right\|_{\infty}}{\left\|(\varphi^{\prime})^{-1}\right\|_{\infty}}$ as close as possible to $1$ . This will be achieved by taking $a$ large enough, and $b$ small enough.

For simplicity of the notations, we will omit to write the argument of $\varphi$ since it is always $u^{\delta}$ .

The derivatives of $w^{\delta}$ verify the following equalities,

[TABLE]

We multiply (4.10) by $2\varphi$ and use the latter equalities in the resulting relation,

[TABLE]

We can rewrite the first line of (4.8) in the following way,

[TABLE]

where $Q^{\delta}$ is defined by,

[TABLE]

This and (4.14) imply that

[TABLE]

In the following we will estimate from above the right-hand side of the latter expression. We notice that the second term of the right-hand side is negative since

[TABLE]

We notice that $R^{\delta}$ and $Q^{\delta}$ are uniformly convergent to [math] as $\delta$ tends to [math], so we can assume that,

[TABLE]

for $\delta$ small enough and depending on $\varepsilon>0$ .

The first term in the last line of (4.15) can be bounded using B2,

[TABLE]

In fact we are going to use the latter inequality to obtain (4.19) below, only by noticing that using (3.6), the right-hand side involves only terms with exponents in ${\left\|w^{\delta}\right\|_{\infty}}$ or ${\left\|w^{0}\right\|_{\infty}}$ not larger than $\frac{1+q}{2}$ .

Then we use B3 on the first term of the right-hand side of (4.15) since $\varphi^{\prime}<0$ ,

[TABLE]

The term involving $-\left(w^{\delta}\right)^{1+\frac{q}{2}}$ is a key element in this proof. On the one hand, it will allow us to cancel the term in $w^{\delta}\Lambda_{q_{0}}\left(\mu(t)\right)^{q^{\prime}}$ . On the other hand, we will use the fact that it has a larger exponent than any of the remaining terms.

From (3.6) and (4.13), we obtain

[TABLE]

where $\theta\in(0,1)$ will be defined below. Then (4.13) implies,

[TABLE]

Combining the latter six inequalities, (4.15), and the fact that ${\left\|w^{\delta}\right\|_{\infty}}\leq{\left\|w^{0}\right\|_{\infty}}$ , we obtain the following partial differential inequality,

[TABLE]

where $C_{a,b,\theta}$ is a positive constant which only depends on the constants in the assumptions and in $(a,b,\theta)$ . We systematically used the inequality ${\left\|w^{0}\right\|_{\infty}^{r}}\leq 1+{\left\|w^{0}\right\|_{\infty}^{\frac{1+q}{2}}}$ on every term of the form ${\left\|w^{0}\right\|_{\infty}^{r}}$ with $0<r<\frac{1+q}{2}$ .

Let us mention the following result: the function $y^{+}$ defined by $y^{+}=\max\left(y_{0},K^{-\frac{1}{k}}{\left\|f\right\|_{\infty}^{\frac{1}{k}}}\right)$ is a super-solution of the following differential equation,

[TABLE]

posed on $[0,T]$ , where $k$ and $y_{0}$ are positive constants and $f$ is a bounded positive function.

This and ${\left\|w(0)\right\|_{\infty}}\leq eC_{0}^{2}$ which comes from A2 and (4.13), yield that a super-solution to (4.19) is given by

[TABLE]

where we replace $C_{a,b,\theta}$ with $C_{a,b,\theta}+\left(eC_{0}^{2}\right)^{1+\frac{q}{2}}$ .

From a comparison argument for parabolic second-order equation, $w^{\delta}$ is not larger than the latter expression. This result holds for $w^{0}$ by letting $\delta$ and $\varepsilon$ tend to [math], thus $w^{0}$ verifies the following inequality,

[TABLE]

By B3, we can choose $a>1$ large enough, $b>0$ and $\theta\in(0,1)$ small enough such that $\frac{\lambda_{1}C_{0}^{q^{\prime}}e^{2b}e^{\frac{q}{2}e^{-a+b}}}{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}<1$ . This implies

[TABLE]

where we increased $C_{a,b,\theta}$ into $\left(1-\frac{\lambda_{1}C_{0}^{q^{\prime}+1}e^{2b}e^{\frac{q}{2}e^{-a+b}}}{(1-\theta)^{q^{\prime}-1}(1-\lambda_{0})^{q^{\prime}}}\right)^{-1}C_{a,b,\theta}$ .

We make out two cases: the first case is when ${\left\|w^{0}\right\|_{\infty}^{\frac{1}{2}}}\leq 2C_{a,b,\theta}\left(1+{\left\|u\right\|_{\infty}}\right)$ . The second case is when ${\left\|w^{0}\right\|_{\infty}^{\frac{1}{2}}}>2C_{a,b,\theta}\left(1+{\left\|u\right\|_{\infty}}\right)$ . In the latter case, (4.20) implies that ${\left\|w^{0}\right\|_{\infty}^{1+\frac{q}{2}}}\leq\frac{1}{2}{\left\|w^{0}\right\|_{\infty}^{\frac{1}{2}}}\left(1+{\left\|w^{0}\right\|_{\infty}^{\frac{1+q}{2}}}\right)$ , which implies that ${\left\|w^{0}\right\|_{\infty}}\leq 1$ . Therefore, in any of the two latter cases we obtain

[TABLE]

This and (4.13) yield (4.7) when $t=0$ , this concludes the proof. ∎

Now, we can combine the estimates obtained in this section with classical results on parabolic second-order equations and get further estimates of $u$ and its derivatives and on $m$ .

Lemma 4.6.

Assume A1, A2, B2, B3, FP1, FP2 and T. The function $u$ is in $C^{1+\frac{\beta}{2},2+\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ for any $\beta\in\left(0,\beta_{0}^{2}\right)$ , where $\beta_{0}$ was introduced in the assumptions. Its $C^{1+\frac{\beta}{2},2+\beta}$ -norm can be bounded by a quantity depending only on ${\left\|u\right\|_{\infty}}$ , $\beta$ , and the constants in the assumptions.

Proof.

Lemma 4.5 states that ${\left\|\nabla_{x}u\right\|_{\infty}}$ is bounded by a quantity which depends on ${\left\|u\right\|_{\infty}}$ and the constants in the assumptions. So is $\Lambda_{q_{0}}(\mu)$ by (3.2). Then $u$ is the solution of the heat equation with a right-hand side equal to $-H\left(x,\nabla_{x}u,\mu\right)$ which is bounded in $L^{\infty}$ . Classical results (see for example Theorem $6.48$ in [34]) state that for any $\beta\in(0,1)$ , the $C^{\frac{1}{2}+\frac{\beta}{2},1+\beta}$ -norm of $u$ is bounded by a constant which depends on the $L^{\infty}$ -norm of the right-hand side, the terminal condition, and $\beta$ .

Lemma 4.4 yields that $m$ is in $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ for $\beta\in(0,\beta_{0})$ , is positive, and that both its $C^{\frac{\beta}{2},\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ -norm and its lower bound depend on ${\left\|u\right\|_{\infty}}$ , ${\left\|m_{0}^{-1}\right\|_{\infty}}$ , $\beta$ , and the constant of the assumptions.

Therefore, Lemma 2.3 yields that $\left[(t,x)\mapsto\alpha^{\mu(t)}(x)\right]\in C^{\frac{\beta\beta_{0}}{2},\beta\beta_{0}}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}^{d}\right)$ .

From A1, $H$ is locally Lipschitz continuous with respect to $(x,p)$ . This and T imply that $\left[(t,x)\mapsto H\left(t,\nabla_{x}u\left(t,x\right),\mu(t)\right)\right]\in C^{\frac{\beta\beta_{0}}{2},\beta\beta_{0}}\left([0,T]\times{\mathbb{T}}^{d}\right)$ . Thus $u$ is the solution of the backward heat equation with a right-hand side in $C^{\frac{\beta\beta_{0}}{2},\beta\beta_{0}}$ supplemented with terminal condition in $C^{2+\beta_{0}}$ . Classical results (see for instance Theorem $4.9$ in [34]) yield that $u$ is in $C^{1+\frac{\beta\beta_{0}}{2},2+\beta\beta_{0}}$ , and its $C^{1+\frac{\beta\beta_{0}}{2},2+\beta\beta_{0}}$ -norm depends on ${\left\|g(\cdot,m(T))\right\|_{C^{2+\beta_{0}}}}$ and the $C^{\frac{\beta\beta_{0}}{2},\beta\beta_{0}}$ -norm of the right-hand side. We recall that $\beta$ is any constant in $(0,\beta_{0})$ . The proof of the lemma is complete.

Following precisely the dependencies in the above estimates, we obtain that the $C^{1+\frac{\beta\beta_{0}}{2},2+\beta\beta_{0}}$ norm of $u$ can be estimated from above by a constant which depends on ${\left\|u\right\|_{\infty}}$ , ${\left\|m_{0}^{-1}\right\|_{\infty}}$ , $\beta$ , and the constants in the assumptions. ∎

The conclusions of Lemmas 4.1, 4.4, 4.5 and 4.6 are summarized in Lemma 2.4.

Existence and uniqueness results

under additional assumptions

5.1 Solving the MFGC systems for $M<\infty$

Lemma 5.1.

Under assumptions A1-A3, B2, B3, FP1, FP2, T and $M\in(0,\infty)$ , there exists at least one solution to (2.4).

Proof.

For $(u,m)\in C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)\times C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\right)$ , we define $\mu^{M}\in C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\right)$ by

[TABLE]

using Lemma 2.3. Then we define $u^{M}$ as the viscosity solution of the following backward HJB equation with a final condition,

[TABLE]

We can rewrite the first line of the latter system in the following way,

[TABLE]

where the right-hand side is bounded using $\Lambda_{\infty}\left(\mu(t)\right)\leq M$ , B1 and (2.2). The maximum principle for second-order parabolic equation provides that $u^{M}$ is bounded. Here, the proof of Lemma 4.5 can be repeated to prove that ${\left\|\nabla_{x}u\right\|_{\infty}}$ is bounded by a constant which depends on $M$ and the constants in the assumptions. Then with the same argument as in Lemma 4.6, $u^{M}$ is bounded in $C^{\frac{1}{2}+\frac{\beta}{2},1+\beta}$ -norm, for all $\beta\in(0,1)$ .

We define $m^{M}$ as the solution in the sense of distributions of the following Fokker-Planck-Kolmogorov equation with an initial condition,

[TABLE]

with $b(t,x)=-H_{p}\left(x,\nabla_{x}u^{M}(t,x),\mu^{M}(t)\right)$ which is a continuous function with respect to $(t,x)$ . Using the same arguments as in Lemma 4.4, we get that $m\in C^{\frac{\beta}{2},\beta}\left({\mathbb{T}}^{d};\mathbb{R}\right)$ for $\beta\in(0,\beta_{0})$ .

Moreover, ${\left\|u\right\|_{C^{\frac{1}{2}+\frac{\beta}{2},1+\beta}}}$ , ${\left\|m\right\|_{C^{\frac{\beta}{2},\beta}}}$ are bounded by a constant which depends on $M$ , $\beta$ and the constants in the assumptions. The map $(u,m)\mapsto\mu^{M}$ is continuous from $C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)\times C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\right)$ to $C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\right)$ by Lemma 3.1. The map $\left(m,\mu^{M}\right)\mapsto u^{M}$ is continuous from $C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\right)\times C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\right)$ to $C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ by the stability of the solutions of viscosity. The map $\left(u^{M},\mu^{M}\right)\mapsto m^{M}$ is continuous from $C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)\times C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\right)$ to $C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\right)$ by linearity of the FPK equation.

Thus the map $(u,m)\mapsto(u^{M},m^{M})$ is continuous from $C^{0,1}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)\times C^{0}\left([0,T];{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\right)$ to itself. Its fixed points are exactly the solutions to (2.4). The image of this map is a subset of a convex compact set. Therefore, there exists a fixed point by Schauder theorem, see [19] Corollary $11.2$ .

Using the same arguments as in the proof of Lemma 4.6, such a fixed point $u$ satisfies $u\in C^{1+\frac{\beta}{2},2+\beta}\left([0,T]\times{\mathbb{T}}^{d};\mathbb{R}\right)$ for any $\beta\in\left(0,\beta_{0}^{2}\right)$ . ∎

Considering $M<\infty$ in (2.4) consists of enforcing the condition $\Lambda_{\infty}\left(\mu(t)\right)\leq M$ , i.e. the fact that the support of $\mu(t)$ is embedded in the compact set ${\mathbb{T}}^{d}\times B_{\mathbb{R}^{d}}\left(0,M\right)$ , for $t\in[0,T]$ . Therefore, the interactions through controls are uniformly bounded. Lemma 5.1 relies on that to state the existence of solutions to (2.4). For $M=\infty$ , we can not obtain such a uniform estimate by combining only the results of Section 4. However if such an estimate exists, the result of Lemma 5.1 holds for $M=\infty$ and yields the existence of solutions to (1.2). More precisely, if a solution to (2.4) satisfies $\Lambda_{\infty}\left(\mu(t)\right)<M$ for any $t\in[0,T]$ , then it is also a solution to (1.2). This is summarized in the following Corollary.

Corollary 5.2.

Under the same assumptions as in Lemma 5.1, if, for any $M>0$ , any solution $(u,m,\mu)$ to (2.4) satisfies ${\left\|u\right\|_{\infty}}\leq C$ , or ${\left\|\nabla_{x}u\right\|_{\infty}}\leq C$ , for some $C>0$ , then there exists at least one solution to (1.2).

Proof.

By Lemma 5.1, we define $(u,m,\mu)$ as a solution to (2.4) for $M\in(0,\infty)$ that will be defined later. By Lemma 2.4, assuming that ${\left\|u\right\|_{\infty}}$ is bounded is equivalent to assuming that ${\left\|\nabla_{x}u\right\|_{\infty}}$ is bounded. Therefore, without loss of generality, we can assume that ${\left\|\nabla_{x}u\right\|_{\infty}}\leq C$ . From FP1 and (3.4), we obtain

[TABLE]

We define $M=1+C_{0}\left(1+C^{q-1}\right)+\frac{\lambda_{0}C_{0}}{1-\lambda_{0}}\left(1+C^{q-1}\right)$ , then the truncation $T_{M}$ leaves $-H_{p}\left(\cdot,\nabla_{x}u^{M},\mu\right)$ unchanged. Hence $(u,m,\mu)$ is a solution to (1.2). ∎

5.2 Existence results

when $q_{0}\leq q^{\prime}$

When $q_{0}\leq q^{\prime}$ , we can use integral energy estimates. More precisely, inequalities (4.1) and (4.5) hold. Therefore, the assumptions under which we can prove existence should be weaker than in the case $q_{0}>q^{\prime}$ in which we have less estimates at our disposal.

In particular, Corollary 4.3 provides a uniform estimate on ${\left\|u\right\|_{\infty}}$ under suitable assumptions. Corollary 5.2 then yields the existence of a solution to (1.2): hence we may state the following theorem:

Proposition 5.3 (*Existence of solution

with small non-linearities*).

Under assumptions A1-A3, B1-B3, FP1, FP2, T, $q_{0}\leq q^{\prime}$ , and $\lambda_{1}+C_{0}\lambda_{2}<\frac{(1-\lambda_{0})^{q^{\prime}}}{C_{0}^{q^{\prime}}}$ , there exists at least one solution to (1.2).

Instead of assuming that the multiplicative parameters are small like in Proposition 5.3; we suppose in Propositions 5.5 below the exponent for the interactions through controls is in fact smaller than the one appearing in B1.

Proposition 5.4.

Assume A1-A3, B2, B3, FP1, FP2, T, $q_{0}\leq q^{\prime}$ , and that $H$ satisfies

[TABLE]

for $(x,\mu)\in{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , where ${\widetilde{q}}\in[0,q^{\prime})$ is a constant. There exists a solution to (1.2).

Proof.

Let $(u,m,\mu)$ be a solution to (2.4) for $M\in(0,\infty)$ . From A2, (4.4) and the new assumption, we obtain that,

[TABLE]

where the second line is obtained by a Hölder inequality, since ${\widetilde{q}}<q^{\prime}$ . Let us recall that the inequality $\left(a+b\right)^{\frac{{\widetilde{q}}}{q^{\prime}}}\leq a^{\frac{{\widetilde{q}}}{q^{\prime}}}+b^{\frac{{\widetilde{q}}}{q^{\prime}}}$ holds for any $a,b>0$ . The latter two inequalities and (3.5) with $\theta=\frac{1}{2}$ imply,

[TABLE]

where $C>0$ is a constant which depends on the constants in the assumptions. This and (4.5) yield that,

[TABLE]

up changing the value of $C$ . Let us make out two cases: the first case is when ${\left\|u\right\|_{\infty}}\leq\left(2C\right)^{\frac{q^{\prime}}{q^{\prime}-{\widetilde{q}}}}$ . The second case is when ${\left\|u\right\|_{\infty}}>\left(2C\right)^{\frac{q^{\prime}}{q^{\prime}-{\widetilde{q}}}}$ , which implies ${\left\|u\right\|_{\infty}}\leq C+\frac{1}{2}{\left\|u\right\|_{\infty}}$ . In any of the two cases, $u$ is uniformly bounded with respect to $M$ . The desired result then stems from Corollary 5.2. ∎

5.3 Existence results

which do not need the assumption $q_{0}<q^{\prime}$

Here, we do not make the assumption $q_{0}\leq q^{\prime}$ . We can still obtain an existence result in the same spirit as the one provided in Proposition 5.4. In the following proposition, the exponent for the interactions through controls is assumed to be smaller than the one appearing in B1 or in Proposition 5.4.

Proposition 5.5.

Assume A1-A3, B2, B3, FP1, FP2, T, and that $H$ satisfies

[TABLE]

for any $(x,\mu)\in{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ . There exists a solution to (1.2).

Proof.

Take $(u,m,\mu)$ a solution to (2.4) for $M\in(0,\infty)$ . Let us combine (4.3), (3.6) for $\theta=\frac{1}{2}$ , (4.7), (5.2), and the inequality $\left(a+b\right)^{\frac{1}{q}}\leq a^{\frac{1}{q}}+b^{\frac{1}{q}}$ which holds for $a,b>0$ ; this yields

[TABLE]

for a constant $C>0$ which depends only on the constants in the assumptions. We recall that ${\left\|u(T)\right\|_{\infty}}\leq C_{0}$ by A2. We consider $y_{+},y_{-}\in C^{1}\left([0,T];\mathbb{R}\right)$ defined as $y_{+}(t)=Ct+C_{0}e^{Ct}$ and $y_{-}(t)=-Ct-C_{0}e^{Ct}$ such that they are solution to the following differential equations

[TABLE]

By a comparison argument for second-order parabolic equation we obtain,

[TABLE]

for $(t,x)\in[0,T]\times{\mathbb{T}}^{d}$ . Therefore $u$ is uniformly bounded with respect to $M$ . The desired result then stems from Corollary 5.2. ∎

In Propotitions 5.4 and 5.5, we changed the exponent appearing in B1. In the following proposition, we assume a smaller exponent than the one appearing in B2 instead.

Proposition 5.6 (*Existence

with more restrictive assumptions on $H_{x}$ *).

Assume A1-A3, B1, B3, FP1, FP2, T, and the following inequality,

[TABLE]

for any $(x,p,\mu)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ . There exists at least one solution to (1.2).

Proof.

Take $(u,m,\mu)$ a solution to (2.4), for $M\in(0,\infty)$ .

First step: we prove the following inequality,

[TABLE]

for any $t\in[0,T]$ , where $C>0$ is a constant depending only on the constants in the assumptions. We will only prove this inequality for $t=0$ , however the proof does not use the additional information available at $t=0$ (the initial condition on $m$ for example), so it can be repeated for any $t\in[0,T]$ and the constant $C$ in (5.5) does not depend on $t$ .

We introduce $\varphi,w^{\delta},a,b,\delta$ and $\varepsilon$ as in the proof of Lemma 4.5. Using (5.4) instead of B2, we obtain

[TABLE]

From this and (3.2), one may notice that the right-hand side of the latter inequality only involves terms with exponents in ${\left\|w^{\delta}\right\|_{\infty}}$ or ${\left\|w^{0}\right\|_{\infty}}$ nor larger than $\frac{1}{2}\left(1+(q-1)(q^{\prime}-1)\right)=1$ . This and the same arguments as in the proof of Lemma 4.5 between (4.18) and (4.19), lead to the following inequality,

[TABLE]

instead of (4.19), where the novelty is the exponent on ${\left\|w^{0}\right\|_{\infty}}$ at the last line which changed from $\frac{1+q}{2}$ to $1$ . Then following the same steps as in the proof of Lemma 4.5 until the end, we obtain that,

[TABLE]

This concludes the first step of the proof.

Second step: obtaining a uniform estimate on $u$ .

Using B1, (3.6) with $\theta=\frac{1}{2}$ and (5.5), we obtain that,

[TABLE]

where the constant $C$ from the previous step may have been increased. This implies that $u$ satisfies the same partial differential inequality as in the proof of Proposition 5.5, namely (5.3). Therefore the same arguments as in Proposition 5.5 apply and we conclude that there exists a solution to (1.2). ∎

Remark 5.7.

Note that the exponent $q^{\prime}-1$ actually appears in several applications: for instance, the price impact model described in paragraph 6.2 in the quadratic case (i.e. $q=2$ ) with $\varepsilon=0$ (i.e. when the bidding and asking prices are equal), satisfies the assumptions in both Propositions 5.5 and 5.6 with an exponent exactly equal to $q^{\prime}-1$ .

5.4 Existence and uniqueness results with

a short-time horizon assumption

Under a short-time horizon assumption, existence and even uniqueness of solutions are well-known in the MFG literature. Indeed, when the time horizon is small, one may obtain strong a priori estimates under non-restrictive assumptions. These estimates combined with Corollary 5.2 yield existence of solution to (1.2) as stated in the following proposition.

Proposition 5.8 (Existence with short time horizon).

Assume A1, A2-B1, B2-FP1, FP2, and T. There exists $T_{0}>0$ such that, if $T\leq T_{0}$ then there exists a solution to (1.2).

Proof.

Take $(u,m,\mu)$ a solution to (2.4) for $M\in(0,\infty)$ . We combine (4.3), FP1, (3.6), (4.7), and the convex inequality $\left(a+b\right)^{q}\leq 2^{q-1}\left(a^{q}+b^{q}\right)$ , and we obtain

[TABLE]

where $C$ is a positive constant which depends only on the constants in the assumptions. We recall that ${\left\|u(T)\right\|_{\infty}}\leq C_{0}$ by A2. Let us consider the following differential equation,

[TABLE]

There exists $T_{0}>0$ such that the latter differential equation admits a bounded solution on $[0,T_{0}]$ . We suppose that $T\leq T_{0}$ , then $(t,x)\mapsto y(T-t)$ is a super-solution to (5.6). Hence by a comparison principle, we get that $u\leq y$ . The same argument applies in order to prove that $u\geq-y$ . Therefore $u$ is uniformly bounded with respect to $M$ , and there exists a solution to (1.2) by Corollary 5.2. ∎

We will now prove Theorem 2.6 which states that uniqueness is achieved under a short-time horizon assumption. We believe that this uniqueness result can be easily extended to more general Hamiltonians, but that the short-time assumption is essential. Indeed, numerical simulations in [1] show that uniqueness does not hold for the discrete MFGC system obtained by approximating (1.2) with finite differences; we believe that uniqueness does not hold for (1.2) either. Theorem 2.6 should be interpreted only as a simple example of uniqueness result with a short-time horizon assumption.

Proof of Theorem 2.6.

We suppose that $T_{1}\leq T_{0}$ , where $T_{0}$ was defined in Proposition 5.8, so that a solution to (1.2) satisfies uniform estimates on ${\left\|u\right\|_{\infty}}$ , $\|u^{i}\|_{C^{1,2}}$ and $\|m^{i}\|_{C^{0}}$ by Lemma 4.6, for $i=1,2$ . Take $(u^{1},m^{1},\mu^{1})$ and $(u^{2},m^{2},\mu^{2})$ two solutions to (1.2). We define $u=u^{1}-u^{2}$ , $m=m^{1}-m^{2}$ and $\alpha=\alpha^{\mu^{1}}-\alpha^{\mu^{2}}$ .

In this proof $C>0$ is a constant which may differ from line to line and depends only on the constants in the assumptions, $\|u^{i}\|_{C^{1,2}}$ and $\|m^{i}\|_{C^{0}}$ , for $i=1,2$ .

We can repeat the proof of Lemma 3.2 replacing ${\left\|m^{1}-m^{2}\right\|_{\infty}^{\beta_{0}}}$ and ${\left\|p^{1}-p^{2}\right\|_{\infty}^{\beta_{0}}}$ respectively with $W_{q_{1}}\left(m^{1},m^{2}\right)$ and ${\left\|p^{1}-p^{2}\right\|_{\infty}^{\beta_{0}}}$ everywhere and we obtain that,

[TABLE]

for any $t\in[0,T]$ . Let us consider $X^{1}$ and $X^{2}$ two random processes defined by

[TABLE]

where $X^{0}$ is a random variable on ${\mathbb{T}}^{d}$ with law $m^{0}$ and $W$ is a Brownian motion independent of $X^{0}$ . The respective laws of $\left(X^{1}_{t},\alpha^{\mu^{1}}(t,X^{1}_{t})\right)$ and $\left(X^{2}_{t},\alpha^{\mu^{2}}(t,X^{2}_{t})\right)$ are $\mu^{1}(t)$ and $\mu^{2}(t)$ . Then we obtain,

[TABLE]

where we used the triangle inequality for the $L^{q_{1}}$ -norm twice. By the first additional assumption of the theorem and A1, $\alpha^{\mu^{1}}$ is Lipschitz continuous with respect to $x$ and its Lipschitz constant depends on ${\left\|u^{i}\right\|_{C^{0,1}}}{}$ and $\Lambda_{\infty}\left(\mu^{1}\right)$ . Using the estimates from the proof of Proposition 5.8, it only depends on the constants in the assumptions. This, the latter inequality and (5.7) imply

[TABLE]

This and Gronwall’s inequality yield that,

[TABLE]

From now on, we assume that $T\leq\frac{1}{2C}$ , so that $(1-CT)\geq\frac{1}{2}$ . Since $W_{q_{1}}(m^{1}(t),m^{2}(t))\leq{\mathbb{E}}\left[\left|X^{1}_{t}-X^{2}_{t}\right|^{q_{1}}\right]^{\frac{1}{q_{1}}}$ , we obtain:

[TABLE]

Hence $u$ satisfies the following equation,

[TABLE]

The right-hand side of the first line can be estimated in absolute value from above as follows:

[TABLE]

by T, (5.7) and (5.8). Since $u(T,\cdot)\in C^{1+\beta}\left({\mathbb{T}}^{d}\right)$ , Theorem $6.48$ in [34] yields that $u\in C^{\frac{1}{2}+\frac{\beta}{2},1+\beta}\left([0,T]\times{\mathbb{T}}^{d}\right)$ and it satisfies:

[TABLE]

This, (2.5) and (5.8) yield,

[TABLE]

Thus if we suppose furthermore that $T<C^{-\frac{2}{\beta}}$ , then $\nabla_{x}u=0$ , so $m=0$ by (5.8), then $\mu^{1}=\mu^{2}$ by (5.7), and finally $u^{1}$ and $u^{2}$ solve the same Hamilton-Jacobi-Bellman equation with the same terminal condition, so by uniqueness $u=0$ .

Therefore, we proved the uniqueness for $T<T_{1}$ where $T_{1}$ is defined by $T_{1}=\min T_{0},\left(C^{-\frac{2}{\beta}},C^{-1}\right)$ . ∎

Applications

Here, we are going to work on ${\mathbb{T}}^{d}$ , while it would be more realistic to work in the whole space $\mathbb{R}^{d}$ for the applications considered below. We would like to recall that the existence results contained in the present work hold for MFGC systems on $\mathbb{R}^{d}$ using the method introduced in [28] to pass from the torus to the whole Euclidean space. Therefore, the conclusions of this section may be adapted to treat the same applications on $\mathbb{R}^{d}$ .

6.1 Exhaustible ressource model with nonpositively correlated ressources

This model is often referred to as Bertrand and Cournot competition model for exhaustible ressources, introduced in the independent works of Cournot [17] and Bertrand [6]; its mean field game version in dimension one was introduced in [24] and numerically analyzed in [16]; for theoretical results see [8, 22, 27, 23]. We consider a continuum of producers selling exhaustible ressources. The production of a representative agent is $(q_{t})_{t\in[0,T]}$ ; the agents differ in their production capacities $X_{t}\in{\mathbb{T}}$ (the state variable), that satifies,

[TABLE]

where $\nu>0$ and $W$ is a Brownian motion. Each producer is selling a different ressource and has her own consumers. However, the ressources are substitutable and any consumer may change her mind and buy from a competitor depending on the degree of competition in the game (which is characterized by $\varepsilon$ in the linear demand case below for instance). Therefore, the selling price per unit of ressource that a producer can make when she sales $q$ units of ressource, depends naturally on $q$ and on the quantity produced by the other agents. The price satisfies a supply-demand relationship, and is given by $P\left(q,{\overline{q}}\right)$ , where ${\overline{q}}$ is the aggregate demand which depends on the overall distribution of productions of the agents. A producer tries to maximize her profit, or equivalently to minimize the following quantity,

[TABLE]

where $g$ is a terminal cost which often penalizes the producers who have non-zero production capacities at the end of the game. In the Cournot competition, see [17], a producer is controling her production $q$ . Like the MFG version of the Bertrand and Cournot competition introduced in [16], here we consider the Bertrand formulation [6], where an agent directly controls her selling price $\alpha=P(q,{\overline{q}})$ . After inverting the latter equality, the production can be viewed as a function of the price and the mean field. Mathematically this corresponds to writing $q=Q\left(\alpha,{\overline{\alpha}}\right)$ .

In [16], the authors considered a linear demand system depending on ${\overline{q}}_{\text{lin}}=\int_{{\mathbb{T}}}q(x)dm(x)$ , and a price satisfying $\alpha=P_{\text{lin}}(q,{\overline{q}}_{\text{lin}})=1-q-\varepsilon{\overline{q}}_{\text{lin}}$ . In this case, the running cost $L^{\text{lin}}$ and its Legendre transform $H^{\text{lin}}$ are defined by

[TABLE]

where $\alpha,p\in\mathbb{R}$ , $\mu\in{\mathcal{P}}\left({\mathbb{T}}\times\mathbb{R}\right)$ and ${\overline{\alpha}}$ is defined by ${\overline{\alpha}}=\int_{{\mathbb{T}}\times\mathbb{R}}{\widetilde{\alpha}}d\mu(y,{\widetilde{\alpha}})$ . Therefore the system of MFGC has the following form,

[TABLE]

for $(t,x)\in[0,T]\times{\mathbb{T}}$ . Roughly speaking, $\varepsilon=0$ corresponds to a monopoly in which a producer does not suffer from competition, and she plays as if she was alone in the game. Conversely, $\varepsilon=\infty$ stands for all the producers selling the same ressource and the consumers not having any preference.

Here, Theorem 2.5 d implies the following existence result.

Proposition 6.1.

If $m_{0}$ and $g$ satisfy A2 and A3, there exists a solution to (6.1) for any $\varepsilon\in(0,\infty)$ .

To prove it, we may take $q=2$ , $q_{0}=1$ , $\lambda_{0}=\frac{\varepsilon}{2(1+\varepsilon)}$ , $\lambda_{1}=1$ , and $C_{0}=\frac{1}{2}$ in FP1; then we check the assumptions of Theorem 2.5 d. In this case, the inequality in B3 has the form $1<\left(\frac{2+\varepsilon}{1+\varepsilon}\right)^{2}$ , and is satisfied for any $\varepsilon\in(0,\infty)$ .

Here, the Lagrangian $L^{\text{lin}}$ satisfies a monotonicity assumption, but the latter existence result does not take advantage of it. We refer to [28] for a uniqueness result and an other existence result for the solution to (6.1) using this monotonicity assumption. Generalizations of (6.1) to larger dimensions with more general Hamiltonians and prices are also discussed in [28] under the monotonicty assumption.

In what follows, we provide a simple example of a generalization of (6.1) in which the monotonicity assumption does not hold and the results in [28] do not apply anymore. However, the results in the present work may hold in some cases even without the monotonicity assumption.

Let us consider a model in which every producer sells $d$ different kinds of ressources. The price of each ressource depends on the mean field like in (6.1). Namely, we take $Q=M{\overline{\alpha}}-\alpha$ which is now a $d$ -dimensional vector and where $M\in\mathbb{R}^{d\times d}$ is a given matrix. This leads to the following MFGC system,

[TABLE]

Proposition 6.2.

Assume A2, A3, that $M$ has an operator norm smaller than $1$ , and that $f$ is continuous, and differentiable with respect to $x$ with continuous derivatives. There exists a solution to (6.3).

The proof consists in taking $q=2$ , $q_{0}=1$ , $\lambda_{1}=1$ , $C_{0}=\frac{1}{2}$ in FP1, and $\lambda_{0}=\frac{{\left\|M\right\|}}{2}$ , where ${\left\|M\right\|}$ is the operator norm of $M$ ; and we check the assumptions of Theorem 2.5 d.

The monotonicity assumption discussed in [28] is equivalent to assuming that $M$ is a positive semi-definite matrix. Here, we do not make such an assumpion.

What we have in mind in the latter example is the case where the prices of the different ressources may be negatively correlated, like cars and oil (if the production of cars increases, then the demand for oil also increases and the price of oil rises while the price of cars decreases), or pesticides and medicines, or gold and other raw materiels. To our knowledge, such a generalization of the exhaustible ressource model to negatively correlated ressources is new in the MFG literature.

More generally, we believe that our results hold for the following MFGC system under various different sets of assumptions that we will not detailed here,

[TABLE]

where $Q:[0,T]\times{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)\to\mathbb{R}^{d}$ is a vector characterizing the mean field interactions.

6.2 Price impact models with bid and ask prices

The price impact model without bid and ask prices is inspired by the Almgren and Chriss’s model [5], and was introduced in the MFG literature in [11] and [15] where existence and uniqueness results are proved when the admissible controls stay in a compact set. Here we consider an extension with bid and ask prices.

We suppose that a continuum of agents are trading an asset, the state of a representative agent is $X_{t}$ the amount of this asset she owns. Her control $\alpha$ is the quantity she buys (if $\alpha\geq 0$ ) or sell (if $\alpha<0$ ). The state space is the one-dimensional torus ${\mathbb{T}}$ , and $X_{t}$ is given by,

[TABLE]

where $W$ is a Brownian motion, and $\sigma>0$ is a real constant. We define $S_{t}$ as the asking price of the asset, and $\varepsilon\left(\mu(t)\right)$ as the difference between the bidding and asking prices, where $\mu(t)$ is the law of $(X_{t},\alpha_{t})$ . The agent buys at the bidding price $S_{t}+\varepsilon\left(\mu_{t}\right)$ , thus her cash is given by

[TABLE]

where $\ell$ is a differentiable function standing for the transaction cost. The price $S_{t}$ evolves accordingly with the amount of transactions at time $t$ , it satisfies the following SDE,

[TABLE]

The wealth of a representative agent is given by $V_{t}=V_{0}+X_{t}S_{t}+K_{t}$ and it satisfies the following SDE,

[TABLE]

The objective function that she will try to maximize is given by,

[TABLE]

where $f$ and $g$ are penalization costs for holding stocks. Here, the Lagrangian and Hamiltonian are given by,

[TABLE]

for $\left(x,\alpha,\mu\right)\in{\mathbb{T}}\times\mathbb{R}\times{\mathcal{P}}\left({\mathbb{T}}\times\mathbb{R}\right)$ , where $h$ is the Legendre transform of $\ell$ .

The linear-quadratic case with $\varepsilon=0$ is treated in [13]. Here, taking $\varepsilon=0$ corresponds to assuming that the bidding and asking prices coincide. In this case the optimal control is given by $-h_{p}(p)$ and does not depend explicitely on $\mu$ . If $\varepsilon\neq 0$ , the optimal control depends explicitely on $\mu$ and $L^{\text{PI}}$ is not separable in $\alpha$ and $\mu$ , this prevents us from using the results in [13].

Let us give an example of choices for the functions $\ell$ and $\varepsilon$ under which our result apply and a solution of the MFGC price impact model exists.

Proposition 6.3.

Assume A2, A3, that $f$ is $C^{1}$ , and that $c$ and $\varepsilon$ are respectively given by $\ell(\alpha)=\frac{|\alpha|^{2}}{2}$ and $\varepsilon\left(\mu\right)={\widetilde{\varepsilon}}\left(\int_{{\mathbb{T}}\times\mathbb{R}}|\alpha|^{2}d\mu\left(x,\alpha\right)\right)^{\frac{1}{2}}$ , where $0<{\widetilde{\varepsilon}}<\frac{1}{2}$ . There exists a solution to (1.2) with $H^{\text{PI}}$ .

This existence result is a consequence of 2.5 c, where the assumptions are satisfied for $q=q_{0}=2$ , $\lambda_{0}=\varepsilon$ , $\lambda_{1}=\frac{1}{4}$ and $C_{0}=1$ in FP1. We would like to insist on the fact that Theorem 2.5 c provides the existence of solutions for a wild class of Hamiltonian, larger than the one of the latter proposition and which goes beyond the linear-quadratic case.

Let us mention that we would be interested in defining the bidding price by $(1+{\widetilde{\varepsilon}})S_{t}$ , where ${\widetilde{\varepsilon}}>0$ . The associated MFGC system cannot be using the conclusions of the present work because the mean field interaction at time $t$ would depend not only on $\mu_{t}$ but on $\left(\mu_{s}\right)_{s\in[0,t]}$ . However, we believe that existence holds under similar assumptions as here, and we plan to prove it in forthcoming works.

6.3 First-order flocking model with velocity as controls

Cucker and Smale proposed a form of Vicseck model in [18] to illustrate the behavior of flocks of birds. This model is of second-order in the sense that the state of an agent is given by a couple $(x,v)$ standing for her position and velocity respectively, and the equation of evolution of her state involves considering her acceleration.

A game version of this model in which an agent controls her acceleration has been introduced in [36], the authors derived a MFG formulation in the infinite horizon case. Here we are interested in the finite horizon problem which was studied in [15, 13]. This model is still of second-order. More precisely the state of an agent is given by $(X_{t},V_{t})_{t\in[0,T]}$ respectively her position and velocity, two random processes which satisfy the following system of stochastic differential equations,

[TABLE]

where $a_{t}$ is the individual’s acceleration vector and her control, $W$ is a $d$ -dimentional Brownian motion, and $\sigma\in\mathbb{R}^{d\times d}$ is a positive definite matrix. The cost that a representative agent tries to minimize is given by

[TABLE]

where $\mu(t)\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ is the joint distribution of states and velocities of the agents, $\varphi$ is a $C^{1}$ nonincreasing function, and $f$ is a $C^{1}$ function modeling the spatial preferences of the agents (for instance, we can take $f$ significantly smaller in some areas which corresponds to where the food is).

Here we consider an alternative viewpoint in which an agent directly controls her velocity. This is a first-order model since the state of an agent is now given by a vector of ${\mathbb{T}}^{d}$ , and the acceleration does not appear anymore in the dynamics of a given agent, which is given by

[TABLE]

Here, the cost that an agent tries to minimize is given by

[TABLE]

First-order physical models are generally easier to study than second-order models. However the price we paid here to go from a second-order model to a first-order model is to consider a MFGC system instead of a MFG system without interaction through the controls.

If $\mu\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ and $m\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ are such that $m$ is the marginal of $\mu$ with respect to ${\mathbb{T}}^{d}$ , we define $A(x,\mu)$ and $Z(x,\mu)$ by,

[TABLE]

for $x\in{\mathbb{T}}^{d}$ . We define the Lagrangian of the first-order flocking model by,

[TABLE]

for $\left(x,\alpha,\mu\right)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , and the Hamiltonian by,

[TABLE]

for $p\in\mathbb{R}^{d}$ , such that $H^{\text{FM}}$ is the Legendre’s transform of $L^{\text{FM}}$ .

Proposition 6.4.

Under assumptions A2 and A3, there exists $T_{0}>0$ such that if $T<T_{0}$ , there exists a unique solution to (1.2) with $H^{\text{FM}}$ .

Hereafter, we present an other model for crowd motion which is very similar to the first-order flocking model discussed above. The main difference between these two models is the normalization constants. However, the assumptions and conclusions of this work are more adapted to the following crowd motion model and we can derive more existence results for it. We believe that these results can be adapted to the first-order Cucker-Smale system.

6.4 A model of crowd motion

This model of crowd motion has been numerically studied in [1] in the quadratic case, and has some similarities with the first-order flocking model presented in the previous paragraph. For $(x,\mu)\in{\mathbb{T}}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , we define $V(x,\mu)$ and $Z_{q_{0}}(x,\mu)$ by

[TABLE]

where $q_{0}\in(1,\infty]$ , $q_{0}^{\prime}$ is the conjugate exponent of $q_{0}$ , $k:{\mathbb{T}}^{d}\times{\mathbb{T}}^{d}\rightarrow\mathbb{R}_{+}$ is a nonnegative $C^{1}$ kernel, and $m\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ is the marginal of $\mu$ with respect to ${\mathbb{T}}^{d}$ . The quantity $V(x,\mu)$ is called the average drift.

The state of a representative agent is given by her position $X_{t}\in{\mathbb{T}}^{d}$ and she controls her velocity $\alpha_{t}$ ,

[TABLE]

Her objective is to minimize the cost given by,

[TABLE]

where $-1<{\widetilde{\lambda}}<1$ and $0\leq\theta\leq 1$ are two constants standing for the preference of an individual to have a similar (resp. opposite) control as the mainstream when ${\widetilde{\lambda}}>0$ (resp. ${\widetilde{\lambda}}<0$ ), $f$ and $g$ are respectively the running cost and the terminal cost which encode the spatial preferences of the agents, and $a^{\prime},b^{\prime}>1$ are exponents.

Here, we take $q=\min(a,b)$ . In this model we define the Lagrangian by,

[TABLE]

and the Hamiltonian as its Legendre transform. If $a=b=2$ , $H$ is given by

[TABLE]

If $\theta=1$ , $H$ satisfies

[TABLE]

For other choices of the parameters $a$ , $b$ and $\theta$ , $H$ does not admit an explicit form.

Proposition 6.5.

Assume that $g$ and $m_{0}$ satisfy A2 and A3 respectively. There exists a solution to (1.2) where $H$ is the Legendre transform of $L$ given in (6.5), under one of the following assertions,

a)

$q_{0}\leq q^{\prime}$ * and $a\neq b$ ,* 2. b)

$q_{0}\leq q^{\prime}$ * and one of the following assertions is satisfied,*

)

$\theta<\theta_{0}$ , 2. )

$\theta>1-\theta_{0}$ , 3. )

$\left|{\widetilde{\lambda}}\right|<\lambda_{0}$ ,

where $\theta_{0},\lambda_{0}\in(0,1)$ are constants coming from Theorem 2.5 c, 3. c)

$\theta=1$ , 4. d)

$k(x,y)$ * is constant,* 5. e)

$T<T_{0}$ , where $T_{0}$ is a positive constant coming from Theorem 2.5 e.

Proof.

We refer to the appendix, Lemma A.2 for the proof that $H$ satisfies A1-A3, B1-B3, FP1-FP2, and T. The existence results c, d and e are direct consequences of Theorem 2.5 c, d and e respectively.

We define ${\widetilde{L}}(\alpha,V)$ by

[TABLE]

for $\alpha,V\in\mathbb{R}^{d}$ , ${\widetilde{H}}(p,V)$ as the Legendre transform of ${\widetilde{L}}$ with respect to its first argument, and $\boldsymbol{\alpha}(p,V)$ as the unique control which achieves the maximum in the definition of ${\widetilde{H}}$ (it is unique because ${\widetilde{L}}$ is strictly convex with respect to $\alpha$ ).

Proof of a. Take $V\in\mathbb{R}^{d}$ and $\boldsymbol{\alpha}=\boldsymbol{\alpha}(0,V)$ , since $\boldsymbol{\alpha}$ achieves the maximum in the definition of ${\widetilde{H}}(0,V)$ , we know that

[TABLE]

which implies

[TABLE]

and then

[TABLE]

The two latter equalities yield $\displaystyle{\lim_{V\rightarrow+\infty}|\boldsymbol{\alpha}(0,V)|=+\infty}$ . We make out two cases:

•

if $a>b$ then we have $\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}<b^{\prime}-2$ , and $|\boldsymbol{\alpha}|=\displaystyle{\underset{+\infty}{o}(|V|)}$ . Therefore, (6.7) yields

[TABLE]

and $b^{\prime}-1-\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}=\frac{a-1}{b-1}>1$ , so we obtain

[TABLE]

which yields

[TABLE]

with $a^{\prime}<b^{\prime}$ , and $\frac{b-1}{a-1}b^{\prime}<b^{\prime}$ , and $b=q$ .

•

if $a<b$ then we have $\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}>b^{\prime}-2$ , and $\boldsymbol{\alpha}={\widetilde{\lambda}}V+\displaystyle{\underset{+\infty}{o}(|V|)}$ . Therefore, (6.7) yields

[TABLE]

We notice that $b^{\prime}-2-\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}=\frac{b^{\prime}-a^{\prime}}{a^{\prime}-1}<0$ , and we obtain

[TABLE]

This implies

[TABLE]

with $b^{\prime}<a^{\prime}$ , and $\frac{a-1}{b-1}a^{\prime}<a^{\prime}$ , and $a=q$ .

We conclude by (A.1) and Theorem 2.5 a.

Proof of b

Here, we assume that $a=b$ since the case $a\neq b$ is addressed in a.

Take $V\in\mathbb{R}^{d}$ , and $\boldsymbol{\alpha}=\boldsymbol{\alpha}(0,V)$ . In this case, ${\widetilde{H}}(0,V)$ admits an explicit form given by

[TABLE]

Therefore, taking ${\widetilde{\lambda}}$ , $\theta$ or $(1-\theta)$ small enough allows one to conclude by (A.1) and Theorem 2.5 b.

∎

Acknowledgements. I wish to express my gratitude to Y. Achdou and P. Cardaliaguet for technical advices, insightful comments and corrections. The work was supported by the ANR project MFG ANR-16-CE40-0015-01.

Appendix A Verification of the assumptions

for the model of crowd motion

We start by establishing some properties of the function $V$ in the following lemma.

Lemma A.1.

The function $V$ is $C^{1}$ with respect to $x$ and it satisfies

[TABLE]

where $\mu\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ .

For $m\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)$ and $\mu^{1},\mu^{2}\in{\mathcal{P}}_{m}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , the following inequality is satisfied,

[TABLE]

For $R>0$ , there exists $C_{R}>0$ a constant such that,

[TABLE]

for $\left(m^{i},\mu^{i}\right)$ such that $m^{i}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\right)\cap C^{0}\left({\mathbb{T}}^{d}\right)$ with $m^{i}\geq R^{-1}$ , $\mu^{i}\in{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ with $\alpha^{\mu^{i}}\in C^{0}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ and ${\left\|\alpha^{\mu^{i}}\right\|_{\infty}}\leq R$ , $i=1,2$ .

Proof.

The function $V$ has at least the same regularity as $k$ with respect to the state variable since $V$ is the convolution product of $k$ with a probability measure. Then (A.1) and (A.2) are straightforward using Hölder inequality. Let us take the same notation as in (A.3), for $x\in{\mathbb{T}}^{d}$ we get

[TABLE]

Moreover, we know that $Z_{q_{0}}\left(x,\mu^{1}\right)\geq R^{-\frac{1}{q_{0}^{\prime}}}\left(\int_{{\mathbb{T}}^{d}}k(0,y)^{q_{0}^{\prime}}dy\right)^{\frac{1}{q_{0}^{\prime}}}>0$ where the right-hand side does not depend on $x$ , and

[TABLE]

The latter two chains of inequalities imply (A.3) with $C_{R}=1+R^{1+\frac{1}{q_{0}^{\prime}}}+\frac{1}{q_{0}^{\prime}}R^{2}$ . ∎

Here, we assume $\theta\in(0,1)$ . Indeed, $H$ admits an explicit form when $\theta=0$ or $\theta=1$ , then checking A1-A3, B1-B3, FP1-FP2, and T is straightforward.

Lemma A.2.

Assumptions A1, B1-B3, FP1, FP2, and T are satisfied when $L$ is defined in (6.5).

Proof.

We define ${\widetilde{L}}$ , ${\widetilde{H}}$ and $\boldsymbol{\alpha}$ as in the proof of 6.5.

Checking A1, B1 and B2.

The Legendre transform of a function is convex, therefore $H$ is convex with respect to $p$ . Since $L$ is strictly convex, $H$ is differentiable with respect to $p$ . Moreover, $\boldsymbol{\alpha}=-H_{p}$ thus $H_{p}$ is continuous by the Maximum theorem. Then $H(x,p,\mu)=p\cdot H_{p}\left(x,p,\mu\right)-L\left(x,-H_{p}\left(x,p,\mu\right),\mu\right)$ , so $H$ is continuous. Finally, $H$ is differentiable with respect to $x$ by the envelop theorem and

[TABLE]

for $(x,p,\mu)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ .

Using the growth properties of $L$ , we can prove that there exists $C_{0}>0$ such that

[TABLE]

for any $\left(x,p,\mu\right)\in{\mathbb{T}}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}\left(\mathbb{R}^{d}\times\mathbb{R}^{d}\right)$ . We refer to [28] Lemma $2.5$ for a complete proof.

One may prove that the function $h:z\in\mathbb{R}^{d}\mapsto|z|^{a^{\prime}}\in\mathbb{R}$ satisfies $h(z)-h(y)-\nabla h(y)\cdot(y-x)\geq C_{R}^{-1}|y-z|^{\max(a^{\prime},2)}$ for $y,z\in\mathbb{R}^{d}$ such that $|y|\leq R$ , $|z|\leq R$ , where $C_{R}>0$ is a constant. This implies that for $R>0$ there exists $C_{R}>0$ a constant such that $L$ satisfies

[TABLE]

for $\left(\alpha^{1},\alpha^{2},\mu\right)\in\mathbb{R}^{d}\times\mathbb{R}^{d}\times{\mathcal{P}}_{\infty}\left({\mathbb{T}}^{d}\times\mathbb{R}^{d}\right)$ , such that $\left|\alpha^{i}\right|\leq R$ and $\Lambda_{q_{0}}\left(\mu\right)\leq R$ . This implies

[TABLE]

Take $p^{i}\in\mathbb{R}^{d}$ and $\alpha^{i}=-H_{p}\left(x,p^{i},\mu\right)$ , $i=1,2$ . Recalling the conjugacy relation $p^{i}=-L_{\alpha}\left(x,\alpha^{i},\mu\right)$ we obtain that $H_{p}$ is locally Hölder continuous with respect to $p$ .

Checking B3.

Take $(p,V)\in\mathbb{R}^{2d}$ and $\boldsymbol{\alpha}=\boldsymbol{\alpha}(p,V)$ , the optimal control $\boldsymbol{\alpha}$ satisfies

[TABLE]

If $(p,V)\neq(0,0)$ , this implies

[TABLE]

and

[TABLE]

From (A.8), we deduce that

[TABLE]

We recall that $\boldsymbol{\alpha}=-H_{p}(p,V)$ , hence

[TABLE]

which implies B3.

Proof that $\boldsymbol{\alpha}$ is differentiable with respect to $V$ at $(0,0)$ .

Take $V\in\mathbb{R}^{d}$ that will eventually tend to [math] and $\boldsymbol{\alpha}=\boldsymbol{\alpha}(0,V)$ . From (A.8) we obtain

[TABLE]

Let us recall inequalities (6.6) and (6.7).

•

if $a>b$ then $\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}<b^{\prime}-2$ , and we obtain the following expansion as $|V|$ tends to [math],

[TABLE]

•

if $a=b$ we obtain,

[TABLE]

•

if $a<b$ then $\frac{(a^{\prime}-2)(b^{\prime}-1)}{a^{\prime}-1}>b^{\prime}-2$ , and we obtain the following estimate as $|V|$ tends to [math],

[TABLE]

Therefore the derivatives of $\boldsymbol{\alpha}$ with respect to $V$ in any of the above three cases are:

[TABLE]

Proof that the operator norm of $D_{V}\boldsymbol{\alpha}=\left(\partial_{V^{j}}\boldsymbol{\alpha}^{i}\right)_{1\leq i,j\leq d}\in\mathbb{R}^{d\times d}$ is not larger than $\lambda$ .

Here, the norm of a square matrix $A\in\mathbb{R}^{d\times d}$ is defined by $\left\|A\right\|=\sup_{X\neq 0}\frac{\left|AX\right|}{\left|X\right|}$ . Let us introduce

[TABLE]

We recall that if $v_{i}\neq 0$ , then $v_{i}v_{i}^{T}$ is the orthogonal projection onto $\mathbb{R}v_{i}$ for $i=1,2$ .

If $\boldsymbol{\alpha}={\widetilde{\lambda}}V=0$ then $(p,V)=(0,0)$ , we see on (A.12) that $D_{V}\boldsymbol{\alpha}$ is a positive semi-definite matrix with eigenvalues in $[-\lambda,\lambda]$ . Therefore, we can now assume that $(\boldsymbol{\alpha},V)\neq(0,0)$ .

Let us assume temporarily that $a^{\prime}\neq 2,b^{\prime}\neq 2,\boldsymbol{\alpha}-{\widetilde{\lambda}}V\neq 0,\boldsymbol{\alpha}\neq 0$ . Then we differentiate the $i$ -th component of (A.8) with respect to $V^{j}$ ,

[TABLE]

This implies

[TABLE]

and thus

[TABLE]

We can check that this last equation holds in the general case for any $(\boldsymbol{\alpha},V)\neq(0,0),a^{\prime},b^{\prime}$ .

•

If $(a^{\prime}-2)v_{1}=0$ (i.e. $B=I_{d}$ ) or $(b^{\prime}-2)v_{2}=0$ (i.e. $C=I_{d}$ ), then (A.13) yields that $D_{V}\boldsymbol{\alpha}$ is a positive definite matrix with eigenvalues in $(-\lambda,\lambda)$ .

•

If $(a^{\prime}-2)v_{1}\neq 0$ , $(b^{\prime}-2)v_{2}\neq 0$ and $v_{1},v_{2}$ are aligned, Then $B$ and $C$ commute and $B^{-1}C$ is a positive definite matrix. Then (A.13) yields that $D_{V}\boldsymbol{\alpha}$ is a positive definite matrix with eigenvalues in $(-\lambda,\lambda)$ .

•

The last case consists of assuming that $(a^{\prime}-2)v_{1}\neq 0$ , $(b^{\prime}-2)v_{2}\neq 0$ , and $v_{1},v_{2}$ are linearly independent. We define $k$ by $k=\frac{(1-\theta)|\boldsymbol{\alpha}|^{b^{\prime}-2}}{\theta\left|\boldsymbol{\alpha}-{\widetilde{\lambda}}V\right|^{a^{\prime}-2}}>0$ . The two orthogonal subspaces ${\rm Span}(v_{1},v_{2})$ and $\{v_{1},v_{2}\}^{\bot}$ are stable by $D_{V}\boldsymbol{\alpha},B,C$ . The restriction of $D_{V}\boldsymbol{\alpha}$ to $\{v_{1},v_{2}\}^{\bot}$ is positive definite with eigenvalues in $(-\lambda,\lambda)$ .

Let us denote by ${\widetilde{A}},{\widetilde{B}},{\widetilde{C}}\in{\cal M}_{2\times 2}(\mathbb{R})$ respectively the restriction of $D_{V}\boldsymbol{\alpha},B$ and $C$ to ${\rm Span}(v_{1},v_{2})$ . We notice that

[TABLE]

thus the eigenvalues of ${\widetilde{B}}^{-1}$ are $1$ and $(a^{\prime}-1)^{-1}\leq 1$ since $a^{\prime}\geq 2$ . The eigenvalues of ${\widetilde{C}}$ are $1$ and $(b^{\prime}-1)\geq 1$ . Lemma A.3 below yields that $M=(I_{d}+k{\widetilde{B}}^{-1}{\widetilde{C}})(I_{d}+k{\widetilde{C}}{\widetilde{B}}^{-1})$ is a positive definite matrix with eigenvalues not smaller than $1$ . This implies

[TABLE]

This concludes the proof that the norm of $D_{V}\boldsymbol{\alpha}$ is not larger than $\lambda$ .

Proof of FP2.

Take $(p,V^{1},V^{2})\in\mathbb{R}^{3d}$ and $\boldsymbol{\alpha}^{i}=-{\widetilde{H}}_{p}\left(p,V^{i}\right),i=1,2$ , then

[TABLE]

Combining the latter inequality and (A.2), we conclude that FP2 is satisfied.

Proof of FP1.

Let $(p,V)\in\mathbb{R}^{2d}$ , we take $\boldsymbol{\alpha}=-H_{p}\left(p,V\right)$ .

•

We suppose $b^{\prime}\geq a^{\prime}$ , we make out two cases: the first case is when $|\boldsymbol{\alpha}|\leq|p|^{b-1}$ ; the second case is when $|\boldsymbol{\alpha}|>|p|^{b-1}=|p|^{\frac{1}{b^{\prime}-1}}$ which implies

[TABLE]

using (A.9). We recall that $1-\frac{b^{\prime}-2}{b^{\prime}-1}=b-1$ , hence

[TABLE]

•

We suppose that $b^{\prime}<a^{\prime}$ , we make out two cases: the first case is when $|\boldsymbol{\alpha}-{\widetilde{\lambda}}V|\leq|p|^{a-1}$ ; the second case is when $|\boldsymbol{\alpha}-{\widetilde{\lambda}}V|>|p|^{\frac{1}{a^{\prime}-1}}$ which implies

[TABLE]

where we used (A.9). From the equality $1-\frac{a^{\prime}-2}{a^{\prime}-1}=a-1$ , we deduce

[TABLE]

This concludes the proof of FP1.

Proof of T.

We proved above that $\boldsymbol{\alpha}$ is locally Lipschitz continuous with respect to $V$ and we recall that ${\widetilde{L}}$ is $C^{1}$ . Therefore ${\widetilde{H}}$ is also locally Lipschitz with respect to $V$ . This and (A.3) implies that T holds. ∎

Lemma A.3.

Let $B,C\in{\cal M}_{2\times 2}\left(\mathbb{R}\right)$ be two positive definite matrices with eigenvalues $(1,r)$ and $(1,s)$ respectively, and $0<r\leq 1,s\geq 1$ . Then for any $k>0$ the matrix $M$ defined by

[TABLE]

is positive definite with eigenvalues not smaller than $1$ .

Proof.

We can assume that $B,C$ have the following form:

[TABLE]

since the eigenvalues of $M$ are invariant by taking the conjugate of $B$ and $C$ by the same orthogonal matrix. The same argument and noticing that $C$ commutes with $\begin{pmatrix}1&0\\ 0&-1\end{pmatrix}$ , imply that we can assume that $U$ admits a positive determinant, and thus we can write it as

[TABLE]

with $\chi\in[0,2\pi)$ . In this case, $M$ is given by

[TABLE]

We name ${\widetilde{M}}$ the matrix in the last line of the latter calculation, $M$ and ${\widetilde{M}}$ have the same eigenvalues. Let us compute ${\widetilde{M}}$

[TABLE]

its trace is given by

[TABLE]

and its determinant by

[TABLE]

The eigenvalues of ${\widetilde{M}}$ are the roots of the following second-order polynomial function,

[TABLE]

its smallest root is

[TABLE]

which is not smaller than $1$ if and only if

[TABLE]

Therefore, it is sufficient to check that tr $({\widetilde{M}})\leq\det({\widetilde{M}})+1$ to conclude. We define the function $f:\mathbb{R}\rightarrow\mathbb{R}$ by

[TABLE]

This is a second-order polynomial in $x$ with

[TABLE]

If $(1+k)(1+krs)-(1+kr)(1+ks)=0$ , then $f$ is linear and thus $f(x)\geq 0$ for all $x\in[0,1]$ .

If $(1+k)(1+krs)-(1+kr)(1+ks)\neq 0$ , then the minimum of this polynomial function on $\mathbb{R}$ is obtained at $x_{\min}$ defined as

[TABLE]

since $0<r\leq 1,s\geq 1$ and $k>0$ . Thus $f$ has no local minimum on $[0,1]$ , then $f(x)\geq 0$ for all $x\in[0,1]$ since $f(0)\geq 0$ and $f(1)\geq 0$ .

Since $\det({\widetilde{M}})-\text{tr}({\widetilde{M}})+1=f(\cos^{2}\chi)\geq 0$ , this concludes the proof of the lemma. ∎

Bibliography36

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Y Achdou and Z Kobeissi. Mean field games of controls: Finite difference approximations, 2020.
2[2] Yves Achdou. Finite difference methods for mean field games. In Hamilton-Jacobi equations: approximations, numerical analysis and applications , volume 2074 of Lecture Notes in Math. , pages 1–47. Springer, Heidelberg, 2013.
3[3] Yves Achdou and Jean-Michel Lasry. Mean field games for modeling crowd motion. In Contributions to partial differential equations and applications , volume 47 of Comput. Methods Appl. Sci. , pages 17–42. Springer, Cham, 2019.
4[4] Clémence Alasseur, Imen Ben Taher, and Anis Matoussi. An extended mean field game for storage in smart grids. J. Optim. Theory Appl. , 184(2):644–670, 2020.
5[5] Robert Almgren and Neil A. Chriss. Optimal execution of portfolio trans-actions. 2000.
6[6] Joseph Bertrand. Théorie mathématiques de la richesse sociale. Journal des Savants , 67:499–508, 1883.
7[7] Charles Bertucci, Jean-Michel Lasry, and Pierre-Louis Lions. Some remarks on mean field games. Comm. Partial Differential Equations , 44(3):205–227, 2019.
8[8] Frédéric J. Bonnans, Saeed Hadikhanloo, and Laurent Pfeiffer. Schauder Estimates for a Class of Potential Mean Field Games of Controls. ar Xiv e-prints , page ar Xiv:1902.05461, Feb 2019.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On Classical Solutions to the Mean Field Game System of Controls

Abstract

Introduction

Related literature

Organization of the paper

Notations and assumptions

2.1 Notations and definitions

Definition 2.1**.**

2.2 Assumptions

Remark 2.2**.**

2.3 Main results

Lemma 2.3**.**

Lemma 2.4**.**

Theorem 2.5**.**

Theorem 2.6** (Uniqueness with short time horizon).**

Remark 2.7**.**

The fixed point relation in μ\muμ

Lemma 3.1**.**

Proof.

Lemma 3.2**.**

Proof.

A priori estimates

4.1 A priori estimates on uuu

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Corollary 4.3**.**

Proof.

4.2 A priori estimates on mmm

Lemma 4.4**.**

Proof.

4.3 A priori estimates on derivatives of uuu

Lemma 4.5**.**

Proof.

Lemma 4.6**.**

Proof.

Existence and uniqueness results

5.1 Solving the MFGC systems for M<∞M<\inftyM<∞

Lemma 5.1**.**

Proof.

Corollary 5.2**.**

Proof.

5.2 Existence results

Proposition 5.3** **(*Existence of solution

Proposition 5.4**.**

Proof.

5.3 Existence results

Proposition 5.5**.**

Proof.

Proposition 5.6** **(*Existence

Proof.

Remark 5.7**.**

5.4 Existence and uniqueness results with

Proposition 5.8** (Existence with short time horizon).**

Proof.

Proof of Theorem 2.6.

Applications

6.1 Exhaustible ressource model with nonpositively correlated ressources

Proposition 6.1**.**

Proposition 6.2**.**

6.2 Price impact models with bid and ask prices

Proposition 6.3**.**

6.3 First-order flocking model with velocity as controls

Proposition 6.4**.**

6.4 A model of crowd motion

Proposition 6.5**.**

Proof.

Appendix A Verification of the assumptions

Lemma A.1**.**

Proof.

Lemma A.2**.**

Proof.

Lemma A.3**.**

Definition 2.1.

Remark 2.2.

Lemma 2.3.

Lemma 2.4.

Theorem 2.5.

Theorem 2.6 (Uniqueness with short time horizon).

Remark 2.7.

The fixed point relation in $\mu$

Lemma 3.1.

Lemma 3.2.

4.1 A priori estimates on $u$

Lemma 4.1.

Lemma 4.2.

Corollary 4.3.

4.2 A priori estimates on $m$

Lemma 4.4.

4.3 A priori estimates on derivatives of $u$

Lemma 4.5.

Lemma 4.6.

5.1 Solving the MFGC systems for $M<\infty$

Lemma 5.1.

Corollary 5.2.

Proposition 5.3 (*Existence of solution

Proposition 5.4.

Proposition 5.5.

Proposition 5.6 (*Existence

Remark 5.7.

Proposition 5.8 (Existence with short time horizon).

Proposition 6.1.

Proposition 6.2.

Proposition 6.3.

Proposition 6.4.

Proposition 6.5.

Lemma A.1.

Lemma A.2.

Lemma A.3.