On Classical Solutions to the Mean Field Game System of Controls
Z Kobeissi (UPD7)

TL;DR
This paper investigates classical solutions to the mean field game system of controls, establishing existence and uniqueness results for the associated PDEs under natural assumptions, with applications illustrated through examples.
Contribution
It provides new existence and uniqueness theorems for mean field game systems involving controls, using Bernstein's method for a priori estimates.
Findings
Existence of solutions under natural assumptions
Uniqueness under more restrictive conditions
Application to specific examples
Abstract
In this paper, we consider a class of mean field games in which the optimal strategy of a representative agent depends on the statistical distribution of the states and controls. We prove some existence results for the forward-backward system of PDEs under rather natural assumptions. The main step of the proof consists of obtaining a priori estimates on the gradient of the cost function by Bernstein's method. Uniqueness is also proved under more restrictive assumptions. The last section contains some examples to which the previously mentioned existence (and possibly uniqueness) results apply.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On Classical Solutions to the Mean Field Game System of Controls
Z. Kobeissi Laboratoire Jacques-Louis Lions, Univ. Paris Diderot, Sorbonne Paris Cité, UMR 7598, UPMC, CNRS, 75205, Paris, France. [email protected]
Abstract
We consider a class of mean field games in which the optimal strategy of a representative agent depends on the statistical distribution of the states and controls.
We prove some existence results for the forward-backward system of PDEs under rather natural assumptions. The main step of the proof consists of obtaining a priori estimates on the gradient of the value function by Bernstein’s method. Uniqueness is also proved under more restrictive assumptions.
Finally, we discuss some examples to which the previously mentioned existence (and possibly uniqueness) results apply.
Introduction
The theory of Mean Field Games (MFG for short) has been introduced in the independent works of J.M. Lasry and P.L. Lions [31, 32, 33], and of M.Y. Huang, P.E. Caines and R.Malhamé [25, 26]. It aims at studying deterministic or stochastic differential games (Nash equilibria) as the number of agents tends to infinity. The agents are supposed to be rational (given a cost to be minimized, they always choose the optimal strategies), and indistinguishable. Furthermore, the agents interact via some empirical averages of quantities which depend on the state variable.
At the limit when , the game may be modeled by a system of two coupled partial differential equations (PDEs), which is named the MFG system. On the one hand, there is a Fokker-Planck-Kolmogorov equation describing the evolution of the statistical distribution of the state variable; this equation is a forward in time parabolic equation, and the initial distribution at time is given. On the other hand, the optimal value of a generic agent at some time and state is noted and is defined as the lowest cost that a representative agent can achieve from time to if it is at state at time . The value function satisfies a Hamilton-Jacobi-Bellman equation posed backward in time with a terminal condition involving a terminal cost. In the present work, we will restrict our attention to the case when the costs and the dynamics are periodic in the state variable, and we will work in the -dimensional torus (as it is often done in the MFG literature for simplicity). We will take a finite horizon time , and will only consider second-order non-degenerate MFG systems. In this case, the MFG system is often written as:
[TABLE]
We refer the reader to [10] for some theoretical results on the convergence of the -agents Nash equilibrium to the solutions of the MFG system. For a thorough study of the well-posedness of the MFG system, see the videos of P.L.Lions’ lecture at the Collège de France, and some lecture notes [9].
There is also an important literature on the probabilistic aspects of MFGs, see [12, 29] for some examples and [13, 14] for a detailed presentation of the probabilistic viewpoint.
For applications of MFGs, numerical simulations are crucial because it is most often impossible to find explicit or semi-explicit solutions to the MFG system. We refer to [2] for a survey on finite difference methods and to [3] for applications to crowd motion.
Most of the literature on MFGs is focused on the case when the mean field interactions only involves the distributions of states. Here we will consider a more general situation in which the cost of an individual agent depends on the joint distribution of states and optimal strategies. To underline this, we choose to use the terminology Mean Field Games of Controls (MFGCs) for this class of MFGs; the latter terminology was introduced in [11]. Within this framework, the usual MFG system (1.1) is replaced by the following MFGC system,
[TABLE]
We would like to point out two of the main difficulties that one may encounter when studying (1.2) and which are not present in the study of (1.1).
- 1)
The joint law of states and controls satisfies a fixed point relation described by (1.2c). 2. 2)
The HJB equation (1.2a) is non-local with respect to . Consequently, it is much more difficult to obtain uniform a priori estimates on and the its derivatives.
Difficulty 1 is in general not straightforward and one needs to make assumptions for the fixed point in to have a unique solution when are given. An example in which this fixed point relation does not admit any solution is given in [1] Remark .
Let us provide a simple illustration for describing difficulty 2 by comparing the results obtained when we apply the maximum principle on parabolic equations to (1.1a) and (1.2a) respectively: if satisfies (1.1a) where and are assumed to be uniformly bounded with respect to , then is uniformly bounded; under the same assumption on , if is a solution to (1.2a) and is not uniformly bounded with respect to , we can only say that is bounded in absolute value by a constant depending on . The other estimates used in the usual arguments of existence in MFG sytems suffer the same lack of uniformity with respect to . Conversely, the estimates of depend on . It is not obvious a priori how to combine the estimates on and in order to obtain uniform estimates on . Consequently, compactness results are harder to obtain for (1.2) than for (1.1).
The main assumption of this paper, namely FP1 and FP2 described below, is an original structural assumption designed to address difficulty 1. In particular, it implies that the map
[TABLE]
is a contraction in a convenient metric space, when are given.
Moreover, we also assume that the Hamiltonian behaves like a power function when tends to infinity. See paragraph 2.2 for more details.
The main objective of this work is to discuss existence of the solutions of the MFGC system (1.2) within this framework. We will also give a uniqueness result under a short time horizon assumption. We refer to [1] for a numerical application with multiple solutions. Indeed, uniqueness does not hold in general for arbitrary time horizon. It can be obtained though, under a monotonicity assumption which is investigated in the companion paper [28]. In [28], existence and uniqueness of solutions of the MFGC system are proved under the above-mentioned monotonicity assumption and with Hamitonian having similar growth as in the present paper. This monotonicity condition implies that the agents favor moving in a direction opposite to the mainstream. Such an assumption is adapted to some models coming from finance or economy; and may be unrealistic in several situations, in particular in models of crowd motions. This explains why here we introduce a new structural assumption and refrain from assuming monotonicity or investigating uniqueness in the general case.
Related literature
In the first articles devoted to MFGCs, [20, 21], D. Gomes and his collaborators have given several existence results for MFGCs in various cases, using the terminology extended MFGs instead of MFGCs. For instance, [21] contains existence results for stationary games (infinite horizon) under the assumption that some of the parameters involved in the models are small. We refer to [7, 11, 15, 13, 28] for other existence and uniqueness results for MFGC systems.
Uniqueness is a major issue in MFG theory, it has been proved for (1.1) in [33, 35] under an assumptions called the Lasry-Lions monotonicity on the coupling function and the terminal cost in the case of non-local coupling. This assumption has been extended to MFGC and discussed in [20, 13, 28] in which uniqueness is proved. It translates the fact that the agents prefer directions opposite to the mainstream direction; therefore it is not adapted to a large class of MFGC systems like crowd motion models in which an agent is more likely to go in the mainstream direction.
The latter example of population dynamic is the typical application we had in mind when writing the assumptions in the present paper, see paragraphs 6.3 and 6.4. To our knowledge, existence results for such MFGC systems have not been discussed in the literature before. Uniqueness should not hold in general but under a short-time assumption. We refer to [1] in which the MFGC system is discretized using a finite-difference scheme and simulations are provided where the approximating discrete MFGC system admits several different solutions.
For other applications of MFGCs we refer to [11] for an model of optimal trading, [8, 16, 22, 24, 27] in the case of competition between firms producing the same goods, or [4] for energy storage.
Organization of the paper
Section 2 describes the notations, assumptions and main results in this paper. In Section 3, we address difficulty 1 which consists of inverting the fixed point relation in (1.2c) and providing estimates on the resulting flow of measures. Section 4 is devoted to proving a priori estimates on the solutions to (1.2) and addresses difficulty 2. Section 5 contains the proofs of the main results. Finally, we discuss several applications in Section 6. Namely, we study
- •
the Bertrand and Cournot competition for exhaustible ressources and introduce an extension to negatively correlated ressources (for instance gold and other raw materials);
- •
a model of price impact for high-frequency trading by Almgren and Chriss in which we discuss the possibility for the bid and ask prices to be different;
- •
a first-order flocking model;
- •
a crowd motion model.
Notations and assumptions
2.1 Notations and definitions
The spaces of probability measures are equipped with the weak* topology. We denote by the subset of measures in with a second marginal compactly supported. For and , we define the quantities and by,
[TABLE]
Jensen inequality states that,
[TABLE]
for any .
For , we denote by the subset of measures in such that . The probability measures involved in (1.2) and (2.4), have a particular form, since they are the images of a measure on by , where is a bounded measurable functions from to ; in particular they are supported on the graph of . For , we call the set of such measures. For , we set to be the unique element of such that . Here, and defined in (2.1) are given by
[TABLE]
If is a normed space and is its norm, for we denote by the set of bounded continuous functions from to ; it is endowed with the norm .
We define as the set of the functions differentiable at any point with respect to the state variable, and such that its gradient satisfies . This is a Banach space equipped with the norm .
For and , we denote by the parabolic space of Hölder continuous functions which is commonly defined by
[TABLE]
This is a Banach space equipped with the norm,
[TABLE]
The space is defined as the set of the functions such that and which admits a finite norm defined by,
[TABLE]
We set to be the set of functions which admit first derivative with resepct to time and second derivatives with respect to the state variables, such that these derivatives are continuous with respect to time and state.
Throughout the paper, what we call a solution to (1.2) is precisely defined by the following definition.
Definition 2.1**.**
The triple is a solution to (1.2) if is a pointwise solution to the Hamilton-Jacobi-Bellman equation (1.2a) with terminal condition (1.2d), is solution to the Fokker-Planck-Kolmogorov equation (1.2b) in the sense of distribution with initial condition (1.2e), and satisfies (1.2c) at any .
A simple way to overcome difficulty 2 is to assume that the Hamiltonian and some of its derivatives admit uniform bounds with respect to . In this case, the well-posedness of the MFGC system with a possibly degenerate diffusion is investigated in [11]. Here we avoid such an assumption for (1.2) but we introduce the following approximating system which satisfies it,
[TABLE]
where is a positive constant and is a truncation map defined by
[TABLE]
The latter definition can be naturally extended to the case when by taking . In this case systems (1.2) and (2.4) coincide. A solution to (2.4) is defined by replacing (1.2) by (2.4) in Definition 2.1.
If and is a solution to (2.4), is compactly supported in the closed ball in centered at [math] with radius , for any . Consequently, and should satisfy estimates depending on and uniform with respect to . Therefore, compactness results for (2.4) should be less demanding than for (1.2) and difficulty 2 should vanish.
2.2 Assumptions
Let us start with some reasonable assumptions about the regularity and the boundedness of the Hamiltonian, the terminal cost and the inital distribution of agents. We introduce two constants: and .
- A1
is convex with respect to , and differentiable with respect to ; is locally -Hölder continuous with respect to ; and are continuous with respect to on for any , where is defined in paragraph 2.1 and equipped with the weak* topology. 2. A2
is continuous, and we suppose that is in , with a norm bounded uniformly with respect to , i.e.
[TABLE] 3. A3
is absolutely continuous with respect to the Lebesgue measure on and we also name its density (abuse of notation). Assume that and is positive (see Remark 2.2 below to drop out the positivity assumption).
These assumptions are not restrictive when looking for solutions with the regularity given in Definition 2.1. However, they can be relaxed if we are interested in weaker solutions of systems (1.2).
In this paper we consider nonlocal coupling through the controls. More precisely, we assume that these interactions involve the quantity defined in (2.1).
Let us introduce the assumptions used to address difficulty 1 which consists of solving the fixed point relations in given in (1.2c) and (2.4c), when are fixed and have the same regularity as in Definition 2.1. We introduce , for all , and , we assume that,
- FP1
. 2. FP2
.
These structural assumptions for MFGC are new in the literature and participate to the originality and novelty of the results presented in this paper. Moreover they do not seem to be restrictive as it is explained in what follows.
We recall that the optimal control of a representative agent is given by . Since is homogeneous to the norm of a control, we cannot expect the dependency of upon to involve an exponent larger than one. Moreover if is the first marginal of , taking the -norm in FP1 makes appear in both sides of the resulting inequality; this explains the form of the right-hand side in FP1 and the necessity of choosing smaller than . Similar arguments can provide insights on FP2, by noticing that if was seen as a norm on then would be the associated distance. We refer to Remark in [1] for a concrete example of a MFGC system which does not admit solution if .
As in a large part of the literature on MFG or HJB equations, we consider Hamiltonians that are power-like functions in at least asymptotically. Let be this asymptotic exponent, and the conjugate exponent of defined by . Namely, we assume that satisfies the following inequalities, for all , , , and ,
- B1
, with . 2. B2
. 3. B3
, where is a nonnegative constant satisfying .
One may notice that the dependencies of upon and involve different exponents (which happen to be equal when ). Indeed the Legendre transform applied to a power-like function make the exponent change into its conjugate. Since is defined as the Legendre transform of the Lagrangian , the exponent in the dependency of upon should be . Moreover, is homogeneous to the norm of a control, therefore should at most involve . Going back to the Hamiltonian by the Legendre transform, the exponent on stays the same which explains the right-hand side in B1-B3. One may find the abovementionned growth conditions on in [28].
Assumption B3 is a convexity property of with respect to . In MFG without coupling through the controls, such an assumption is common, the only difference is that the term in does not appear. This assumption will be particularly useful to obtain energy integral estimates by taking advantage of the duality properties of the forward-backward systems (1.2a), (1.2b) and (2.4a), (2.4b). The inequality satisfied by is needed in the calculation for getting these estimates. Let us mention that the right-hand side in this inequality comes from the estimates in Lemma 3.1, and that the constant can be identified with the one in FP1.
In order to obtain classical solutions of the HJB equations (1.2a) and (2.4a), we need Hölder continuity of . While the space regularity of the latter map is straightforward here, its time regularity may be more demanding and we need assumptions which allow one to compare at different measures . Assumption FP2 is not enough since it requires and to share the same marginal with respect to .
- T
For , there exists a constant such that
[TABLE]
for such that with , with , with and , .
One may notice that when and have the same first marginal with respect to the second inequality in T is implied by FP2. If one is only interested in weak solution to (1.2), T can be removed.
Remark 2.2**.**
Letting depends on was motivated by models of population dynamics which are discussed in paragraphs 6.3 and 6.4. The drawback of this assumption is that we have to assume that the initial distribution of agents is positive.
All the results in this paper hold if we do not assume to be positive in A3, and we remove the condition in T.
2.3 Main results
We recall that assumptions FP1 and FP2 are designed to address difficulty 1, and T to obtain time regularity of the fixed point in 1.2c or 2.4c. More precisely, we state the following lemma that will be proved in Section 3.
Lemma 2.3**.**
Assume A1, FP1, FP2 and T. Take and such that and for , where and are constants. For any , there exists a unique satisfying
[TABLE]
where . Moreover, the map is in , and its associated norm can be estimated from above by a constant which depends on , , and the constants in the assumptions.
In Section 4, we prove the a priori estimates stated in the following lemma.
Lemma 2.4**.**
Assume A1-A3, B2, B3, FP1, FP2 and T. If is a solution to (2.4) for , then
- •
* and , where is independent of and depends only on the constants in the assumptions,*
- •
* is positive,*
- •
,
- •
,
- •
* is in ,*
where . Moreover, and the norms associated with the last three items above depend only on , , and the constants in the assumptions.
These estimates are weaker than their equivalents for MFG systems without interaction through controls. In particular, is not uniformly bounded in -norm. However, we believe that our estimate of is the best that we can achieve in our framework since its right-hand side should be at least linear with respect to . To our knowledge, such an estimate for systems of MFG with nonlocal dependency on (or more generally for MFG systems in which we do not have a uniform a priori estimate on ) is new in the literature.
Here, these a priori estimates are not sufficient to address the difficulty 2 and to obtain existence of solutions. However, existence can be obtained under several different kinds of assumptions; below, we supply a list of existence results under various assumptions:
Theorem 2.5**.**
Assume A1-A3, B1-B3, FP1, FP2, T. There exists a solution to (1.2) if one of the following assertions is satisfied
- a)
* and , where is a constant satisfying (Proposition 5.4),* 2. b)
* and where and are respectively defined in B3 and B1, the on the left-hand side comes from in B3 and the on the right-hand side comes from FP1 (Proposition 5.3),* 3. c)
, for any (Proposition 5.5), 4. d)
, for any (Proposition 5.6), 5. e)
, where is a constant depending on the constants in the assumptions (Proposition 5.8).
An other additional assumption under which existence holds is the monotonicity condition addressed in [28].
We also give a uniqueness result under a short time horizon assumption.
Theorem 2.6** (Uniqueness with short time horizon).**
Assume A1, A2, A3, B1, B2, B3, FP1, FP2, and that the following three assumptions are satisfied,
- •
* is locally Lipschitz continuous with respect to ,*
- •
* satisfies*
[TABLE]
for any , where and is the -Wassertein distance on measures,
- •
the two inequalities in T hold when we replace by .
There exists such that if then there is at most one solution to (1.2).
We believe that this uniqueness result can be easily extended to more general Hamiltonians, but that the short-time assumption is essential. Indeed numerical examples in which non-uniqueness occurs are presented in [1]. In these examples, we consider groups of agents who start from some crowded areas at time , and travel through the domain to arrive at some target areas. Imposing a short time assumption in such an example results in the agents not trying to reach the targets at all. Indeed in this case the kinetic cost makes it more expensive for them to cross the domain very quickly before the end of the game than to do nothing and just wait passively at their starting point. For this reason we were not interested in finding less restrictive assumptions in Theorem 2.6. This theorem should be only seen as an example of uniqueness result with a short time horizon assumption. In particular we wanted the proof in paragraph 5.4 to stay simple.
Remark 2.7**.**
- )
In this work, we only consider MFGC systems in the -dimensional torus . However, we believe that our existence results (Theorem 2.5) hold under the same assumptions on the Euclidean space , and that the method introduced in **[28]** to pass from to can applied here. 2. )
We did not include the case in this work (i.e. when the Hamiltonian is Lipschitz continuous in ). In this case, systems (1.2) and (2.4) coincide when is large enough, therefore there exists a solution to (1.2) under assumptions A1-A3, B1-B3, FP1, FP2 and T, by the same arguments as in Lemma 5.1.
The fixed point relation in
and the proof of Lemma 2.3
We recall that (1.2) and (2.4) conincide when . Here, we take .
The following lemma takes advantage of the structural assumptions FP1 and FP2 to solve the fixed point relations (1.2c) and (2.4c) which consists of difficulty 1. It also states a priori estimates on which will be of great use in the next section to obtain a priori estimates on and its derivatives.
Lemma 3.1**.**
Assume A1, FP1 and FP2. Take , and . The following two assertions are satisfied.
- (i)
There exists a unique such that
[TABLE]
For any , it satisfies
[TABLE] 2. (ii)
The map given by (3.1), is continuous from to . We recall that the spaces of measures are equipped with the weak- topology.*
Proof.
Let us define the following map,
[TABLE]
This map is well defined by (A1). It is -Lipschitz continuous by FP2 and the fact that is -Lipschitz continuous, we recall that . Therefore it admits a unique fixed point by the Banach fixed point theorem. If satisfies (3.1) then is the only fixed point of . Conversely, if we denote by the fixed point of , then defined by satisfies (3.1). This implies that (3.1) admits a unique fixed point that we name in what follows. From FP1, satisfies,
[TABLE]
for , where we obtained the last line by using the triangle inequality for the -norm, and (2.2). This implies (3.2) for any . Then we extend this result to by combining (2.2) and (3.2) applied to . 2.
Let be a convergent sequence to in . We define as before, and the fixed points satisfying
[TABLE]
for . The sequence is bounded in , thus (3.2) with yields that are uniformly compactly supported. The sequence is compact in endowed with the weak-* topology. Let be the limit of a subsequence , for an increasing function. By continuity of and , we can pass to the limit in (3.3) taken at when tends to infinity, this gives that satisfies the same fixed point relation as . By uniqueness of this fixed point, we deduce that . This implies that the entire sequence tends to .
Therefore the map is continuous from to .
∎
In particular if , (2.2) and (3.2) yield
[TABLE]
and then we use the inequality which holds for and for any , to obtain
[TABLE]
If without restriction, we obtain
[TABLE]
These latter three inequalities will be of great use in Section 4 for getting a priori estimates.
Given as regular as in definition 2.1, we can use Lemma 3.1 to prove that the fixed point relations (1.2c) and (2.4c) are well-posed, and that if is a solution to (1.2) or (2.4) then is continuous with respect to time. However, we need a better regularity in time to get classical solution of the HJB equations (1.2a) and (2.4a). In Lemma 3.2, we use T to obtain an estimate of the distance between two fixed points of (3.1) associated with different . We will be particularly interested in using this estimate on a solution to (2.4) at different times.
Lemma 3.2**.**
Assume A1, FP1, FP2 and T. Take , and some positive probability measures. We define as the fixed point in in Lemma 3.1 associated with and , respectively. There exists a constant such that
[TABLE]
where depends on , , for , and the constants in the assumptions.
Proof.
We define by . The triangle inequality and the fact that is a contraction imply that for any ,
[TABLE]
The measures and are the image measures by the same function , of and respectively. From T, we obtain
[TABLE]
where and is the constant defined in T. We recall that can be estimated from above by a quantity which only depends on and the constants in the assumptions, by (3.2).
Since and have the same marginal with respect to , FP2 yields that,
[TABLE]
Then is locally -Hölder continuous by A1 so,
[TABLE]
for some constant . Combining the latter four inequalities, we obtain,
[TABLE]
which implies (3.7) up to replacing with . ∎
Lemma 2.3 is a straightfoward consequence of Lemmas 3.1 and 3.2.
A priori estimates
and the proof of Lemma 2.4
Here we take , and a solution to (2.4) defined in Definition 2.1. We will look for estimates independent of which allow us to address difficulty 2. These a priori estimates imply compactness results and play an essential role in the proofs of existence in Section 5.
4.1 A priori estimates on
When we consider MFG without interactions through controls and with bounded coupling function and terminal cost, we can apply the maximum principle on parabolic differential equations to (4.3) below and get an a priori estimates of which only depends on the constants in the assumptions. However, for MFGC systems and more generally for HJB equations with non-local interactions in , it is not possible to get such a strong a priori estimate directly from the maximum principle. Instead we get (4.1) and (4.2) which involve non-local quantities depending on .
Lemma 4.1**.**
Under assumptions A1, A2, B1, FP1, FP2, and , for satisfies,
[TABLE]
where is defined in B1. More generally, for any satisfies,
[TABLE]
Proof.
Here, we can rewrite (2.4a) in the following way,
[TABLE]
for . The maximum principle for parabolic second-order equation applies to and ,
[TABLE]
Moreover, and come from B1 and A2, respectively. We combine the latter inequalities with (3.5) and (3.6) to get (4.1) when , and (4.2) respectively. ∎
The non-local term in (4.1) involving corresponds roughly speaking to an energy. Moreover this is a quantity that naturally appears in MFG literature thanks to duality properties in the forward-backward systems (1.1), (1.2), or (2.4). More precisely, the FPK equations is the dual equation of the linearized HJB equation with respect to . Lemma 4.2 provides an a priori estimate of this quantity.
Lemma 4.2**.**
Under assumptions A1, A2, B3, FP1, FP2, and , the following inequality is satisfied,
[TABLE]
for any such that .
Proof.
We multiply (2.4a) by and (2.4b) by ; we add up and integrate over the resulting quantities; after performing some integrations by part, we obtain
[TABLE]
that we can combine with B3 and A2 to get,
[TABLE]
We integrate (3.5) over ,
[TABLE]
where we can choose such that , since satisfies the inequality in B3. The latter three inequalities imply (4.5). ∎
Roughly speaking, Lemma 4.1 with and Lemma 4.2 provide opposite inequalities which may become complementary under a smallness condition on the parameters, implying a uniform estimate on . This condition is explicitely given in the following corollary.
Corollary 4.3**.**
Under Assumptions A1, A2, B1, B3, FP1, FP2, , and , is bounded by a quantity which only depends on the constants in the assumptions.
Proof.
Combing (4.1) and (4.5) results in,
[TABLE]
where may be chosen such that , and is a positive constant depending on the constants in the assumptions and . This implies
[TABLE]
where , which concludes the proof. ∎
Let us mention that in the assumption in Corollary 4.3, the constant in the left-hand side comes from the in B3, and the in the right-hand side comes from FP1.
4.2 A priori estimates on
In order for the HJB equations (1.2a) and (2.4a) to admit classical solutions, we want to be regular in time. Since is the marginal of with respect to , we first prove that is regular in the following lemma. Moreover, we also prove that stays positive, which is required in Lemma 2.3 to obtain time regularity on .
Lemma 4.4**.**
Under assumptions A1, A3, FP1, FP2, is in for and its -norm can be estimated from above by a constant which depends on , , and the constants in the assumptions.
Furthermore, is positive everywhere and admits a positive lower bound which only depends on , and the constants in the assumptions.
Proof.
The distribution of agents satisfies the second-order parabolic FPK equation (2.4b), which is supplemented with a -Hölder continuous initial condition. Theorem section in [30] states that is uniformly bounded by a constant which depends on and . This, (FP1) and (3.4) yield that is bounded by a constant which depends on , and the constant of the assumptions. Finally, Theorem in [34] yields that for , and its associated norm can be estimated from above by a constant which depends on , , and the constants in the assumptions.
We define , for . In particular is positive, since we proved in the latter paragraph that is continuous. On we define the function by , it satisfies the following partial differential equation in the sense of viscosity,
[TABLE]
supplemented with the initial condition , where for . We define as the unique weak solution of the following partial differential equation defined on ,
[TABLE]
supplemented with the initial condition . Theorem section in [30] states that is bounded from above by a constant which depends on , and . Moreover, is a subsolution of the restriction of (4.6) to , with the same initial condition as . Therefore, by a comparison argument for second-order parabolic equations in divergence form (Theorem in [34] for instance), and satisfy . This implies that there exists a positive constant independent of , such that . We conclude the proof by taking and recalling that can be estimated from above using FP1 and (3.4). ∎
4.3 A priori estimates on derivatives of
Bernstein methods are useful tools when studying HJB equations or MFG systems. They allow one to obtain a priori estimates on by considering the partial differential equations satisfied by some well-chosen functions depending on and . See for example the video of the lecture of P.L. Lions on November the rd [35], in which Bernstein estimates are derived for MFG systems without interactions through controls. More precisely, P.L. Lions used the function defined by , for small . Here this method might work only if we knew a uniform estimates on and if . After significant changes in the latter method, we can derive an estimate on which is weaker than the one for MFG without interactions through controls. Namely, we state that is bounded by a quantity that depends linearly on by studying the functions and defined in (4.11) below. To our knowledge, such estimates for systems of MFG with nonlocal dependency on (or more generally for MFG systems in which we do not have a uniform a priori estimate on ) are new in the literature. We believe that this result may hold for more general HJB equations with nonlocal dependency on .
Lemma 4.5**.**
Under assumptions A1, A2, B2, B3, FP1 and FP2, there exists depending only on the constants of the assumptions, such that
[TABLE]
for any .
Proof.
In what follows, we only prove that (4.7) holds for , however the proof does not use additional information available at (the initial condition on for example), so it can be repeated for any and the constant in (4.7) does not depend on .
Here we wish to differentiate (2.4a) with respect to ; however we did not assume in Definition 2.1 enough regularity on for such an operation to have sense pointwisely on . Especially the time derivative of and the third derivatives of with respect to are not required to exist. This leads us to introducing a non-negative mollifier such that if and . We introduce and , for any and , where denotes the convolution operator.
Thus depends smoothly on the state variable and its partial derivatives in space at any order have the same regularity in time as , moreover it solves the following partial differential equation with final condition,
[TABLE]
Let us take the gradient with respect to the state variable of the latter equation and the scalar product of the resulting equality with ,
[TABLE]
where and are defined by
[TABLE]
By simple calculus, we notice that
[TABLE]
that we can combine with (4.9) and obtain
[TABLE]
We define the functions and by
[TABLE]
where and are constants that will be defined below. The derivatives of are given by
[TABLE]
which implies that and satisfy,
[TABLE]
Roughly speaking, we introduced and in order to have and as close as possible to . This will be achieved by taking large enough, and small enough.
For simplicity of the notations, we will omit to write the argument of since it is always .
The derivatives of verify the following equalities,
[TABLE]
We multiply (4.10) by and use the latter equalities in the resulting relation,
[TABLE]
We can rewrite the first line of (4.8) in the following way,
[TABLE]
where is defined by,
[TABLE]
This and (4.14) imply that
[TABLE]
In the following we will estimate from above the right-hand side of the latter expression. We notice that the second term of the right-hand side is negative since
[TABLE]
We notice that and are uniformly convergent to [math] as tends to [math], so we can assume that,
[TABLE]
for small enough and depending on .
The first term in the last line of (4.15) can be bounded using B2,
[TABLE]
In fact we are going to use the latter inequality to obtain (4.19) below, only by noticing that using (3.6), the right-hand side involves only terms with exponents in or not larger than .
Then we use B3 on the first term of the right-hand side of (4.15) since ,
[TABLE]
The term involving is a key element in this proof. On the one hand, it will allow us to cancel the term in . On the other hand, we will use the fact that it has a larger exponent than any of the remaining terms.
From (3.6) and (4.13), we obtain
[TABLE]
where will be defined below. Then (4.13) implies,
[TABLE]
Combining the latter six inequalities, (4.15), and the fact that , we obtain the following partial differential inequality,
[TABLE]
where is a positive constant which only depends on the constants in the assumptions and in . We systematically used the inequality on every term of the form with .
Let us mention the following result: the function defined by is a super-solution of the following differential equation,
[TABLE]
posed on , where and are positive constants and is a bounded positive function.
This and which comes from A2 and (4.13), yield that a super-solution to (4.19) is given by
[TABLE]
where we replace with .
From a comparison argument for parabolic second-order equation, is not larger than the latter expression. This result holds for by letting and tend to [math], thus verifies the following inequality,
[TABLE]
By B3, we can choose large enough, and small enough such that . This implies
[TABLE]
where we increased into .
We make out two cases: the first case is when . The second case is when . In the latter case, (4.20) implies that , which implies that . Therefore, in any of the two latter cases we obtain
[TABLE]
This and (4.13) yield (4.7) when , this concludes the proof. ∎
Now, we can combine the estimates obtained in this section with classical results on parabolic second-order equations and get further estimates of and its derivatives and on .
Lemma 4.6**.**
Assume A1, A2, B2, B3, FP1, FP2 and T. The function is in for any , where was introduced in the assumptions. Its -norm can be bounded by a quantity depending only on , , and the constants in the assumptions.
Proof.
Lemma 4.5 states that is bounded by a quantity which depends on and the constants in the assumptions. So is by (3.2). Then is the solution of the heat equation with a right-hand side equal to which is bounded in . Classical results (see for example Theorem in [34]) state that for any , the -norm of is bounded by a constant which depends on the -norm of the right-hand side, the terminal condition, and .
Lemma 4.4 yields that is in for , is positive, and that both its -norm and its lower bound depend on , , , and the constant of the assumptions.
Therefore, Lemma 2.3 yields that .
From A1, is locally Lipschitz continuous with respect to . This and T imply that . Thus is the solution of the backward heat equation with a right-hand side in supplemented with terminal condition in . Classical results (see for instance Theorem in [34]) yield that is in , and its -norm depends on and the -norm of the right-hand side. We recall that is any constant in . The proof of the lemma is complete.
Following precisely the dependencies in the above estimates, we obtain that the norm of can be estimated from above by a constant which depends on , , , and the constants in the assumptions. ∎
The conclusions of Lemmas 4.1, 4.4, 4.5 and 4.6 are summarized in Lemma 2.4.
Existence and uniqueness results
under additional assumptions
5.1 Solving the MFGC systems for
Lemma 5.1**.**
Under assumptions A1-A3, B2, B3, FP1, FP2, T and , there exists at least one solution to (2.4).
Proof.
For , we define by
[TABLE]
using Lemma 2.3. Then we define as the viscosity solution of the following backward HJB equation with a final condition,
[TABLE]
We can rewrite the first line of the latter system in the following way,
[TABLE]
where the right-hand side is bounded using , B1 and (2.2). The maximum principle for second-order parabolic equation provides that is bounded. Here, the proof of Lemma 4.5 can be repeated to prove that is bounded by a constant which depends on and the constants in the assumptions. Then with the same argument as in Lemma 4.6, is bounded in -norm, for all .
We define as the solution in the sense of distributions of the following Fokker-Planck-Kolmogorov equation with an initial condition,
[TABLE]
with which is a continuous function with respect to . Using the same arguments as in Lemma 4.4, we get that for .
Moreover, , are bounded by a constant which depends on , and the constants in the assumptions. The map is continuous from to by Lemma 3.1. The map is continuous from to by the stability of the solutions of viscosity. The map is continuous from to by linearity of the FPK equation.
Thus the map is continuous from to itself. Its fixed points are exactly the solutions to (2.4). The image of this map is a subset of a convex compact set. Therefore, there exists a fixed point by Schauder theorem, see [19] Corollary .
Using the same arguments as in the proof of Lemma 4.6, such a fixed point satisfies for any . ∎
Considering in (2.4) consists of enforcing the condition , i.e. the fact that the support of is embedded in the compact set , for . Therefore, the interactions through controls are uniformly bounded. Lemma 5.1 relies on that to state the existence of solutions to (2.4). For , we can not obtain such a uniform estimate by combining only the results of Section 4. However if such an estimate exists, the result of Lemma 5.1 holds for and yields the existence of solutions to (1.2). More precisely, if a solution to (2.4) satisfies for any , then it is also a solution to (1.2). This is summarized in the following Corollary.
Corollary 5.2**.**
Under the same assumptions as in Lemma 5.1, if, for any , any solution to (2.4) satisfies , or , for some , then there exists at least one solution to (1.2).
Proof.
By Lemma 5.1, we define as a solution to (2.4) for that will be defined later. By Lemma 2.4, assuming that is bounded is equivalent to assuming that is bounded. Therefore, without loss of generality, we can assume that . From FP1 and (3.4), we obtain
[TABLE]
We define , then the truncation leaves unchanged. Hence is a solution to (1.2). ∎
5.2 Existence results
when
When , we can use integral energy estimates. More precisely, inequalities (4.1) and (4.5) hold. Therefore, the assumptions under which we can prove existence should be weaker than in the case in which we have less estimates at our disposal.
In particular, Corollary 4.3 provides a uniform estimate on under suitable assumptions. Corollary 5.2 then yields the existence of a solution to (1.2): hence we may state the following theorem:
Proposition 5.3** **(*Existence of solution
with small non-linearities*).
Under assumptions A1-A3, B1-B3, FP1, FP2, T, , and , there exists at least one solution to (1.2).
Instead of assuming that the multiplicative parameters are small like in Proposition 5.3; we suppose in Propositions 5.5 below the exponent for the interactions through controls is in fact smaller than the one appearing in B1.
Proposition 5.4**.**
Assume A1-A3, B2, B3, FP1, FP2, T, , and that satisfies
[TABLE]
for , where is a constant. There exists a solution to (1.2).
Proof.
Let be a solution to (2.4) for . From A2, (4.4) and the new assumption, we obtain that,
[TABLE]
where the second line is obtained by a Hölder inequality, since . Let us recall that the inequality holds for any . The latter two inequalities and (3.5) with imply,
[TABLE]
where is a constant which depends on the constants in the assumptions. This and (4.5) yield that,
[TABLE]
up changing the value of . Let us make out two cases: the first case is when . The second case is when , which implies . In any of the two cases, is uniformly bounded with respect to . The desired result then stems from Corollary 5.2. ∎
5.3 Existence results
which do not need the assumption
Here, we do not make the assumption . We can still obtain an existence result in the same spirit as the one provided in Proposition 5.4. In the following proposition, the exponent for the interactions through controls is assumed to be smaller than the one appearing in B1 or in Proposition 5.4.
Proposition 5.5**.**
Assume A1-A3, B2, B3, FP1, FP2, T, and that satisfies
[TABLE]
for any . There exists a solution to (1.2).
Proof.
Take a solution to (2.4) for . Let us combine (4.3), (3.6) for , (4.7), (5.2), and the inequality which holds for ; this yields
[TABLE]
for a constant which depends only on the constants in the assumptions. We recall that by A2. We consider defined as and such that they are solution to the following differential equations
[TABLE]
By a comparison argument for second-order parabolic equation we obtain,
[TABLE]
for . Therefore is uniformly bounded with respect to . The desired result then stems from Corollary 5.2. ∎
In Propotitions 5.4 and 5.5, we changed the exponent appearing in B1. In the following proposition, we assume a smaller exponent than the one appearing in B2 instead.
Proposition 5.6** **(*Existence
with more restrictive assumptions on *).
Assume A1-A3, B1, B3, FP1, FP2, T, and the following inequality,
[TABLE]
for any . There exists at least one solution to (1.2).
Proof.
Take a solution to (2.4), for .
First step: we prove the following inequality,
[TABLE]
for any , where is a constant depending only on the constants in the assumptions. We will only prove this inequality for , however the proof does not use the additional information available at (the initial condition on for example), so it can be repeated for any and the constant in (5.5) does not depend on .
We introduce and as in the proof of Lemma 4.5. Using (5.4) instead of B2, we obtain
[TABLE]
From this and (3.2), one may notice that the right-hand side of the latter inequality only involves terms with exponents in or nor larger than . This and the same arguments as in the proof of Lemma 4.5 between (4.18) and (4.19), lead to the following inequality,
[TABLE]
instead of (4.19), where the novelty is the exponent on at the last line which changed from to . Then following the same steps as in the proof of Lemma 4.5 until the end, we obtain that,
[TABLE]
This concludes the first step of the proof.
Second step: obtaining a uniform estimate on .
Using B1, (3.6) with and (5.5), we obtain that,
[TABLE]
where the constant from the previous step may have been increased. This implies that satisfies the same partial differential inequality as in the proof of Proposition 5.5, namely (5.3). Therefore the same arguments as in Proposition 5.5 apply and we conclude that there exists a solution to (1.2). ∎
Remark 5.7**.**
Note that the exponent actually appears in several applications: for instance, the price impact model described in paragraph 6.2 in the quadratic case (i.e. ) with (i.e. when the bidding and asking prices are equal), satisfies the assumptions in both Propositions 5.5 and 5.6 with an exponent exactly equal to .
5.4 Existence and uniqueness results with
a short-time horizon assumption
Under a short-time horizon assumption, existence and even uniqueness of solutions are well-known in the MFG literature. Indeed, when the time horizon is small, one may obtain strong a priori estimates under non-restrictive assumptions. These estimates combined with Corollary 5.2 yield existence of solution to (1.2) as stated in the following proposition.
Proposition 5.8** (Existence with short time horizon).**
Assume A1, A2-B1, B2-FP1, FP2, and T. There exists such that, if then there exists a solution to (1.2).
Proof.
Take a solution to (2.4) for . We combine (4.3), FP1, (3.6), (4.7), and the convex inequality , and we obtain
[TABLE]
where is a positive constant which depends only on the constants in the assumptions. We recall that by A2. Let us consider the following differential equation,
[TABLE]
There exists such that the latter differential equation admits a bounded solution on . We suppose that , then is a super-solution to (5.6). Hence by a comparison principle, we get that . The same argument applies in order to prove that . Therefore is uniformly bounded with respect to , and there exists a solution to (1.2) by Corollary 5.2. ∎
We will now prove Theorem 2.6 which states that uniqueness is achieved under a short-time horizon assumption. We believe that this uniqueness result can be easily extended to more general Hamiltonians, but that the short-time assumption is essential. Indeed, numerical simulations in [1] show that uniqueness does not hold for the discrete MFGC system obtained by approximating (1.2) with finite differences; we believe that uniqueness does not hold for (1.2) either. Theorem 2.6 should be interpreted only as a simple example of uniqueness result with a short-time horizon assumption.
Proof of Theorem 2.6.
We suppose that , where was defined in Proposition 5.8, so that a solution to (1.2) satisfies uniform estimates on , and by Lemma 4.6, for . Take and two solutions to (1.2). We define , and .
In this proof is a constant which may differ from line to line and depends only on the constants in the assumptions, and , for .
We can repeat the proof of Lemma 3.2 replacing and respectively with and everywhere and we obtain that,
[TABLE]
for any . Let us consider and two random processes defined by
[TABLE]
where is a random variable on with law and is a Brownian motion independent of . The respective laws of and are and . Then we obtain,
[TABLE]
where we used the triangle inequality for the -norm twice. By the first additional assumption of the theorem and A1, is Lipschitz continuous with respect to and its Lipschitz constant depends on and . Using the estimates from the proof of Proposition 5.8, it only depends on the constants in the assumptions. This, the latter inequality and (5.7) imply
[TABLE]
This and Gronwall’s inequality yield that,
[TABLE]
From now on, we assume that , so that . Since , we obtain:
[TABLE]
Hence satisfies the following equation,
[TABLE]
The right-hand side of the first line can be estimated in absolute value from above as follows:
[TABLE]
by T, (5.7) and (5.8). Since , Theorem in [34] yields that and it satisfies:
[TABLE]
[TABLE]
Thus if we suppose furthermore that , then , so by (5.8), then by (5.7), and finally and solve the same Hamilton-Jacobi-Bellman equation with the same terminal condition, so by uniqueness .
Therefore, we proved the uniqueness for where is defined by . ∎
Applications
Here, we are going to work on , while it would be more realistic to work in the whole space for the applications considered below. We would like to recall that the existence results contained in the present work hold for MFGC systems on using the method introduced in [28] to pass from the torus to the whole Euclidean space. Therefore, the conclusions of this section may be adapted to treat the same applications on .
6.1 Exhaustible ressource model with nonpositively correlated ressources
This model is often referred to as Bertrand and Cournot competition model for exhaustible ressources, introduced in the independent works of Cournot [17] and Bertrand [6]; its mean field game version in dimension one was introduced in [24] and numerically analyzed in [16]; for theoretical results see [8, 22, 27, 23]. We consider a continuum of producers selling exhaustible ressources. The production of a representative agent is ; the agents differ in their production capacities (the state variable), that satifies,
[TABLE]
where and is a Brownian motion. Each producer is selling a different ressource and has her own consumers. However, the ressources are substitutable and any consumer may change her mind and buy from a competitor depending on the degree of competition in the game (which is characterized by in the linear demand case below for instance). Therefore, the selling price per unit of ressource that a producer can make when she sales units of ressource, depends naturally on and on the quantity produced by the other agents. The price satisfies a supply-demand relationship, and is given by , where is the aggregate demand which depends on the overall distribution of productions of the agents. A producer tries to maximize her profit, or equivalently to minimize the following quantity,
[TABLE]
where is a terminal cost which often penalizes the producers who have non-zero production capacities at the end of the game. In the Cournot competition, see [17], a producer is controling her production . Like the MFG version of the Bertrand and Cournot competition introduced in [16], here we consider the Bertrand formulation [6], where an agent directly controls her selling price . After inverting the latter equality, the production can be viewed as a function of the price and the mean field. Mathematically this corresponds to writing .
In [16], the authors considered a linear demand system depending on , and a price satisfying . In this case, the running cost and its Legendre transform are defined by
[TABLE]
where , and is defined by . Therefore the system of MFGC has the following form,
[TABLE]
for . Roughly speaking, corresponds to a monopoly in which a producer does not suffer from competition, and she plays as if she was alone in the game. Conversely, stands for all the producers selling the same ressource and the consumers not having any preference.
Here, Theorem 2.5 d implies the following existence result.
Proposition 6.1**.**
If and satisfy A2 and A3, there exists a solution to (6.1) for any .
To prove it, we may take , , , , and in FP1; then we check the assumptions of Theorem 2.5 d. In this case, the inequality in B3 has the form , and is satisfied for any .
Here, the Lagrangian satisfies a monotonicity assumption, but the latter existence result does not take advantage of it. We refer to [28] for a uniqueness result and an other existence result for the solution to (6.1) using this monotonicity assumption. Generalizations of (6.1) to larger dimensions with more general Hamiltonians and prices are also discussed in [28] under the monotonicty assumption.
In what follows, we provide a simple example of a generalization of (6.1) in which the monotonicity assumption does not hold and the results in [28] do not apply anymore. However, the results in the present work may hold in some cases even without the monotonicity assumption.
Let us consider a model in which every producer sells different kinds of ressources. The price of each ressource depends on the mean field like in (6.1). Namely, we take which is now a -dimensional vector and where is a given matrix. This leads to the following MFGC system,
[TABLE]
Proposition 6.2**.**
Assume A2, A3, that has an operator norm smaller than , and that is continuous, and differentiable with respect to with continuous derivatives. There exists a solution to (6.3).
The proof consists in taking , , , in FP1, and , where is the operator norm of ; and we check the assumptions of Theorem 2.5 d.
The monotonicity assumption discussed in [28] is equivalent to assuming that is a positive semi-definite matrix. Here, we do not make such an assumpion.
What we have in mind in the latter example is the case where the prices of the different ressources may be negatively correlated, like cars and oil (if the production of cars increases, then the demand for oil also increases and the price of oil rises while the price of cars decreases), or pesticides and medicines, or gold and other raw materiels. To our knowledge, such a generalization of the exhaustible ressource model to negatively correlated ressources is new in the MFG literature.
More generally, we believe that our results hold for the following MFGC system under various different sets of assumptions that we will not detailed here,
[TABLE]
where is a vector characterizing the mean field interactions.
6.2 Price impact models with bid and ask prices
The price impact model without bid and ask prices is inspired by the Almgren and Chriss’s model [5], and was introduced in the MFG literature in [11] and [15] where existence and uniqueness results are proved when the admissible controls stay in a compact set. Here we consider an extension with bid and ask prices.
We suppose that a continuum of agents are trading an asset, the state of a representative agent is the amount of this asset she owns. Her control is the quantity she buys (if ) or sell (if ). The state space is the one-dimensional torus , and is given by,
[TABLE]
where is a Brownian motion, and is a real constant. We define as the asking price of the asset, and as the difference between the bidding and asking prices, where is the law of . The agent buys at the bidding price , thus her cash is given by
[TABLE]
where is a differentiable function standing for the transaction cost. The price evolves accordingly with the amount of transactions at time , it satisfies the following SDE,
[TABLE]
The wealth of a representative agent is given by and it satisfies the following SDE,
[TABLE]
The objective function that she will try to maximize is given by,
[TABLE]
where and are penalization costs for holding stocks. Here, the Lagrangian and Hamiltonian are given by,
[TABLE]
for , where is the Legendre transform of .
The linear-quadratic case with is treated in [13]. Here, taking corresponds to assuming that the bidding and asking prices coincide. In this case the optimal control is given by and does not depend explicitely on . If , the optimal control depends explicitely on and is not separable in and , this prevents us from using the results in [13].
Let us give an example of choices for the functions and under which our result apply and a solution of the MFGC price impact model exists.
Proposition 6.3**.**
Assume A2, A3, that is , and that and are respectively given by and , where . There exists a solution to (1.2) with .
This existence result is a consequence of 2.5 c, where the assumptions are satisfied for , , and in FP1. We would like to insist on the fact that Theorem 2.5 c provides the existence of solutions for a wild class of Hamiltonian, larger than the one of the latter proposition and which goes beyond the linear-quadratic case.
Let us mention that we would be interested in defining the bidding price by , where . The associated MFGC system cannot be using the conclusions of the present work because the mean field interaction at time would depend not only on but on . However, we believe that existence holds under similar assumptions as here, and we plan to prove it in forthcoming works.
6.3 First-order flocking model with velocity as controls
Cucker and Smale proposed a form of Vicseck model in [18] to illustrate the behavior of flocks of birds. This model is of second-order in the sense that the state of an agent is given by a couple standing for her position and velocity respectively, and the equation of evolution of her state involves considering her acceleration.
A game version of this model in which an agent controls her acceleration has been introduced in [36], the authors derived a MFG formulation in the infinite horizon case. Here we are interested in the finite horizon problem which was studied in [15, 13]. This model is still of second-order. More precisely the state of an agent is given by respectively her position and velocity, two random processes which satisfy the following system of stochastic differential equations,
[TABLE]
where is the individual’s acceleration vector and her control, is a -dimentional Brownian motion, and is a positive definite matrix. The cost that a representative agent tries to minimize is given by
[TABLE]
where is the joint distribution of states and velocities of the agents, is a nonincreasing function, and is a function modeling the spatial preferences of the agents (for instance, we can take significantly smaller in some areas which corresponds to where the food is).
Here we consider an alternative viewpoint in which an agent directly controls her velocity. This is a first-order model since the state of an agent is now given by a vector of , and the acceleration does not appear anymore in the dynamics of a given agent, which is given by
[TABLE]
Here, the cost that an agent tries to minimize is given by
[TABLE]
First-order physical models are generally easier to study than second-order models. However the price we paid here to go from a second-order model to a first-order model is to consider a MFGC system instead of a MFG system without interaction through the controls.
If and are such that is the marginal of with respect to , we define and by,
[TABLE]
for . We define the Lagrangian of the first-order flocking model by,
[TABLE]
for , and the Hamiltonian by,
[TABLE]
for , such that is the Legendre’s transform of .
Proposition 6.4**.**
Under assumptions A2 and A3, there exists such that if , there exists a unique solution to (1.2) with .
Hereafter, we present an other model for crowd motion which is very similar to the first-order flocking model discussed above. The main difference between these two models is the normalization constants. However, the assumptions and conclusions of this work are more adapted to the following crowd motion model and we can derive more existence results for it. We believe that these results can be adapted to the first-order Cucker-Smale system.
6.4 A model of crowd motion
This model of crowd motion has been numerically studied in [1] in the quadratic case, and has some similarities with the first-order flocking model presented in the previous paragraph. For , we define and by
[TABLE]
where , is the conjugate exponent of , is a nonnegative kernel, and is the marginal of with respect to . The quantity is called the average drift.
The state of a representative agent is given by her position and she controls her velocity ,
[TABLE]
Her objective is to minimize the cost given by,
[TABLE]
where and are two constants standing for the preference of an individual to have a similar (resp. opposite) control as the mainstream when (resp. ), and are respectively the running cost and the terminal cost which encode the spatial preferences of the agents, and are exponents.
Here, we take . In this model we define the Lagrangian by,
[TABLE]
and the Hamiltonian as its Legendre transform. If , is given by
[TABLE]
If , satisfies
[TABLE]
For other choices of the parameters , and , does not admit an explicit form.
Proposition 6.5**.**
Assume that and satisfy A2 and A3 respectively. There exists a solution to (1.2) where is the Legendre transform of given in (6.5), under one of the following assertions,
- a)
* and ,* 2. b)
* and one of the following assertions is satisfied,*
- )
, 2. )
, 3. )
,
where are constants coming from Theorem 2.5 c, 3. c)
, 4. d)
* is constant,* 5. e)
, where is a positive constant coming from Theorem 2.5 e.
Proof.
We refer to the appendix, Lemma A.2 for the proof that satisfies A1-A3, B1-B3, FP1-FP2, and T. The existence results c, d and e are direct consequences of Theorem 2.5 c, d and e respectively.
We define by
[TABLE]
for , as the Legendre transform of with respect to its first argument, and as the unique control which achieves the maximum in the definition of (it is unique because is strictly convex with respect to ).
Proof of a. Take and , since achieves the maximum in the definition of , we know that
[TABLE]
which implies
[TABLE]
and then
[TABLE]
The two latter equalities yield . We make out two cases:
- •
if then we have , and . Therefore, (6.7) yields
[TABLE]
and , so we obtain
[TABLE]
which yields
[TABLE]
with , and , and .
- •
if then we have , and . Therefore, (6.7) yields
[TABLE]
We notice that , and we obtain
[TABLE]
This implies
[TABLE]
with , and , and .
We conclude by (A.1) and Theorem 2.5 a.
Proof of b
Here, we assume that since the case is addressed in a.
Take , and . In this case, admits an explicit form given by
[TABLE]
Therefore, taking , or small enough allows one to conclude by (A.1) and Theorem 2.5 b.
∎
Acknowledgements. I wish to express my gratitude to Y. Achdou and P. Cardaliaguet for technical advices, insightful comments and corrections. The work was supported by the ANR project MFG ANR-16-CE40-0015-01.
Appendix A Verification of the assumptions
for the model of crowd motion
We start by establishing some properties of the function in the following lemma.
Lemma A.1**.**
The function is with respect to and it satisfies
[TABLE]
where .
For and , the following inequality is satisfied,
[TABLE]
For , there exists a constant such that,
[TABLE]
for such that with , with and , .
Proof.
The function has at least the same regularity as with respect to the state variable since is the convolution product of with a probability measure. Then (A.1) and (A.2) are straightforward using Hölder inequality. Let us take the same notation as in (A.3), for we get
[TABLE]
Moreover, we know that where the right-hand side does not depend on , and
[TABLE]
The latter two chains of inequalities imply (A.3) with . ∎
Here, we assume . Indeed, admits an explicit form when or , then checking A1-A3, B1-B3, FP1-FP2, and T is straightforward.
Lemma A.2**.**
Assumptions A1, B1-B3, FP1, FP2, and T are satisfied when is defined in (6.5).
Proof.
We define , and as in the proof of 6.5.
The Legendre transform of a function is convex, therefore is convex with respect to . Since is strictly convex, is differentiable with respect to . Moreover, thus is continuous by the Maximum theorem. Then , so is continuous. Finally, is differentiable with respect to by the envelop theorem and
[TABLE]
for .
Using the growth properties of , we can prove that there exists such that
[TABLE]
for any . We refer to [28] Lemma for a complete proof.
One may prove that the function satisfies for such that , , where is a constant. This implies that for there exists a constant such that satisfies
[TABLE]
for , such that and . This implies
[TABLE]
Take and , . Recalling the conjugacy relation we obtain that is locally Hölder continuous with respect to .
Checking B3.
Take and , the optimal control satisfies
[TABLE]
If , this implies
[TABLE]
and
[TABLE]
From (A.8), we deduce that
[TABLE]
We recall that , hence
[TABLE]
which implies B3.
Proof that is differentiable with respect to at .
Take that will eventually tend to [math] and . From (A.8) we obtain
[TABLE]
Let us recall inequalities (6.6) and (6.7).
- •
if then , and we obtain the following expansion as tends to [math],
[TABLE]
- •
if we obtain,
[TABLE]
- •
if then , and we obtain the following estimate as tends to [math],
[TABLE]
Therefore the derivatives of with respect to in any of the above three cases are:
[TABLE]
Proof that the operator norm of is not larger than .
Here, the norm of a square matrix is defined by . Let us introduce
[TABLE]
We recall that if , then is the orthogonal projection onto for .
If then , we see on (A.12) that is a positive semi-definite matrix with eigenvalues in . Therefore, we can now assume that .
Let us assume temporarily that . Then we differentiate the -th component of (A.8) with respect to ,
[TABLE]
This implies
[TABLE]
and thus
[TABLE]
We can check that this last equation holds in the general case for any .
- •
If (i.e. ) or (i.e. ), then (A.13) yields that is a positive definite matrix with eigenvalues in .
- •
If , and are aligned, Then and commute and is a positive definite matrix. Then (A.13) yields that is a positive definite matrix with eigenvalues in .
- •
The last case consists of assuming that , , and are linearly independent. We define by . The two orthogonal subspaces and are stable by . The restriction of to is positive definite with eigenvalues in .
Let us denote by respectively the restriction of and to . We notice that
[TABLE]
thus the eigenvalues of are and since . The eigenvalues of are and . Lemma A.3 below yields that is a positive definite matrix with eigenvalues not smaller than . This implies
[TABLE]
This concludes the proof that the norm of is not larger than .
Proof of FP2.
Take and , then
[TABLE]
Combining the latter inequality and (A.2), we conclude that FP2 is satisfied.
Proof of FP1.
Let , we take .
- •
We suppose , we make out two cases: the first case is when ; the second case is when which implies
[TABLE]
using (A.9). We recall that , hence
[TABLE]
- •
We suppose that , we make out two cases: the first case is when ; the second case is when which implies
[TABLE]
where we used (A.9). From the equality , we deduce
[TABLE]
This concludes the proof of FP1.
Proof of T.
We proved above that is locally Lipschitz continuous with respect to and we recall that is . Therefore is also locally Lipschitz with respect to . This and (A.3) implies that T holds. ∎
Lemma A.3**.**
Let be two positive definite matrices with eigenvalues and respectively, and . Then for any the matrix defined by
[TABLE]
is positive definite with eigenvalues not smaller than .
Proof.
We can assume that have the following form:
[TABLE]
since the eigenvalues of are invariant by taking the conjugate of and by the same orthogonal matrix. The same argument and noticing that commutes with , imply that we can assume that admits a positive determinant, and thus we can write it as
[TABLE]
with . In this case, is given by
[TABLE]
We name the matrix in the last line of the latter calculation, and have the same eigenvalues. Let us compute
[TABLE]
its trace is given by
[TABLE]
and its determinant by
[TABLE]
The eigenvalues of are the roots of the following second-order polynomial function,
[TABLE]
its smallest root is
[TABLE]
which is not smaller than if and only if
[TABLE]
Therefore, it is sufficient to check that tr to conclude. We define the function by
[TABLE]
This is a second-order polynomial in with
[TABLE]
If , then is linear and thus for all .
If , then the minimum of this polynomial function on is obtained at defined as
[TABLE]
since and . Thus has no local minimum on , then for all since and .
Since , this concludes the proof of the lemma. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Y Achdou and Z Kobeissi. Mean field games of controls: Finite difference approximations, 2020.
- 2[2] Yves Achdou. Finite difference methods for mean field games. In Hamilton-Jacobi equations: approximations, numerical analysis and applications , volume 2074 of Lecture Notes in Math. , pages 1–47. Springer, Heidelberg, 2013.
- 3[3] Yves Achdou and Jean-Michel Lasry. Mean field games for modeling crowd motion. In Contributions to partial differential equations and applications , volume 47 of Comput. Methods Appl. Sci. , pages 17–42. Springer, Cham, 2019.
- 4[4] Clémence Alasseur, Imen Ben Taher, and Anis Matoussi. An extended mean field game for storage in smart grids. J. Optim. Theory Appl. , 184(2):644–670, 2020.
- 5[5] Robert Almgren and Neil A. Chriss. Optimal execution of portfolio trans-actions. 2000.
- 6[6] Joseph Bertrand. Théorie mathématiques de la richesse sociale. Journal des Savants , 67:499–508, 1883.
- 7[7] Charles Bertucci, Jean-Michel Lasry, and Pierre-Louis Lions. Some remarks on mean field games. Comm. Partial Differential Equations , 44(3):205–227, 2019.
- 8[8] Frédéric J. Bonnans, Saeed Hadikhanloo, and Laurent Pfeiffer. Schauder Estimates for a Class of Potential Mean Field Games of Controls. ar Xiv e-prints , page ar Xiv:1902.05461, Feb 2019.
