Mean Field Game for Linear Quadratic Stochastic Recursive Systems
Liangquan Zhang, Xun Li

TL;DR
This paper develops a framework for linear-quadratic mean-field games involving forward-backward stochastic differential equations, establishing well-posedness and epsilon-Nash equilibrium properties for decentralized strategies.
Contribution
It introduces a coupled mean-field FBSDE approach with projection operators for LQ mean-field games and proves well-posedness using monotonicity conditions.
Findings
Established well-posedness of the consistency system.
Proved epsilon-Nash equilibrium property.
Provided a decentralized strategy framework.
Abstract
This paper focuses on linear-quadratic (LQ for short) mean-field games described by forward-backward stochastic differential equations (FBSDEs for short), in which the individual control region is postulated to be convex. The decentralized strategies and consistency condition are represented by a kind of coupled mean-field FBSDEs with projection operators. The well-posedness of consistency condition system is obtained using the monotonicity condition method. The -Nash equilibrium property is discussed as well.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Mean Field Game for Linear Quadratic Stochastic Recursive Systems
Liangquan Zhang1, Xun Li2
- School of Science
Beijing University of Posts and Telecommunications
Beijing 100876, China
- Department of Applied Mathematics
The Hong Kong Polytechnic University, Hong Kong
L. Zhang acknowledges the financial support partly by the National Nature Science Foundation of China (Grant No. 11701040, 61871058, 11871010 & 61603049) and the Fundamental Research Funds for the Central Universities (No. 500417024 & 505018304). E-mail: [email protected]. Li acknowledges the financial support partly by PolyU G-UA4N, Hong Kong RGC under grants 15224215 and 15255416. E-mail: [email protected].
Abstract
This paper focuses on linear-quadratic (LQ for short) mean-field games described by forward-backward stochastic differential equations (FBSDEs for short), in which the individual control region is postulated to be convex. The decentralized strategies and consistency condition are represented by a kind of coupled mean-field FBSDEs with projection operators. The well-posedness of consistency condition system is obtained using the monotonicity condition method. The -Nash equilibrium property is discussed as well.
AMS subject classifications: 93E20, 60H15, 60H30.
Key words:-Nash equilibrium, Mean-field forward-backward stochastic differential equation (MF-FBSDE), Linear-quadratic constrained control, Projection, Monotonic condition.
1 Introduction
The control of stochastic multi-agent systems has attracted large attentions by many researchers. As well-known, the large population systems arise naturally in various different fields (e.g., biology, engineering, social science, economics and finance, operational research and management, etc.). Readers interested in this topic may refer [27, 28, 29, 30] for more details of their solid backgrounds and real applications. The agents (or players) in large population system are individually negligible but their collective behaviors will make some significant impact on all agents. This trait can be captured by the weakly-coupling structure in the individual dynamics and cost functionals through the state-average. The individual behaviors of all agents in micro-scale can make their mass effects in the macro-scale.
As for the controlled large population system, it is intractable for a given agent to collect all agents due to the highly complex interactions among its colleagues. Consequently, the centralized controls, which are built upon the full information of all agents, are not implementable and not efficient in large population framework. Alternatively, it is more reasonable and effective to study the decentralized strategies which depend on the local information111Here local information means the optimal control regulator for a given agent, is designed on its own individual state and some quantity which can be obtained in off-line way. only. The mean-field type stochastic control problem is of both great interest and importance in various fields such as science, engineering, economics, management, and particularly in financial investment. In contrast with the standard stochastic control problems, the underlying dynamic system and the cost functional involve state processes as well as their expected values (hence the name mean-field). In financial investment, however, one frequently encounters interesting problems which are closely related to money managers’ performance evaluation and incentive compensation mechanisms. Together with MF-FSDEs, research is naturally required on optimal control problems based on mean-field forward-backward stochastic differential equations (hereafter MF-FBSDEs). Hence, one powerful tool employed is so-called mean-field games (see [35]). The basic idea is to approximate the initial large population control problem by its limiting problem via some mean-field term (i.e., the asymptotic limit of state-average). There are huge literature can be found in [3, 11, 12, 13, 18, 31, 34, 35] for the study of mean-field games; [29] for cooperative social optimization; [27], [36] and [37] and references therein for models with a major player; [1, 4, 46] for optimal control with a mean term in the dynamics and cost, etc.
The main contribution of this paper is to study the forward backward mean-field LQG of large population systems for which the individual states follow some forward backward stochastic differential equations (FBSDEs in short). This framework makes our setting very different to existing works of mean-field LQG games wherein the individual states evolve by some forward stochastic differential equations. In contrast to classical stochastic differential equations, the terminal condition of BSDE should be specified as the priori random variable, which means, the BSDE will admit one pair of adapted solutions, in which the second solution component (the diffusion term) is naturally appeared here by virtue of the martingale representation theorem and the adaptiveness requirement for filtration. The linear BSDEs are first introduced in [2] for studying the optimal control problems, and the general nonlinear BSDEs are developed by Pardoux and Peng in 1992 [38]. Since then, the study of BSDE has initiated consistent and intense discussions, moreover, it has been used in many applications of diverse areas. For instance, the BSDE takes very important role to characterize the nonlinear expectation (-expectation, see [40]), or the stochastic differential recursive utility (see [14]). Subsequently, El Karoui, Peng, and Quenez [32] presents many applications of BSDE in mathematical finance and optimal control theory. Pardoux and Peng establish a kind of stochastic partial differential equations with backward doubly SDE (see [39]). Therefore, it is very natural to study its dynamic optimization in large-population setting. Indeed, the dynamic optimization of backward large population system is inspired by a variety of scenarios. For example, the dynamic economic models for which the participants are of some recursive utilities or nonlinear expectations, or some production planning problems with some tracking terminal objectives but affected by the market price via production average.
Another example arises from the risk management when considering the relative or comparable criteria based on the average performance of all other peers through the whole sector. This is the case for a given pension fund to evaluate its own performance by setting the average performance (say, average hedging cost or initial deposit, surplus) as its benchmark. In addition, the controlled forward large population systems, which are subjected to some terminal constraints, can be reformulated by some backward large population systems, as motivated by [33]. Applying to performance evaluation and incentive compensation of fund managers in the field of financial engineering is of both academic and practical importance. Findings from FBSDEs and mean-field stochastic optimal controls will not only contribute to the academic literature by shedding light on performance evaluation and incentive compensation schemes, but also provide practical applications to fund management and risk control, especially under the current circumstances with on-going economic recession. More importantly, research outcomes in this field are expected to add to our knowledge based on economic theory about providing appropriate incentives for managers in an agency framework. They can also be generalized to various industries and economic regions to provide policy makers with a theoretical basis during their decision-making processes. Inspired by above mentioned motivations, this paper studies the forward backward mean-field linear-quadratic-Gaussian (BMFLQG) games.
We concern on the linear-quadratic (LQ) mean-field game where the individual control domain is convex subset of . The LQ problems with convex control domain comes naturally from various practical applications. For instance, the no-shorting constraint in portfolio selection leads to the LQ control with positive control ( the positive orthant). Moreover, due to general market accessibility constraint, it is also interesting to study the LQ control with more general closed convex cone constraint (see [19]). As a response, this paper investigates the LQ dynamic game of large-population system with general closed convex control constraint.
The control constraint brings some new features to our study here: (1) The related consistency condition (CC) system is no longer linear, and it becomes a class of nonlinear FBSDEs with projection operator. (2) Due to the nonlinearity of (1), the standard Riccati equation with feedback control is no longer valid to represent the consistency condition of limit state-average process. Instead, the consistency condition is embedded into a class of mean-field coupled FBSDEs with a generic driven Brownian motion.
Similarly like in Hu, Huang, Li [24], we first apply the stochastic maximum principle for convex control domain of the optimal decentralized response through some Hamiltonian system with projection operator upon the control set for forward-backward systems. Then, the consistency condition system is connected to the well-posedness of some mean-field coupled forward-backward stochastic differential equation (MF-FBSDE). Next, we state some monotonicity condition of this MF-FBSDE to obtain its uniqueness and existence. At last, the related approximate Nash equilibrium property is also established. The MFG strategy derived is an open-loop manner. Consequently, the approximate Nash equilibrium property is verified under the open-loop strategies perturbation and some estimates of forward-backward SDE are involved. In addition, all agents are set to be statistically identical thus the limiting control problem and fixed-point arguments are given for a representative agent.
In order to make our paper more accessible to the reader, we provide the standard procedure of MFG, and describe our result mainly consisting of the following steps:
Step 1: Fix the state-average limit: lim and by lim a frozen process and (the law of large numbers) and formulate an auxiliary stochastic control problem for which is parameterized by and . Note that the coupled Hamiltonian systems admits a unique strong adapted solution.
Step 2: Solve the above auxiliary stochastic control problem via Pontryagin’s maximum principle to obtain the decentralized optimal state , (which should depend on the undetermined processes and ). By means of convex analysis, we are able to construct the unique feedback control using a so-called projection mapping,
Step 3: Characterize the decentralized strategies of Problem (CC) through the auxiliary (LCC) and consistency condition system. Namely, show is an -Nash equilibrium. This step actually can be further divided into:
*Step 3-1: *Introduce the decentralized state with its decentralized open-loop optimal strategy and the consistency conditions systems. We get two estimations between them in Lemma 7 and Lemma 8, respectively;
*Step 3-2: *For any fixed , , we shall consider a group of state equations driven by certain perturbation control and systems of the decentralized limiting state with perturbation control. Similarly, we have the estimations between perturbation systems and consistency condition system in Lemma 10, plus the estimations between perturbation systems and decentralized limiting state with perturbation control system in Lemma 11;
*Step 3-3: *Finally, based on Lemma 7-Lemma 11, employing the relation between limiting cost functional and the cost functional of with help of perturbational control, we are able to get our desired result.
This paper is organized as follows: Section 2 formulates the LQ MFGs of BSDEs type with convex control domain. The decentralized strategies are derived with the help of a mean field forward-backward SDEs with projection operators. The consistency condition is also established. Section 3 verifies the -Nash equilibrium of the decentralized strategies. Some proofs will be scheduled in Appendix A. Related results on properties of projection in convex analysis can be found in Appendix B.
2 Preliminaries
Throughout this paper, we denote the -dimensional Euclidean space by with standard Euclidean norm and standard Euclidean inner product . The transpose of a vector (or matrix) is denoted by . denotes the trace of a square matrix . Let be the Hilbert space consisting of all ()-matrices with the inner product and the norm . Denote the set of symmetric matrices with real elements by . If is positive (semi)definite, we write . is the space of uniformly bounded valued functions. If and for all , we say that is positive (semi) definite, which is denoted by . is the space of all valued functions satisfying
Consider a finite time horizon for fixed . We assume is a complete, filtered probability space on which a standard -dimensional Brownian motion is defined. For given filtration let denote the space of all -progressively measurable -valued processes satisfying Let be the subspace satisfying for Let denote the space of all -measurable -valued random variable satisfying
Now let us consider a large-population system with weakly-coupled negligible agents . The state and for each satisfies the following controlled linear stochastic system:
[TABLE]
where and is the state-average,
are matrix-valued functions with appropriate dimensions to be identified. For sake of presentation, we set all agents are homogeneous or statistically symmetric with same coefficients and deterministic initial states .
Now we identify the information structure of large population system: is the natural filtration generated by and augmented by all null sets in is the natural filtration generated by and augmented by all null sets in Thus, is the individual decentralized information of Brownian motion while is the centralized information driven by all Brownian motion components. Note that the heterogeneous noise is specific for individual agent but is adapted to instead of due to the coupling state-average
The (centralized) admissible control where the (centralized) admissible control set is defined as
[TABLE]
where is a closed convex set. Typical examples of such set is which represents the positive control. By “centralized”, we mean is the centralized information generated by all Brownian motion components. Moreover, we also define decentralized control as , where the decentralized admissible control set is defined as
[TABLE]
Note that both and are defined in open-loop sense, and . Let , denote the set of control strategies of all agents and denote the control strategies set except the agent Introduce the cost functional of as
[TABLE]
We assume the followng conditions:
(A1)
Assume and
(A2)
and , , .
By virtue of the theory of mean field BSDEs (see Lemma 3.1 in [5]), under the assumptions (A1)-(A2), Eq. admits a unique solution with an admissible control . We now formulate the large population LQG with control constraint (CC).
Problem (CC). Find an open-loop Nash equilibrium strategies set satisfying
[TABLE]
where represents , the strategies of all agents except .
Observe that the problem (CC) is of large computational issue since the highly-complicated coupling structure among these agents. Alternatively, the mean-field game theory employed is to search the approximate Nash equilibrium, which bridges the “centralized" LQG to the limiting LQG control problems, as the number of agents tends to infinity. Similar in [24], we need to construct some auxiliary control problem using the frozen state-average limit. Based on it, we can find the decentralized strategies by consistency condition.
Let us introduce the following auxiliary problem for
[TABLE]
with the limiting cost functional given by
[TABLE]
where are the average limit of realized states which should be determined by the consistency-condition (CC) in our later analysis (see (10)). Note that the auxiliary state is different to the true state . Also, the admissible control in (3), (4) whereas in (1), (2), the admissible control (for sake of simplicity, we still denote them with the same notation).
Now we formulate the following limiting stochastic optimal control problem with control constraint (LCC).
Problem (LCC). For the agent, find satisfying
[TABLE]
Then is called a decentralized optimal control for Problem (LCC). Now we apply the well known maximum principle (Theorem 3.3 in [43]) to characterize with the optimal state To this end, let us introduce the following adjoint process
[TABLE]
The Hamiltonian function can be expressed by
[TABLE]
Since is a closed convex set, then maximum principle reads as the following local form
[TABLE]
Hereafter, time argument is suppressed in case when no confusion occurs. Noticing (5), then (6) yields that
[TABLE]
or equivalently (noticing ),
[TABLE]
for all a.e. -a.s. As , we take the following norm on (which is equivalent to its Euclidean norm) and by the well-known results of convex analysis, we obtain that (7) is equivalent to
[TABLE]
where is the projection mapping from to its closed convex subset under the norm . For more details, see Appendix. Hereafter, denote
[TABLE]
Here, for simplicity, the dependence of on time variable is suppressed. The related Hamiltonian system becomes
[TABLE]
After above preparations, it follows that
[TABLE]
Here, the first equality of (8) and (9) is due to the consistency condition: the frozen term and should equal to the average limit of all realized states the second equality is due to the law of large numbers. Thus, by replacing by in above Hamiltonian system, we get the following system
[TABLE]
Clearly, all agents are statistically identical, therefore we can suppress subscript “” and the following consistency condition system appears for generic agent:
[TABLE]
Here, stands for a generic Brownian motion on and denote the natural filtration generated by it and augmented by all null-sets. are defined in the similar way with before. The system (10) is a nonlinear mean-field forward-backward SDE (MF-FBSDE) with projection operator. It characterizes the state-average limit and MFG strategies for a generic agent in the combined manner, which is totally different from [24, 25]. As you may concern, we need to prove the above consistency condition system admits a unique solution. We have the following uniqueness and existence result.
Remark 1**.**
It is necessary to point out that there should put a term in . But we claim that after taking expectation, it will disappear. Indeed, according to (7), one has . The first equality is just the definition of the second one is because of the law of large numbers, whilst the last one is due to the fact that is an -measurable random vector; and therefore is deterministic. Apparently, in contrast to Huang et al. [24, 25], our framework involves the state
Theorem 2**.**
Assume that (A1) and (A2) are in force. There exists a unique adapted solution to system (10).
For simplicity, we put the proof of Theorem 2 in the Appendix A.
3 Main result
In above sections, we can characterize the decentralized strategies of Problem (CC) through the auxiliary (LCC) and consistency condition system. For sake of presentation, we alter the notation of consistency condition system to be :
[TABLE]
Now, we are in position to verify the -Nash equilibrium of them. To this end, let us first present the definition of -Nash equilibrium.
Definition 3**.**
A set of strategies, , , for agents, is called to satisfy an -Nash equilibrium with respect to costs if there exists such that for any , we have
[TABLE]
when any alternative strategy is applied by .
Remark 4**.**
If , then Definition 3 is reduced to the usual exact Nash equilibrium.
Now, we give the main result of this paper and its proof will be shown step by step.
Theorem 5**.**
*Assume that *(A1) and (A2) are in force. Then, is an -Nash equilibrium of Problem (CC).
In order to prove the main Theorem 5, we needs several lemmas which are presented later. For agent recall that its decentralized open-loop optimal strategy is . The decentralized state is
[TABLE]
where and Note that satisfies (11).
For each , the monotonic fully coupled FBSDEs (11) has a unique solution . Thus, the system of all first equation of (13), , has also a unique solution , where denotes the -tuple Cartesian product. Moreover, since is -dimensional Brownian motions whose components are independent and identically distributed, we have are independent and identically distributed.
Lemma 6**.**
If (A1) and (A2) hold, then
[TABLE]
The proof of Lemma 6 is classical by virtue of B-D-G inequality and Schwarz inequality, so we omit it.
Lemma 7**.**
If (A1) and (A2) hold, then
[TABLE]
Proof.
Let us add up both sides of the first and second equation of (13) with respect to all and multiply , we obtain (recall that , and
[TABLE]
On the other hand, by taking the expectation on both sides of the second equation of (13), it follows from Fubini’s theorem that satisfies the following equation:
[TABLE]
From (16) and (17), by denoting
[TABLE]
we have
[TABLE]
and the inequality yields that, for any ,
[TABLE]
From the well-known Cauchy-Schwartz inequality and the B-D-G inequality, we obtain that there exists a constant independent of (which may vary line by line) such that
[TABLE]
Since are independent identically distributed, for each fixed , let us denote that (note that does not depend on ), we have
[TABLE]
Since are independent, we have
[TABLE]
Then, due to the fact that are identically distributed, there exists a constant independent of such that
[TABLE]
where the last equality comes from the fact that .
We proceed the second term of (20), using the fact that are identically distributed as follows:
[TABLE]
Moreover, we obtain from (20) that
[TABLE]
Consequently, by virtue of Gronwall’s inequality, we get the first estimate (18).
We now handle the estimates (19). Applying Itô’s formula again, we have
[TABLE]
Using B-D-G inequalities, we show that there exists a constant modifying if necessary,
[TABLE]
Employing the classical Cauchy-Schwarz inequality and Gronwall’s inequality with estimation (14), we get (15).
Lemma 8**.**
Assume that (A1) and (A2) are in force. Then, we have
[TABLE]
Proof.
From (13) and (11), we have that
[TABLE]
where is the unique solution to the following FBSDEs:
[TABLE]
From (23), we have
[TABLE]
The classical estimate for the SDE yields that
[TABLE]
where is a constant independent of . Noticing (14) of Lemma 7, we obtain (21). We consider
[TABLE]
By classical estimation for BSDE, we have
[TABLE]
where is a constant independent of . By Gronwall’s inequality, we get the desired result.
Lemma 9**.**
For all , we have
[TABLE]
Proof.
From the definition of (2), (4) and (13), we have
[TABLE]
and
[TABLE]
then
[TABLE]
We will use the following
[TABLE]
and Lemma 7, Lemma 8 as well as , for some constant independent of which may vary line by line in the following, we have
[TABLE]
With similar argument, using (15) and (22), one can show that
[TABLE]
The proof is completed by noticing (24).
We will prove the control strategies set is an -Nash equilibrium for Problem (CC). For any fixed , , we consider the perturbation control and we have the following state dynamics ():
[TABLE]
where . The wellposedness of above system is easily to obtain. To prove is an -Nash equilibrium, we can show that for ,
[TABLE]
Then we only need to consider the perturbation such that . Thus we have
[TABLE]
which implies that
[TABLE]
where is a constant independent of .
Now, for the agent, we consider the perturbation in the Problem (LCC). We introduce the following system of the decentralized limiting state with perturbation control ():
[TABLE]
We have the following results:
Lemma 10**.**
Let (A1) and (A2) hold, then
[TABLE]
Proof.
By (25), we get
[TABLE]
Let us denote
[TABLE]
and recall (17) which is
[TABLE]
we have
[TABLE]
By the Cauchy-Schwartz inequality as well as the B-D-G inequality, we obtain that there exists a constant independent of which may vary line by line such that, for any ,
[TABLE]
On the one hand, by denoting (note that since , , , are independent identically distributed, thus is independent of ), we have
[TABLE]
Then, due to the fact that are identically distributed and , similarly to Lemma 7 we can obtain that there exists a constant independent of such that
[TABLE]
In addition, due to (26), we get
[TABLE]
and similarly, since , , , are identically distributed, we have
[TABLE]
Therefore, from above estimates, we get from (31) that, for any ,
[TABLE]
Finally, by using Gronwall’s inequality, we get (28). We now proceed the second inequality. Applying Itô’s formula again, we have
[TABLE]
Still using B-D-G inequalities, we show that there exists a constant modifying if necessary,
[TABLE]
By employing the classical Cauchy-Schwarz inequality and Gronwall’s inequality, we obtain get (29).
Lemma 11**.**
[TABLE]
Proof.
From respectively the first equation of (25) and (27), we obtain
[TABLE]
With the help of classical estimates of SDE and BSDE, Gronwall’s inequality and (28) and (29) of Lemma 10, it is easily to obtain (32) and (33). The proof is completed.
Lemma 12**.**
For all for the perturbation control , we have
[TABLE]
Proof.
Recall (2), (4), (8), and (9), we have
[TABLE]
Using Lemma 10 and Lemma 11 as well as , for some constant independent of which may vary line by line in the following, we have
[TABLE]
[TABLE]
With similar argument, we can show that
[TABLE]
Hence, we get the desired result.
Proof of Theorem 5: Now, we consider the -Nash equilibrium for for Problem (CC). Combining Lemma 9 and Lemma 12, we have
[TABLE]
Consequently, Theorem 5 holds with \epsilon=O\Big{(}\frac{1}{\sqrt{N}}\Big{)}.
Appendix A Proof of theorem
Proof of Theorem 2.
(Uniqueness) Suppose that there exists two solutions:
and denote
[TABLE]
Then, we have
[TABLE]
with
[TABLE]
Taking the expectation in the second equation of (34) yields . Applying Itô’s formula to \big{<}\hat{q},\hat{x}\big{>}-\big{<}\hat{p},\hat{y}\big{>} and taking expectations on both sides (also, noting which derives that and the monotonicity property of ), we arrive at
[TABLE]
Thus, G\big{(}\hat{x}_{T}-\mathbb{E}\hat{x}_{T}\big{)}=0, Q\big{(}\hat{x}-\mathbb{E}\hat{x}\big{)}=0, and which according to the uniqueness and existences of classical BSDE theory implies Next, we have which further implies hence Moreover, yields, by Theorem 3.1 in [6], Hence the uniqueness follows.
In order to prove the existence for FBSDE (10), we need the following result. It involves a priori estimates of solutions of the following family of mean fields FBSDEs parameterized by
Before that, we denote
[TABLE]
Consider the following a family of FBSDEs with parameter
[TABLE]
where and ( is a -valued (-valued) square integrable random varible which is -measurable. Note the coefficient It is easy to check then by uniqueness of BSDE, Specifically, letting one immediately has
[TABLE]
Obviously, (36) is kind of decoupled FBSDEs whose solvability is trivial.
Lemma 13**.**
Assume that (A1) and (A2) are in force, there exists a positive constant such that if, a priori, for some , for each mean field FBSDEs (35) have a unique adapted solution in , then for each , for each Eq. (35) also have a unique solution in .
Proof.
Define
[TABLE]
We set and solve iteratively the following equations:
[TABLE]
We set
[TABLE]
with
[TABLE]
Now introduce a map by the following mean fields FBSDEs:
[TABLE]
Applying the Itô’s formula to on we have
[TABLE]
After simple computation, we have
[TABLE]
By using the monotonicity property of (Proposition 17 below and the classical geometric inequality and Lipschitz property of projection operator (Proposition 16), it follows that.
[TABLE]
Next by baisc technique in SDE, BSDE, we have
[TABLE]
[TABLE]
and
[TABLE]
Moreover,
[TABLE]
Observe that inequality (40) does not contain and Combining (39)-(43), by similar method used in [20], we have, for some
[TABLE]
which means that the map is a contraction.
(Existence) We can solve Eq. (35) successively for the case When we deduce immediately that the solution to Eq. (10) exists.
Appendix B Properties of projection
We recall the following properties of projection onto a closed convex set , see [9], Chapter 5.
Theorem 14**.**
For a nonempty closed convex set , for every , there exists a unique , such that
[TABLE]
Moreover, is characterized by the property
[TABLE]
The above element is called the projection of onto and is denoted by .
One can immediately obtain the following
Proposition 15**.**
Let be a nonempty closed convex set, then we have
[TABLE]
Proposition 16**.**
Let be a nonempty closed convex set, then the projection does not increase the distance, i.e.
[TABLE]
Now let us consider and the projection both with the norm , from (45), we have
Proposition 17**.**
Let be a nonempty closed convex set, then
[TABLE]
The proofs of Proposition 15-Proposition 17 can be found in [7, 9].
Conflict of Interest: The authors declare that they have no conflict of interest.
Acknowledgements. The authors wish to thank the editors and two referees for their valuable comments and constructive suggestions which improved the presentation of this manuscript.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Andersson and B. Djehiche, A maximum principle for SD Es of meanfield type, Appl. Math. Optim ., vol. 63, 341–356, 2011.
- 2[2] J. Bismut, “An introductory approach to duality in optimal stochastic control,” SIAM Rev ., vol. 20, 62–78, 1978.
- 3[3] M. Bardi, Explicit solutions of some linear-quadratic mean field games, Netw. Heterog. Media , vol. 7, pp. 243–261, 2012.
- 4[4] R. Buckdahn, B. Djehiche, and J. Li, A general stochastic maximum principle for SD Es of mean-field type, Appl. Math. Optim ., vol. 64, 197–216, 2011.
- 5[5] R. Buckdahn, B. Djehiche, J. Li, S.Peng, Mean-field backward stochastic differential equations. A limit approach. Ann. Probab . 37 (4), 1524-1565, (2009).
- 6[6] R. Buckdahn, J. Li, S. Peng, Mean-field backward stochastic differential equations and related partial differential equations. Stoch. Process. Appl . 119 (10), 3133-3154, (2009).
- 7[7] A.V. Balakrishnan. Applied Functional Analysis . Springer-Verlag, New York, 1976.
- 8[8] A. Bensoussan, J. Frehse and S. Yam. Mean Field Games and Mean Field Type Control Theory . Springer, New York, 2013.
