Linear quadratic problems for fully coupled forward-backward stochastic control systems
Mingshang Hu, Shaolin Ji, Xiaole Xue

TL;DR
This paper develops a new approach to solve fully coupled forward-backward stochastic linear quadratic control problems with indefinite costs, deriving a state feedback form of the optimal control through novel decoupling and differential equations.
Contribution
Introduces a new decoupling technique and non-Riccati-type ODEs to obtain the optimal control for fully coupled FBLQ problems with indefinite costs.
Findings
Established existence of solutions for the derived ODEs.
Derived the state feedback form of the optimal control.
Illustrated results with special case examples.
Abstract
This paper is concerned with optimal control of stochastic fully coupled forward-backward linear quadratic (FBLQ) problems with indefinite control weight costs. In order to obtain the state feedback representation of the optimal control, we propose a new decoupling technique and obtain one kind of non-Riccati-type ordinary differential equations (ODEs). By applying the completion-of-squares method, we prove the existence of the solutions for the obtained ODEs under some assumptions and derive the state feedback form of the optimal control. For this FBLQ problem, the optimal control depends on the entire trajectory of the state process. Some sepcial cases are given to illustrate our results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Insurance, Mortality, Demography, Risk Management · Risk and Portfolio Optimization
Linear quadratic problems for fully coupled forward-backward stochastic
control systems
Mingshang Hu Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, Shandong 250100, PR China. [email protected]. Research supported by NSF (No. 11671231) and Young Scholars Program of Shandong University (No. 2016WLJH10).
Shaolin Ji Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, Shandong 250100, PR China. [email protected] (Corresponding author). Research supported by NSF No. 11571203.
Xiaole Xue Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, Shandong 250100, China. [email protected], [email protected]. Research supported by NSF (Nos. 11701214 and 11801315) and Natural Science Foundation of Shandong Province(ZR2018QA001)
Abstract
This paper is concerned with optimal control of stochastic fully coupled forward-backward linear quadratic (FBLQ) problems with indefinite control weight costs. In order to obtain the state feedback representation of the optimal control, we propose a new decoupling technique and obtain one kind of non-Riccati-type ordinary differential equations (ODEs). By applying the completion-of-squares method, we prove the existence of the solutions for the obtained ODEs under some assumptions and derive the state feedback form of the optimal control. For this FBLQ problem, the optimal control depends on the entire trajectory of the state process. Some sepcial cases are given to illustrate our results.
Key words. fully coupled forward-backward stochastic differential equation, linear quadratic optimization control, stochastic maximum principle, completion-of-squares method
AMS subject classifications. 93E20, 60H10, 35K15
1 Introduction
The fully coupled forward-backward stochastic differential equations (FBSDEs) are an important class of stochastic differential equations and there are many literatures on the well-posedness of them. When the coefficients of a fully coupled FBSDE are deterministic and the diffusion coefficient of the forward equation is nondegenerate, Ma, Protter and Yong [13] proposed the four-step scheme approach. Under some monotonicity conditions, Hu and Peng [9] first obtained an existence and uniqueness result which was generalized by Peng and Wu [19]. Yong [24] developed this approach and called it the method of continuation. The fixed point approach is due to Antonelli [1], Pardoux and Tang [17]. The readers may refer to Ma and Yong [15], Cvitanić and Zhang [5], Ma, Wu, Zhang and Zhang [14], Yong and Zhou [26] for the FBSDE theory.
As a well-defined dynamic system, it is appealing to investigate the optimal control of the fully coupled FBSDEs. In this paper, the optimal control of a linear fully coupled FBSDE with a quadratic criteria is investigated. We call this kind of problem the stochastic forward-backward linear-quadratic (FBLQ) problem.
It is well-known that the stochastic linear-quadratic (LQ) problems play an important role in optimal control theory. On one hand, many nonlinear control problems can be approximated by the LQ control problems; on the other hand, solutions to the LQ control problems show elegant properties because of their brief and beautiful structures. Stochastic LQ regulator problems have been first studied by Wonham [22] and by many researchers later [2, 20, 21, 10]. Most of them imposed the positiveness for the coefficient of the control in the cost functional. Chen, Li and Zhou found even when the coefficient is negative, the stochastic control problem is still well-posed (see [3, 4]). For stochastic LQ problems, one method is applying the stochastic maximum principle to obtain the optimal control and then solving the corresponding Hamiltonian system by a decoupling technique which leads to a Riccati equation. Finally the optimal control is expressed in the form of state feedback. Another method is the completion-of-squares method which yields the same Riccati equation and state feedback form of the optimal control. Dokuchaev and Zhou [6] first proposed the stochastic backward linear-quadratic (BLQ) problem in which the state equation is described by a backward stochastic differential equation (BSDE). Applying the completion-of-squares method and the decoupling method, Lim and Zhou [12] completely solved it and obtained the state feedback representation.
Up to our knowledge, there are only a few results for the stochastic FBLQ problem and except some special examples in the literatures, there are no systematical results related to the state feedback form of the optimal control. Our main contribution of this paper is to obtain the state feedback form of the optimal control for the FBLQ problem. After applying the stochastic maximum principle, we find that the decoupling technique for stochastic LQ and BLQ problems is no longer applicable to the FBLQ problem. In more details, for the stochastic FBLQ problem, the obtained Hamiltonian system (3.1) consists of two parts: (the forward state process and its backward adjoint process ) and (the backward state process and its forward adjoint process ). Both of them are fully coupled FBSDEs. Following the decoupling method for the stochastic LQ problem, we try to decouple the above Hamiltonian system by
[TABLE]
In other words, we want to use the state process to represent the adjoint process . But after calculation, we can’t get the Riccati-type equations for , through this decoupling approach. To overcome this difficulty, we propose the following new decoupling technique: we regard the forward stochastic differential equation (SDE) as the state process, the BSDE as the adjoint process and decouple the Hamiltonian system (3.1) by
[TABLE]
Using the above decoupling technique, we derive the equations for , , , and obtain the optimal control which can be explicitly expressed as a feedback form of the state process (see Corollary 3.3).
Although we can decouple the Hamiltonian system (3.1) by (1.1), the obtained equations for , are no longer Riccati-type ones. They are highly nonlinear ordinary differential equations (ODEs) and the solvability of them is challenging. In this paper, we propose a project to obtain the existence of the solutions , . We first construct a sequence of Riccati equations for . Then, applying the completion-of-squares method, we establish the the relations between , and (see Theorem 4.4) which are different from the stochastic LQ and BLQ problems. With the help of these relations and the good properties of , we obtain the existence of the solutions , . Especially, we relax the positiveness of the control weight in the cost functional as in Chen et. al [3, 4]. For this indefinite case, the control obtained by our decoupling technique is only a candidate of the optimal control. By applying the completion-of-squares method, it can be verified that is indeed the optimal control of the FBLQ problem. Furthermore, although the optimal control for the FBLQ problem may not be unique, we can still prove that the optimal state feedback optimal control law is unique (see Theorem 5.2). Finally, it is worth pointing out that we can’t solve the FBLQ problem by the decoupling method or the completion-of-squares method alone.
The rest of the paper is organized as follows. In Section 2, we give the preliminaries and the formulation of the FBLQ problem. A new decoupling technique is introduced in Section 3. Applying the completion-of-squares method, we prove the existence and uniqueness results for non-Riccati-type equations in Section 4. In Section 5, we obtain the feedback optimal control for the FBLQ problem. Several special cases are given to illustrate our results in Section 6.
2 Preliminaries and formulation of FBLQ problem
Let be a complete probability space on which a standard -dimensional Brownian motion is defined. Assume that is the -augmentation of the natural filtration of , where contains all -null sets of . Denote by the -dimensional real Euclidean space and the set of real matrices. Let (resp. ) denote the usual scalar product (resp. usual norm) of and . The scalar product (resp. norm) of , is denoted by (resp.), where the superscript ⊺ denotes the transpose of vectors or matrices.
For each given , we introduce the following spaces.
: the space of all symmetric matrices;
: the subspace of all nonnegative definite matrices of ;
: the subspace of all positive definite matrices of ;
: the space of -measurable -valued random vectors such that
[TABLE]
: the space of -measurable -valued random vectors such that
[TABLE]
: the space of essential bounded measurable -valued functions;
: the space of continuos -valued functions;
: the space of -adapted -valued stochastic processes on such that
[TABLE]
: the space of -adapted -valued stochastic processes on such that
[TABLE]
: the space of -adapted -valued stochastic processes on such that
[TABLE]
: the space of -adapted -valued continuous stochastic processes on such that
[TABLE]
Consider the following linear forward-backward stochastic control system
[TABLE]
and minimizing the following cost functional
[TABLE]
where , , , are deterministic matrix-valued functions of suitable sizes, , , , are , , valued matrices respectively. To simplify the presentation, we only consider the case . The results for are similar. The solution to (2.1) is . The admissible control set is all the elements in . Let be an admissible control, and the corresponding state is .
Let be an optimal control and be the corresponding optimal state. Then by stochastic maximum principle (see [18, 23, 7]), the optimal control satisfies
[TABLE]
where
[TABLE]
Assumption 2.1
For any , (2.1)(resp. (2.4)) has a unique solution in (resp. ).
Remark 2.2
It is well-known that there are many conditions which can guarantee the existence and uniqueness of (2.1) and (2.4) (see [15], [5], [19], [7]) such as monotonicity conditions or weakly coupled conditions and so on.
Assumption 2.3
The data appearing in the FBLQ problem satisfy , , , , for , , , , , , , , , , , , .
Sometimes we need the data to satisfy the following assumptions:
Assumption 2.4
, , , .
Assumption 2.5
, .
Note that (2.3) becomes a sufficient condition for the optimal control under some positiveness assumptions on the coefficients.
Theorem 2.6
(see [16, 8])Suppose that Assumptions 2.1, 2.3, 2.4 and 2.5 hold. If there exists an admissible control satisfying (2.3), where is defined in (2.4), then is the unique optimal control for the FBLQ problem (2.1)-(2.2).
In the rest of this paper, sometimes we write for a (deterministic or stochastic) process, omitting the variable , whenever no confusion arises. Under this convention, when means , .
3 A new decoupling technique for FBLQ problem
3.1 FBLQ problem with positive definite control weight
cost
In this subsection, we only consider the FBLQ problem (2.1)-(2.2) with positive definite control weight cost. In other words, we assume that . The Hamiltonian system for the FBLQ problem is
[TABLE]
Set
[TABLE]
Due to (2.3), we have . Then the Hamiltonian system (3.1) can be rewritten as
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
In order to obtain the state feedback form of the optimal control, the following new decoupling technique is introduced: we conjecture that and are related by
[TABLE]
with and . Applying the same steps as in Section 4 of [25] or Appendix in [7], we obtain satisfies the following matrix ODE
[TABLE]
and satisfies the following linear BSDE
[TABLE]
where
[TABLE]
Set
[TABLE]
[TABLE]
[TABLE]
where , , , are -valued, , , , are -valued, , , , are -valued, , , , are -valued, , , , .
Theorem 3.1
Suppose that Assumptions 2.1, 2.3, 2.4 and 2.5 hold. Moreover, suppose that (3.3) has a solution such that is invertible and . Then Problem (2.1)-(2.2) has a unique optimal control
[TABLE]
where is the solution to the following SDE
[TABLE]
Furthermore, the solution to (3.1) with respect to defined in (3.5) satisfies
[TABLE]
Proof. By , one has that and (3.4) is a BSDE with Lipschitz coefficients. Then (3.4) has a unique solution . It yields that the stochastic differential equation (3.6) admits a unique strong solution . Thus the control defined in (3.5) is admissible. Putting this into (3.1) and reversing the above decoupling technique, it can be verified that defined in (3.7) solves (3.1) and satisfies (2.3). By Theorem 2.6, this is the unique optimal control. This completes the proof.
Remark 3.2
We give a sufficient condition which guarantee the existence of solution to (3.3) in Corollary 4.9.
Corollary 3.3
(i) Under the same assumptions as in Theorem 3.1, if in (3.3) is invertible on , then
[TABLE]
and
[TABLE]
(ii) If , then the optimal control for the fully coupled forward-backward control system in Theorem 3.1 depends only on . Moreover, has the following closed-form:
[TABLE]
where , , , , and is the solution of the following linear equation:
[TABLE]
This corollary can be directly derived from Theorem 3.1. So we omit the proof.
Remark 3.4
By Corollary 3.3, the optimal control at time depends on the entire past history of the state process . This is different from the classical stochastic LQ problems. Furthermore, if in (3.3) is invertible on , then the optimal control at time will depend only on the current state pair
3.2 FBLQ problem with indefinite control weight cost
In this subsection, we relax the assumption and deduce formally the following non-Riccati-type equations (3.19), (3.23) and (3.24) which play an important role in solving the FBLQ problem (see Section 5).
Set
[TABLE]
where is the solution to Hamiltonian system (3.1), satisfy some ODEs which will be determined later, and , satisfies the following BSDE
[TABLE]
Applying Itô’s formula to , in (3.8) and comparing with the diffusion terms of the equation (3.1), we have
[TABLE]
[TABLE]
where
[TABLE]
Combining (3.9) and (3.10), we have
[TABLE]
where
[TABLE]
Putting them into (2.3), we obtain
[TABLE]
where
[TABLE]
Remark 3.5
Instead of requiring , here we assume that is invertible.
From (3.9)-(3.14), we deduce that
[TABLE]
where
[TABLE]
Now we determine the equations satisfied by , . We first put (3.8), (3.14) and (3.16) into (3.1) and obtain a new form of the Hamiltonian system (3.1). Then applying Itô’s formula to in (3.8) and comparing with the drift term of the new form of (3.1), we have
[TABLE]
Hence, , and should be solutions of
[TABLE]
[TABLE]
[TABLE]
respectively. Applying Itô’s formula to in (3.8) and comparing with the drift term of the new form of (3.1), we have
[TABLE]
, and should be solutions of
[TABLE]
[TABLE]
[TABLE]
respectively. It can be verified that the equation (3.19), (3.24) are symmetric and (3.23) is indeed the transpose of (3.20).
Remark 3.6
If and , then the following relations hold:
[TABLE]
4 Non-Riccati-type equations
In this section, we study the existence and uniqueness results for solutions to non-Riccati-type equations (3.19), (3.23) and (3.24).
4.1 Auxiliary Riccati-type equations
Our aim of this subsection is to reveal the origin of the following auxiliary Riccati equation (4.5) and (4.3). Hence we will present the material in this subsection in an informal way although they can be verified rigorously.
We first introduce an auxiliary stochastic LQ problem which leads to a Riccati-type equation for . Then the relations between and are deduced and with the help of good properties of , we will obtain the existence results for the solutions of (3.18)-(3.25).
Inspired by [15, 11, 12], for the FBLQ problem (2.1)-(2.2), we regard the BSDE as a controlled forward SDE and the term as a control. Thus, it becomes a forward LQ problem. Set and . The state equation becomes
[TABLE]
and the cost functional becomes
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
Now we solve the above LQ problem by the completion-of-squares technique similar as in Theorem 3.1 in [4]. Suppose that satisfies the following BSDE
[TABLE]
where will be determined later. For a function to be determined, applying Itô’s formula to
[TABLE]
we have
[TABLE]
where
[TABLE]
Thus, we can obtain the form of the Riccati equation and the optimal control as following:
[TABLE]
[TABLE]
and
[TABLE]
Set
[TABLE]
By the relationship between the adjoint process and the state process for stochastic LQ problems, we have
[TABLE]
Comparing (4.7) with (3.8), we obtain the relations between , and , as following:
[TABLE]
[TABLE]
or the equivalent form
[TABLE]
[TABLE]
Note that which makes meaningless. So we need to modify the terminal conditions of , and , . For , consider the solutions
[TABLE]
to equations (3.19), (3.23), (3.24), (3.21), (3.25) with the terminal conditions
[TABLE]
Correspondingly, we consider the Riccati equation (4.5) and (4.3) for
[TABLE]
with terminal conditions
[TABLE]
Remark 4.1
In fact, (4.5) and (4.3) with terminal conditions (4.9) correspond to the following stochastic control problem: the state equation is (4.1) and the cost functional is
[TABLE]
Theorem 4.4 justifies the above heuristic derivation.
Assumption 4.2
There exist a natural number such that for , (4.5) has a positive definite solution which satisfies the terminal condition (4.9).
Remark 4.3
Under the assumption and , it is easy to check that . Then, by Theorem 4.1 in [4], Assumption 4.2 holds for .
Set
[TABLE]
Theorem 4.4
Suppose that Assumptions 2.3, 2.4 and 4.2 hold. For , define
[TABLE]
[TABLE]
where and are solutions to (4.5) and (4.3). Suppose that and exist. Then the above defined solves (3.19), (3.23), (3.24) and solves (3.21), (3.25) with (4.8) for each .
We put the proof in Appendix 7.1.
Lemma 4.5
Under the same assumptions as Theorem 4.4, for , we have
[TABLE]
[TABLE]
The proof is in Appendix 7.2. This lemma will be used in the proof of Theorem 5.2.
Remark 4.6
If and , then it can be verified that Assumption 4.2 holds. If and , then and in Theorem 4.4 exist.
4.2 Existence and uniqueness results
In this subsection, we study the solvability of (3.19), (3.23), (3.24) by Theorem 4.4.
Lemma 4.7
Suppose and are solutions to Riccati equation (4.5) with terminal conditions , then for .
Proof. By Theorem 6.1 in [26], the value function of the corresponding LQ problem is (resp. ) for all . The proof can be obtained from .
Theorem 4.8
Suppose that the same assumptions as Theorem 4.4 hold and is bounded for each . Then , and for . Moreover, suppose that has upper bound and , , and are uniformly bounded for each . Then (3.19), (3.23), (3.24) have a unique solution .
Proof. It can be verified that
[TABLE]
By Lemma 4.7, we have which yields that and . Moreover, note that
[TABLE]
By the relationship (4.11), we obtain that and .
Thus, (resp. ) is a bounded deceasing (resp. increasing) sequence in (resp. ) and therefore has a limit. The convergence of can be obtained by the following Proposition 4.10. Denote by the limit of . By the bounded convergence theorem, one can obtain that is the solution to (3.19), (3.23), (3.24). This completes the proof.
Corollary 4.9
Suppose that Assumptions 2.3, 2.4 and 2.5 hold. Moreover, suppose that has upper bound and , and are uniformly bounded for each . Then the equation (3.3) has a unique solution.
Proof. By Remark 4.6, Assumption 4.2 holds. Since , it is easy to verify that for each . By Remark 3.6 and Theorem 4.8, then the equation (3.3) has a unique solution.
Proposition 4.10
Suppose that all assumptions in Theorem 4.8 hold. Then for , we have
[TABLE]
where is the limit of , and is a constant independent of .
Proof. Set , , . By (3.19), (3.23), (3.24) and the boundedness assumptions, we have
[TABLE]
where is a constant independent of . Then, by Gronwall’s inequality we have
[TABLE]
where
Remark 4.11
Since is increasing and is decreasing, the sequence is not monotonic which is different from the indefinite stochastic LQ problem in [3]. With the help of the solutions to the auxiliary Riccati equations, we study the components of and prove the existence of the solutions to (3.19), (3.23), (3.24).
Now we give an example to show that there exists a unique solution to (3.19), (3.23), (3.24).
Example 4.12
Consider a special case of problem (2.1)-(2.2) in which the controlled system is governed by a partially coupled FBSDE. Suppose that , , , and . Due to , and are invertible and bounded. It is easy to verify the other assumptions in Theorem 4.8 except that has upper bound and is bounded. Note that satisfies the following equations:
[TABLE]
[TABLE]
[TABLE]
where
[TABLE]
By Theorem 4.8, is bounded. Then one can check that
[TABLE]
where is a constant independent of . By Gronwall’s inequality, is bounded. Because
[TABLE]
where is a constant independent of , we deduce that has a upper bound by Gronwall’s inequality. Thus, (3.19), (3.23), (3.24) have a unique solution by Theorem 4.8.
5 Feedback optimal control for FBLQ problem
In this section, we prove the existence of optimal control without the positiveness of and . We first give the following lemma.
Lemma 5.1
Suppose all assumptions in Theorem 4.8 hold. Then is uniformly bounded, where is defined by replacing with in (4.4).
The proof is in Appendix 7.3.
Theorem 5.2
Suppose Assumption 2.1 and all assumptions in Theorem 4.8 hold, and , for . Then there exists an optimal control for the FBLQ problem (2.1)-(2.2). Furthermore, any optimal control satisfies
[TABLE]
where , , and , , are defined in (3.15).
Proof. By Theorem 4.8, solves the equations (3.19), (3.23), (3.24). Then there exists a unique solution to the BSDE (3.21) and (3.25). Set
[TABLE]
Consider the following linear SDE for :
[TABLE]
Since (5.2) has bounded coefficients, it has a unique solution . Set
[TABLE]
which is an admissible control. It can be verified that
[TABLE]
solves the Hamiltonian system (3.1). Now we prove that is an optimal control in two steps.
Step 1: For , set
[TABLE]
For any given , by Theorem 4.4, solves the equation (4.5) on . By the completion-of-squares technique, we have
[TABLE]
The part is simplified as follows.
[TABLE]
where
[TABLE]
One can check that which implies .
Then, we prove the part converges to [math] as . Noting that , we have
[TABLE]
The part converges to [math] as due to the integrability of , and . By Lemma 4.5, (5.1) and (5.4), we deduce that the part equals to [math].
Finally, by Lemma 5.1 and letting on both sides of (5.5), we have
[TABLE]
Step 2. We first give a lower bound for the cost functional by the completion-of-squares technique. For an admissible control , let be the corresponding state process. Set . Applying Itô’s formula to
[TABLE]
and taking expectations, we have
[TABLE]
where and are defined by replacing with in (5.6). By the completion-of-squares technique,
[TABLE]
Note that . Letting and appealing to Fatou’s lemma and Lemma 5.1, we have
[TABLE]
Since achieves the lower bound, it is clear that is optimal.
For any other optimal control , by (5.8) and , we have
[TABLE]
By Lemma 4.5, we obtain (5.1). This completes the proof.
In the following we solve a special case of the FBLQ problem in which and .
Example 5.3
Suppose that all variables are -dimensional. For the FBLQ problem (2.1)-(2.2), suppose that and . Then the solutions to (3.19), (3.23) and (3.24) are , and satisfies
[TABLE]
Suppose that , , , and where
[TABLE]
By Comparison theorem we have which leads to . Then, by Theorem 4.8 , , has a unique solution. Moreover,
[TABLE]
It is obvious that for . Thus, by Theorem 5.2 the optimal control is
[TABLE]
Remark 5.4
Although the forward-backward stochastic control system in the above example is completely decoupled, in order to obtain the optimal control in (5.9) we still need to solve a fully coupled FBSDE.
6 Some special cases
In this section, we illustrate our results for the indefinite stochastic LQ, BLQ and deterministic FBLQ problems.
6.1 Indefinite stochastic LQ problem
If , then the FBLQ problem (2.1)-(2.2) degenerates to the following indefinite stochastic LQ problem as in [3]: minimizing the following cost functional
[TABLE]
subject to
[TABLE]
By Theorem 5.2, the optimal control is
[TABLE]
where
[TABLE]
The state feedback representation of the optimal control and the Riccati equation for are just the corresponding ones in Theorem 3.2 in Chen, Li and Zhou [3].
6.2 BLQ problem
If and , then the problem (2.1)-(2.2) degenerates to the following BLQ problem as in [12]: minimizing the following cost functional
[TABLE]
subject to
[TABLE]
By Theorem 3.1, the optimal control is
[TABLE]
and the following relation holds:
[TABLE]
where
[TABLE]
[TABLE]
The equation for is just the Riccati equation (3.4) in Lim and Zhou [12]. And the optimal control is consistent with the one in Theorem 3.3 in [12].
Remark 6.1
It is worth pointing out that our results in this paper can be also applied to the indefinite BLQ problem.
6.3 Deterministic FBLQ problem
If and , then the problem (2.1)-(2.2) degenerates to a deterministic FBLQ problem. For this case, (3.19), (3.23), (3.24) become
[TABLE]
By Theorems 3.1 and 5.2, we obtain the following proposition.
Proposition 6.2
Suppose that Assumptions 2.1, 2.3, 2.4 and 2.5 hold. If (6.1) has a solution such that for , then the above deterministic FBLQ problem has a unique optimal control
[TABLE]
For -dimensional case (), if , and is large enough such that is non-negative and bounded, then (6.1) has a unique solution. The reason is that when , and , we have
[TABLE]
Thus, is bounded and we obtain the desired result due to Theorem 4.8.
7 Appendix
This appendix is devoted to proofs of Theorem 4.4, Lemma 4.5 and Lemma 5.1. Before giving the proofs, let’s give some notations.
Set
[TABLE]
[TABLE]
where
[TABLE]
And set
[TABLE]
where
[TABLE]
7.1 Proof of Theorem 4.4
Before we prove Theorem 4.4, we list the following relations which can be verified directly:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Proof of Theorem 4.4: The proof is divided into five steps. The first three steps we verify the relationship between and , that is,
[TABLE]
or equivalently
[TABLE]
Recall that the equations satisfied by are
[TABLE]
[TABLE]
[TABLE]
Step 1: In this step we verify .
satisfies the following equation:
[TABLE]
Note that and are governed by the same equations except the terminal conditions. Putting the relation (7.15) into (7.19) and comparing with (3.24), we need to verify
[TABLE]
We compare the coefficients of and the remainder terms on both sides of the above equation. The coefficient of on the left hand side (LHS) is
[TABLE]
and the one on the right hand side (RHS) is
[TABLE]
The remainder terms on the LHS is
[TABLE]
and the ones on the RHS is
[TABLE]
By relations (7.8), (7.9), (7.12) and (7.15), we obtain that (7.20) holds.
Step 2: In this step we verify .
satisfies the following equation:
[TABLE]
Putting the relation (7.15) into (7.21) and comparing with (3.23), we only need to verify
[TABLE]
The coefficient of on the LHS is
[TABLE]
and the one on the RHS is
[TABLE]
By calculation, we need to prove
[TABLE]
which has already been verified in Step 1. The remainder terms on the LHS is
[TABLE]
and the ones on the RHS is
[TABLE]
By relations (7.8), (7.11), (7.12), (7.13) and (7.15), we obtain that (7.22) holds.
Step 3: In this step we verify .
Since , , we have
[TABLE]
Deriving on both sides of the above equation,
[TABLE]
Putting the relation (7.15) into (7.23) and comparing with (3.19), we need to verify
[TABLE]
that is
[TABLE]
The coefficient of on the LHS is
[TABLE]
and the one on the RHS is
[TABLE]
By (7.9), (7.13), (7.14), we derive
[TABLE]
The remainder terms on the LHS is
[TABLE]
and the ones on the RHS is
[TABLE]
By the definition of and
[TABLE]
the remainder terms on both sides are consistent.
In the following two steps, we verify the relationship between and , that is,
[TABLE]
Since , the above relations are equivalent to
[TABLE]
and
[TABLE]
In the completion-of-squares technique, satisfies
[TABLE]
Then
[TABLE]
and we need to verify the following two equalities:
[TABLE]
and
[TABLE]
**Step 4: **Verification of (7.25):
The equation (7.25) can be simplified to
[TABLE]
Then we compare the coefficients of , , and on both sides of the above equation. The coefficient of on the LHS is
[TABLE]
and the one on the RHS is
[TABLE]
The coefficient of on the LHS is
[TABLE]
and the one on the RHS is
[TABLE]
The coefficient of on the LHS is
[TABLE]
and the one on the RHS is The coefficient of on the LHS is and the one on the RHS is By the notations in (7.3) and (7.15), we obtain (7.25) holds.
**Step 5: **Verification of (7.24):
The equation (7.24) can be simplified to
[TABLE]
By comparing the coefficients of , , and on both sides of the above equation, we deduce that (7.24) holds.
7.2 Proof of Lemma 4.5
From (3.8) and (4.6), the optimal control has the following form
[TABLE]
The following relations can be verified directly:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
By notations in (7.3), (7.10), (7.13) and (7.14), it can be verified that
[TABLE]
Before proving Lemma 4.5, we give the following lemma:
Lemma 7.1
Under the same assumptions as Theorem 4.4, for , we have
[TABLE]
Proof. We first prove the equality for . Compare the coefficients of , , and for in (7.35) with the ones in (3.15). By the notations in (7.3)-(7.14) and (7.26)-(7.33), we obtain the equality for holds.
Then, we prove the equality for . Putting , and into , the equality for becomes
[TABLE]
Compare the coefficients of , , and for in (7.35) with the ones in (7.36). By the notations in (7.3)-(7.14) and (7.26)-(7.33), we obtain the equality for holds.
Proof of Lemma 4.5: By the notations in (7.1) and (7.2), we have
[TABLE]
By the notations in (7.34), we obtain the first relation in Lemma 4.5 holds.
From the relationship of , , , and , we have
[TABLE]
Due to (7.34) and Lemma 7.1, we obtain
[TABLE]
This completes the proof.
7.3 Proof of Lemma 5.1
It can be verified that
[TABLE]
[TABLE]
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Proof of Lemma 5.1: By Lemma 4.5, and the relations between , , and , we have
[TABLE]
It can be verified that the following two equalities hold
[TABLE]
[TABLE]
[TABLE]
Under the bounded assumptions in Lemma 5.1, one can check that , , , , , , , are uniformly bounded. Thus is uniformly bounded. This completes the proof.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Antonelli, Backward-forward stochastic differential equations, Ann. Appl. Probab. 3 (1993):pp. 777-793.
- 2[2] A. Bensoussan,. Lectures on stochastic control. Nonlinear filtering and stochastic control. Springer, Berlin, Heidelberg, 1982. pp. 1-62.
- 3[3] S. Chen, X. Li and X.Y. Zhou. Stochastic linear quadratic regulators with indefinite control weight costs. SIAM Journal on Control and Optimization 36(5) (1998):pp. 1685-1702.
- 4[4] S. Chen,and X.Y. Zhou. Stochastic linear quadratic regulators with indefinite control weight costs. II. SIAM Journal on Control and Optimization, 39(4), (2000) pp.1065-1081.
- 5[5] J. Cvitanić and J. Zhang, Contract theory in continuous-time models. Springer-Verlag, 2013.
- 6[6] M. Dokuchaev and X.Y. Zhou, Stochastic controls with terminal contingent conditions. J. Math. Anal. Appl. 238(1) (1999):pp. 143-165.
- 7[7] M. Hu, S. Ji and X. Xue, A global stochastic maximum principle for fully coupled forward-backward stochastic systems, SIAM J. Control Optim. 56(6) (2018):pp. 4309-4335.
- 8[8] J. Huang and J. Shi, Maximum principle for optimal control of fully coupled forward-backward stochastic differential delayed equations. ESAIM: Control, optimisation and calculus of variations, 18(4) (2012):pp. 1073-1096.
