Optimal Control of Markov Regime-Switching Stochastic Recursive Utilities
Liangquan Zhang, Xun Li

TL;DR
This paper develops a stochastic maximum principle for optimal control of systems with Markov regime-switching and recursive utilities, addressing non-convex control domains and control-dependent diffusion terms.
Contribution
It introduces new first and second-order adjoint equations and derives variational equations for forward-backward stochastic differential equations in this context.
Findings
Established a general maximum principle for complex stochastic control systems.
Derived novel second-order adjoint equations involving solutions of second-order adjoint.
Provided illustrative examples demonstrating the theoretical results.
Abstract
In this paper, we establish a general stochastic maximum principle for optimal control for systems described by a continuous-time Markov regime-switching stochastic recursive utilities model. The control domain is postulated not to be convex, and the diffusion terms depend on control variables. To this end, we first study a kind of classical forward stochastic optimal control problems. Afterwards, based on previous results, we introduce two groups of new first and second-order adjoint equations. The corresponding variational equations for forward-backward stochastic differential equations are derived. In particular, the generator in the maximum principle contains solutions of second-order adjoint equation which is novel. Some interesting examples are concluded as well.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Insurance, Mortality, Demography, Risk Management · Risk and Portfolio Optimization
Optimal Control of Markov Regime-Switching Stochastic Recursive Utilities
Liangquan Zhang1 and Xun Li2
- School of Science
Beijing University of Posts and Telecommunications
Beijing 100876, China
- Department of Applied Mathematics
The Hong Kong Polytechnic University
Hong Kong, China L. Zhang acknowledges the financial support partly by the National Nature Science Foundation of China (Grant No. 11701040, 11871010 & 61871058) and Innovation Foundation of BUPT for Youth (No. 500417024 & 505018304). E-mail: [email protected]. Li acknowledges the financial support partly by Hong Kong RGC under grants 15209614, 15255416 and 15213218. Email: [email protected]
Abstract
In this paper, we establish a general stochastic maximum principle for optimal control for systems described by a continuous-time Markov regime-switching stochastic recursive utilities model. The control domain is postulated not to be convex, and the diffusion terms depend on control variables. To this end, we first study a kind of classical forward stochastic optimal control problems. Afterwards, based on previous results, we introduce two groups of new first and second-order adjoint equations. The corresponding variational equations for forward-backward stochastic differential equations are derived. In particular, the generator in the maximum principle contains solutions of second-order adjoint equation which is novel. Some interesting examples are concluded as well.
AMS subject classifications: 93E20, 60H15, 60H30.
**Key words: ** Stochastic maximum principle, Regime-switching, Adjoint equations, Spike variation.
1 Introduction
A large number of literatures focus on applications of Markov regime-switching models in finance as well as stochastic optimal control since the fundamental feature brings from the Markov regime-switching models in contrast to the traditional models described via the diffusion processes from the empirical point of view. The basic idea of regime-switching is to modulate the model with a continuous-time and finite-state Markov chain where each state represents a regime of the system or level of the economic indicator. The regime-switching models have been variously applied in stock trading, option pricing, portfolio selection, risk measurement, etc (see [21, 29, 35, 36, 32] references therein).
It is well known that stochastic optimal control is one of the central themes of modern control sciences. Necessary conditions for the optimal control of the forward continuous-time stochastic system, which are the so-called stochastic maximum principle of Pontraygin’s type, have been extensively studied since early 1960s. As a matter of fact, the basic idea of the stochastic maximum principle is to derive a set of necessary conditions that must be fulfilled by any optimal control. The stochastic maximum principle describes that any optimal control must satisfy a system of forward-backward stochastic differential equations (FBSDEs, for short), called the optimality system, and minimize a functional, called the Hamiltonian. It leads to explicit expressions for the optimal controls in certain cases. The stochastic maximum principle can be applied to situations where state processes involve random coefficients and state constraints. Recently, there are some works applying the stochastic maximum principle in finance (see Cadenillas and Karatzas [4], see references therein).
When Brownian motion is the unique noise source, Peng [28] initially studied a general stochastic maximum principle using the second-order adjoint equations, which allow the control to enter into both drift and diffusion coefficients under a nonconvex control domain. For more details about the stochastic maximum principle theory, interested readers may refer to Yong and Zhou [9]. Forward-backward stochastic control systems driven by FBSDEs are widely applied in mathematical economics and mathematical finance. They mainly emerge in stochastic recursive utility optimization problems and principal-agent problems. Pardoux and Peng [25] proved the well-posedness for nonlinear BSDE. Duffie and Epstein [7] introduced the notion of recursive utilities in continuous-time situation, which is actually a type of backward stochastic differential equation (BSDE, for short) where the generator is independent of . El Karoui et al [19] extended the recursive utility to the case where contains . The term can be interpreted as an ambiguity aversion term in the market (see Chen and Epstein 2002, [2]). In particular, the celebrated Black-Scholes formula indeed provides an effective way of representing the option price (which is the solution to a kind of linear BSDE) through the solution to the Black-Scholes equation (parabolic partial differential equation actually). Since then, BSDE has been extensively studied and employed in the areas of applied probability and optimal stochastic controls, particularly in financial engineering (see [19]).
Peng [28] derived necessary conditions for optimal control for the partially coupling case when the control domain is convex. Later, Xu [42] studied the nonconvex control domain case and obtained the corresponding necessary conditions. In the above work, however, assume that the diffusion term in the forward control system does not include the control variable. Ji and Zhou applied the Ekeland variational principle to establish a maximum principle for a partially coupled forward-backward stochastic control system, while the forward state variable is constrained in a convex set at the terminal time. Meng [24] and Wu [41] obtained the necessary conditions for optimal control of fully coupled forward-backward stochastic control systems when the control domain is convex. Shi and Wu [31] studied the nonconvex control domain case and obtained the corresponding necessary conditions under some -monotonic assumptions. However, the control variable is not involved in the diffusion term of the forward equation.
In order to study the backward linear-quadratic optimal control problem, Kohlmann and Zhou [20], Lim and Zhou [22] developed a kind of new method to handle the problem. The variable is regarded as a control process and the terminal condition as a constraint, and then it is possible to apply the Ekeland variational principle to obtain the maximum principle. Adopting this idea, Yong [44] and Wu [40] independently established the maximum principle for the recursive stochastic optimal control problem. Nonetheless, the maximum principle derived by these method involves two unknown parameters. Hence, some hard questions naturally emerge as follows: What is the second-order variational equation for the BSDE? How to derive the second-order adjoint equation since the quadratic form with respect to the variation of . All of which seem to be extremely complicated. Hu [14] overcame the above difficulties by introducing two new adjoint equations. Then, the second-order variational equation for BSDE and the maximum principle are obtained. The main difference of his variational equations compared with those in Peng [37] consists in the term in the variation of . Due to the term in the variation of , Hu developed a global maximum principle, which is novel and different from those in Wu [40], Yong [44] and other previous works, to solve Peng’s open problem. Hu, Ji and Xue further considered the maximum principle for fully coupled FBSDEs (see [15, 16]).
Inspired by above works, we are more interested in studying optimal control for FBSDE of Markov-regime diffusion type. Compared with the literature mentioned above, the novelty of the formulation and the contribution in this paper should be stated as follows: This work states the necessary condition for stochastic utilities, the maximum principle of ours is significantly different from the existing results obtained in [8, 45, 32] which describe the sufficient condition. In particular, Sun et al [33] considered the forward-backward controlled systems and established a sufficient condition for optimal control. This paper fills a much-needed gap for the necessary part. Due to the presence of regime-switching, the second-order solution enters in the generator of BSDE which is new. Tao and Wu [34] also consider optimal control for FBSDEs modulated by continuous-time and finite-state Markov chains with convex control domain (see [23, 26]). Recently, a similar work [49] has been done by Zhang et al for forward mean field system with jump, but not containing the generator. In this paper, we adopt the idea from Peng [37] within our own framework using the Riesz Representation Theorem to interpret the dual relationship between the adjoint and variational equation, which is much clearer and more readable. Besides, we prove a general estimate for BSDE (see Lemma 4). Then, we can easily derive some delicate estimates for backward variational equation which is important to establish the variational inequality.
The rest of this paper is organized as follows: After some preliminaries and notation in the second section, we are devoted the third section to studying the classical forward stochastic control problems adopting the form, similar to [37]. Based on the previous estimates, we establish a general maximum principle for FBSDEs optimal control problem. Finally, in Section 5, we conclude the novelty of this paper and schedule possible generalizations in future. A proof of technique lemma is put in the Appendix 5.
2 Preliminaries
Throughout this paper, we denote by the space of -dimensional Euclidean space, by the space the matrices with order . Let be a completed filtered probability space. For a fixed finite time horizon, is an -valued standard Brownian motion defined on the space.
To model the controlled state process, we first introduce a set of Markov jump martingales associated with the chain . We adopt the notation from Elliott et al [11]. The state space of the chain with a finite state space is denoted by , where , , and the -th component of is the Kronecker delta for each . The state space is called a canonical state space. We suppose that the chain is homogeneous and irreducible. To specify statistical or probabilistic properties of the chain , we define the generator of the chain under . This is also called the rate matrix, or the -matrix. For each , is the constant transition intensity of the chain from state to state at time . Note that for an , so . In general, for each with , we suppose that , which means that . From Elliott et al [11], we have the following semimartingale expression for :
[TABLE]
where is an -valued, -martingale. For each , with , let be the number of jumps from state to state up to time . Then
[TABLE]
where
[TABLE]
is an -martingale. The ’s are called the basic martingales associated with the chain . For each fixed , let be the number of jumps into state up to time . Thus,
[TABLE]
Set . For each , is also an -martingale. For convenience, define
[TABLE]
Then, for every , is also an -martingale.
Note that appearing in this paper as superscript denotes the transpose of a matrix. Let be a compact subset of . In what follows, represents a generic constant, which can be different from line to line.
The state process is governed by FBSDEs of the following type:
[TABLE]
for all , -a.s., with , on some filtered probability space , where
[TABLE]
are given deterministic functions. Denote as the admissible control set, where
[TABLE]
Our problem is to minimize the following cost functional over
[TABLE]
Our stochastic recursive optimal control problem is to find an optimal control such that .
We now introduce the following spaces of processes:
[TABLE]
Set . Assume that
(A1)
The coefficients and satisfy the Lipschitz condition for , and satisfies the Lipschitz condition as follows:
[TABLE]
, for every . Moreover,
[TABLE]
for all , where .
(A2)
Maps , , is twice continuously differential, with the (partial) derivatives up to the second order being uniformly bounded, Lipschitz continuous in , and continuous in and . The map is twice differentiable with the derivatives up to the second order being uniformly bounded and uniformly Lipschitz continuous in .
Clearly, under Assumptions (A1)-(A2), FBSDEs (1) and (2) admit a unique adapted strong solution (see [10]). Due to the influence of Markov regime switching, we need an extension of Itô’s formula with Markov regime switching in the sequel.
Lemma 1**.**
Assume that an -dimensional process driven by
[TABLE]
and the function for every . Then
[TABLE]
where denotes the -th columns of the matrix .
Lemma 2**.**
Let
[TABLE]
be one-dimensional regime switching equations. Then
[TABLE]
Proof. Applying Itô’s formula to , we have
[TABLE]
which implies the desired result.
Remark 3**.**
The process
[TABLE]
is called the quadratic covariance of and .
We now present a technical result which will be useful in the next section. Its proof can be found in Appendix 5.
Lemma 4**.**
Consider the following linear BSDE with Makov regime switching:
[TABLE]
where with , , and are -adapted, and
[TABLE]
for some . Then there exists a positive constant such that
[TABLE]
We outline the proof for the convenience of the reader, which can be found in the Appendix 5.
3 Classical optimal control problems
The aim of this section is to derive a kind of variational equation (first and second-order) and corresponding variational inequality. Observe that the control variable enters in , and the control domain is non-convex, both the convex perturbation and the first-order expansion approach fail. In contrast to Peng [37], one needs to find new adjoint equations because of presence of Markov chains. We now introduce the spike variation with respect to optimal control , more precisely, let and be a Borel set with Borel measure , defined as follows:
[TABLE]
where is an arbitrarily fixed time, is a sufficiently small constant, and is an arbitrary -valued -measurable random variable such that . Let be the trajectory of the control system (1) corresponding to the control .
The cost functional is
[TABLE]
where , . Our problem is to find an optimal control such that
[TABLE]
In what follows, we add the following additional assumption:
(A3)
The map and are twice differentiable with the derivatives up to the second order being uniformly bounded and uniformly Lipschitz continuous in , .
We shall derive the variational inequality from the following fact:
[TABLE]
Next, fixed , for , we define
[TABLE]
Let and be respectively solutions to the following stochastic differential equations:
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
Remark 5**.**
Equations (8) and (9) are usually called the first and second-order variational equations, respectively.
Define the Hamiltonian function as follows:
[TABLE]
where .
The following result proceeds the Taylor expansion of the state with respect to the control perturbation. We need the following estimations:
Lemma 6**.**
Let (A1)-(A2) hold. Then, we have
[TABLE]
Proof.
Let , . Then, we have
[TABLE]
where
[TABLE]
By the classical method, it is easy to verify and for Taking the expectation after squaring both sides of (18) and using Gronwall inequality, we have
[TABLE]
Similarly, we can prove (14). We now deal with (15). In fact,
[TABLE]
To prove (16), let
[TABLE]
Then, it follows from (18) and (8) that we obtain
[TABLE]
Therefore,
[TABLE]
We consider the following term
[TABLE]
Similarly, we have
[TABLE]
and
[TABLE]
Hence, we get (16). Set
[TABLE]
Then, we have
[TABLE]
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
One can check that
[TABLE]
Recall
[TABLE]
From (19), we have
[TABLE]
It is easy to check that
[TABLE]
By virtue of Gronwall inequality, Itô’s formula and (20), we complete the proof.
Lemma 7**.**
Let (A1)-(A3) hold. Then
[TABLE]
Proof. From the fact that is optimal triple, we have
[TABLE]
By Lemma 6, we get
[TABLE]
By Lemma 6, we complete the proof.
Next, we shall find the first and second-order adjoint equations. Using these processes, we are able to derive the variational inequality from(21).
Consider the following stochastic system:
[TABLE]
where . We construct a linear functional on the Hilbert space as follows:
[TABLE]
where and are defined in(22). Then, by virtue of the Riesz Representation Theorem, there exists a unique solution
[TABLE]
such that
[TABLE]
for . We now consider equations (8) and (9) as the solution to (22), respectively. Then, we have
[TABLE]
and
[TABLE]
Then, using Assumption (A1) and Lemma 6, we can re-write the expression (21) as
[TABLE]
Next, we focus on the quadratic terms in (23) by employing the Riesz Representation Theorem. Applying Itô’s formula to on yields
[TABLE]
Consider the following symmetric matrix-valued linear stochastic differential equations:
[TABLE]
where .
Define a linear functional based on (25):
[TABLE]
Obviously, is a linear continuous functional on . There exists a unique solution , such that
[TABLE]
Now let
[TABLE]
Then, the relation (23) can be expressed as
[TABLE]
From the definition of and Lemma 6, we have
[TABLE]
where
[TABLE]
Moreover, we adopt the scalar product: tr, , . It therefore follows that for all and , we have tr.
Remark 8**.**
The inequality (28) is called the variational inequality of our optimal control problem. In contrast to the classical work by Peng [37], the appearance of extra term with is due to the random environments.
Next we shall find the second-order adjoint equation. The first one has been given in Zhang et al [45] like:
[TABLE]
From (26) and (28), we can find the second-order adjoint equation is
[TABLE]
where is defined in (12). Now define an -function:
[TABLE]
Inequality (28) can be expressed as
[TABLE]
Theorem 9**.**
Suppose that Assumptions (A1)-(A3) are in force. If is a solution to optimal control problem(7). Then
[TABLE]
which are, respectively, unique solutions to equations (29)-(30) such that the variational inequality (28) holds.
Proof. From (28) and discussion above, we complete the proof.
We now establish the relationship between MP and DPP. To this end, we consider the Markovian (feedback) control, namely, the control of the form .
Define
[TABLE]
Let us define the value function as follows:
[TABLE]
for every .
Employing the classical dynamic programming principle (see Fleming and Soner [12]), we are able to attain the following Hamilton-Jacobi-Bellman (HJB, for short) equation with terminal boundary condition:
[TABLE]
where
[TABLE]
Put
[TABLE]
for .
Remark 10**.**
Clearly, because of the presence of regime-switching, (33) is a coupled system of nonlinear differential equations. Meanwhile, it is impossible for us to obtain the explicit form of the value function and an optimal control by solving (33). The main reason comes from the fact that (33) is a coupled system of nonlinear second-order partial differential equations, adding extreme difficulty in finding a closed-form solution of (33). Recall in lieu of a single nonlinear differential equation in the traditional literature for optimal control problems, we need more powerful tools and analysis to deal with it.
As a matter of fact, one should seek an optimal control and associated state trajectories. Normally, the main motivation for studying the dynamic principle programming is that one might construct an optimal control through the value function. The following result gives a way of testing whether a given admissible pair is optimal and, more importantly, suggests how to construct an optimal feedback control. Such a result is called a verification theorem. For smooth case, we have a following result.
Theorem 11** (Verification theorem).**
Assume that (A1)-(A3) hold. Let for each be a solution of HJB equation (33). Then
[TABLE]
Furthermore, an admissible pair is an optimal control for(32) if and only if
[TABLE]
Proof. Fix . Let with its associated state trajectory . Applying Itô’ formula to on , we have
[TABLE]
which implies (34). Applying (36) to , together with (35), yields the desired result.
Remark 12**.**
As claimed in Theorem 11, the value function requires to be smooth enough. As we have well known, the value function 32 is not necessarily smooth. Thus, one perhaps introduce the viscosity solution to study our control problems. As this complete remark of the existence is much longer than the present paper, it will be reported elsewhere.
Let us now provide a substantial example to demonstrate how to construct an optimal control by Theorem 9.
Example 13**.**
Consider the following control system :
[TABLE]
where , with the control domain being and the cost functional being
[TABLE]
Let be optimal pair to be determined. The first and second-order adjoint equations are
[TABLE]
and
[TABLE]
According to the existence and uniqueness of BSDE (40), the unique adapted the solution is , . The corresponding -function is
[TABLE]
The function attains its minimum at
[TABLE]
Particularly, set , . Then
[TABLE]
Hence the necessary condition of optimality as specified by the stochastic maximum principle (Theorem 9) would be satisfied if one could find a control such that the corresponding , . By the first-order adjoint equation (39), it is clear that if we take with the corresponding state , then the unique solution of (39) is .
4 Optimal control for recursive utilities
Based on the results obtained in Section 3, we study the optimal control problem for stochastic recursive utilities for systems composed of SDE (1) and BSDE (2). The cost functional now is defined in (3). The control problem is to minimize over . For simplicity, we assume that , . Let be the optimal control and let be the corresponding solution of the equations (1) and (2). Similarly, we define for .
We shall give the variational equation for BSDE (2). For this aim, we consider the following two adjoint equations:
[TABLE]
and
[TABLE]
where and are adapted processes with suitable properties, will be determined later.
Applying Itô’s formula to yields
[TABLE]
where
[TABLE]
From Lemma 6, we derive that
[TABLE]
and
[TABLE]
Let
[TABLE]
Then, a simple calculation yields
[TABLE]
Now let
[TABLE]
It is easy to show that
[TABLE]
Next, we deal with
[TABLE]
where
[TABLE]
We shall construct a linear BSDE with solution via selecting certain suitable , . The following conditions must be satisfied:
- •
and are determined by ;
- •
We must have
[TABLE]
where is independent on and .
Applying Taylor’s expansion to
[TABLE]
we are able to find
[TABLE]
where and similarly for , , , and . By Assumptions (A1) and(A2), we get that the adjoint equations (41) and (42) have unique solutions and , respectively, and
[TABLE]
We introduce the following BSDE:
[TABLE]
Next, we shall prove the following estimates:
Lemma 14**.**
Assume that (A1)-(A2)
[TABLE]
[TABLE]
[TABLE]
Proof.
We first prove (45). We reformulate BSDE (43) as follows:
[TABLE]
Clearly,
[TABLE]
First, we consider
[TABLE]
where
[TABLE]
for , , , respectively. Next, we deal with the following term
[TABLE]
where
[TABLE]
It therefore follows that
[TABLE]
where
[TABLE]
It follows from Lemma 4 and Lemma 6 that we obtain (45). By the same argument, we can easily get (46). Next we handle (47). To this end, we set
[TABLE]
Subtracting (44) from BSDE (48) yields
[TABLE]
Applying Lemma 4, we only need to prove that
[TABLE]
[TABLE]
and
[TABLE]
Obviously,
[TABLE]
One needs to check that
[TABLE]
and
[TABLE]
From (4)-(55), we prove (50). Due to is bounded the definition of , we have
[TABLE]
which yields
[TABLE]
It is fairly easy to get that
[TABLE]
One can check that
[TABLE]
and
[TABLE]
Therefore, (52) holds. Finally, from Lemma 4 we complete the proof.
Now we are able to show the variational equations for BSDE (2):
[TABLE]
Similar to and , we have
[TABLE]
and
[TABLE]
Now let us come back to discuss about the maximum principle for optimal control of FBSDEs (1)-(2). From the definition of performance functional (3), it follows
[TABLE]
Define the adjoint equation for (2) ():
[TABLE]
Clearly, SDE (60) admits a unique strong solution. By virtue of Itô’s formula to , we can obtain
[TABLE]
In order to state our main result, we define a function as follows:
[TABLE]
We now assert the following main result.
Theorem 15**.**
Assume that (A1)-(A2) hold. Let be an optimal control and , be the associated solution. Then
[TABLE]
where is defined in (61).
We also provide a concrete example.
Example 16**.**
Consider the following FBSDEs with ,
[TABLE]
where and is a -valued deterministic process, . Clearly, the solutions to equations (41) and (42) are , , respectively. Therefore, from Theorem 15, the necessary condition for optimal control is
[TABLE]
For instance, let . Then, we immediately derive that is an optimal control. The corresponding trajectories are .
5 Concluding remarks
In this paper, we study a general stochastic maximum principle for optimal control for systems governed by a continuous-time Markov regime-switching stochastic recursive utility. The control domain is assumed not to be convex, and the diffusion terms allow to contain control variables. On the one hand, we derive a result for forward stochastic optimal control problems. Afterwards, based on previous estimates and inspirations, we introduce two groups of new first and second-order adjoint equations. Moreover, the corresponding variational equations for forward-backward stochastic differential equations are obtained. Finally, we present our results using an illustrated example. It is necessary to point out that, due to the Markov regime-switching, the generator in the maximum principle involves solutions of the novel second-order adjoint equations. However, a number of other issues deserve further investigations as follows:
- •
Apart from Pontryagin’s maximum principle, the Bellman’s dynamic programming principle (DPP for short) is another important tool in tackling stochastic optimal control problems. Nowadays, many researchers engage in this field and achieve fundamental result. As for the DPP, systematic investigations of the classical stochastic optimal control problems are discussed deeply in the famous book by Fleming and Soner [12]. Now return to the expression of HJB equation (33). It vividly indicates that it is a coupled system of nonlinear differential equations because of the presence of regime-switching, in contrast to a single nonlinear differential equation in the traditional literature such as the user’s guide by Crandall et al [3]. Out of question, it is conceivable that the fruitful analysis will be much more involved than that in [3]. Nevertheless, we are not able to provide the explicit form of the value function and an optimal control via solving (33). The main obstacle appears since it is a coupled system of nonlinear second-order partial differential equations, rendering extreme difficulty in constructing a closed-form solution of (33). In future, we shall employ the viscosity solution to study this problem(see [39, 46] in this direction).
- •
An interesting topic is to consider the fully coupled FBSDEs. Indeed, some financial optimization problems for large investors and some asset pricing problems with forward-backward differential utility (see [5, 6, 30] directly lead to fully coupled FBSDEs. Using the ideas from Peng and Wu [38], under certain -monotonicity conditions for the coefficients, we may derive the existence and uniqueness of fully coupled FBSDEs with Markov regime-switching. And then, a series of questions turn out, for instance, the associated partial differential equations and control problems. Another problem of great interests is to consider the case when the random environment or the Markov regime-switching is unobservable.
Appendix A The Proof of Lemma 4
Proof of Lemma 4.
We first assume that are bounded. Let and define
[TABLE]
Clearly for any , the function is smooth and as . The motivation of introducing such a function is to avoid some difficulties that might be encountered in differentiating functions like for non-integer . Applying Itô’s formula to on , we have
[TABLE]
Here is independent of . Applying the well-known Young’s inequality ( with ), we get
[TABLE]
Let be large enough such that . It immediately yields
[TABLE]
Similarly, let be large enough such that . Then, we have
[TABLE]
Hence, it follows from Gronwall’s inequality that
[TABLE]
and
[TABLE]
Note that . Besides since we postulate that at the beginning being that are bounded, the above procedure becomes valid (otherwise the integration on the right-hand side of (63) may not exist; see (5). Next, we want to refine the above estimate so that(6) will follow. To this end, observe that (65) implies that its left-hand side is bounded uniformly in . Thus, it is allows us to define
[TABLE]
We now come back (64), using (67). Define , for small enough. Then, for any , we have from Young’s inequality, Hölder inequality and (66)
[TABLE]
The constant in (68) is independent of . Hence, it follows from (68) that
[TABLE]
Now we repeat the same procedure on and on , and so on. Eventually, we end up with
[TABLE]
with . According to the definition of (67), we conclude that
[TABLE]
Letting , we get the desired result. In the case that we only have(5), we may use the usual approach of approximation. We thus complete the proof.
**Acknowledgements. **The author wish to thank Dr. Shuai Jing and Dr. Zhongyang Sun for his conversation and suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. M. Bismut, An introductory approach to duality in optimal stochastic control. SIAM Rev. , 20 (1978), 62-78.
- 2[2] Z. Chen and L Epstein, Ambiguity, risk, and asset returns in continuous time. Econometrica , 70 (2002), 1403-1443.
- 3[3] M. G. Crandall, H. Ishii, and P.-L. Lions, User’s guide to viscosity solutions of second order partial differential equations, Bull. Amer. Math. Soc. (N.S.), 27 (1992), 1-67.
- 4[4] A. Cadenillas and I. Karatzas, The stochastic maximum principle for linear, convex optimal control with random coefficients, SIAM J. Control Optim ., 33 (1995), 590-624.
- 5[5] J. Cvitanic and J. Ma, Hedging options for a large investor and forward-backward SDE’s, The Annals of Applied Probability , 6 (1996), 370-398.
- 6[6] D. Cuoco and J. Cvitanic, Optimal consumption choices for a “large” investor, Journal of Economic Dynamics & Control , 22 (1998), 401-436.
- 7[7] D. Duffie and L. G. Epstein, Stochastic differential utility. Econometrica . 60(1992) 353-394.
- 8[8] C. Donnelly, Sufficient stochastic maximum principle in a regime-switching diffusion model. Appl. Math. Optim. , 64 (2011), 155-169.
