On a switching control problem with c\`adl\`ag costs
Said Hamad\`ene, H\'ector Jasso-Fuentes, Yamid A. Osorio-Agudelo

TL;DR
This paper investigates a switching control problem with discontinuous, cdlg costs, providing characterizations of the optimal cost, existence of optimal policies, and connections to backward stochastic differential equations and PDE systems.
Contribution
It introduces new characterizations of the optimal cost function and establishes the existence of optimal control policies for switching problems with discontinuous costs.
Findings
Characterization of the optimal cost function
Existence of mbda-optimal control policies
Connection to backward stochastic differential equations and PDEs
Abstract
This work addresses a switching control problem under which the cost associated with the changes of regimes is allowed to have discontinuities in time. Our main contribution is to show several characterizations of the optimal cost function as well as the existence of "-optimal control policies. As a by-product, we also study the existence and uniqueness of solutions of a system of backward stochastic differential equations whose barriers (or obstacles) are discontinuous (in fact of c\`adl\`ag type) and depend itself on the unknown solution. At the last part of the paper, we study the case when an underlying diffusion is part of the dynamic of the system. In this special case, the optimal payoff becomes a weak solution of the HJB system of PDEs with obstacles which is of quasi-variational type. This paper is somehow a continuation of the papers [8, 17] that consider continuous costs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On a switching control problem with càdlàg costs
Said Hamadène111Département de Mathématiques, Equipe Statistique, et Processus, Université du Maine, Avenue Olivier Messiaen, 72085 Le Mans, Cedex 9, France [email protected].
Héctor Jasso-Fuentes222Departamento de Matemáticas. CINVESTAV-IPN. A. Postal 14-740, Ciudad de México, 07000, México. {hjasso, yaosorio}@math.cinvestav.mx. 333Corresponding author.
Yamid A. Osorio-Agudelo222Departamento de Matemáticas. CINVESTAV-IPN. A. Postal 14-740, Ciudad de México, 07000, México. {hjasso, yaosorio}@math.cinvestav.mx.
Abstract
This work addresses a switching control problem under which the cost associated with the changes of regimes is allowed to have discontinuities in time. Our main contribution is to show several characterizations of the optimal cost function as well as the existence of -optimal control policies. As a by-product, we also study the existence and uniqueness of solutions of a system of backward stochastic differential equations whose barriers (or obstacles) are discontinuous (in fact of càdlàg type) and depend itself on the unknown solution. At the last part of the paper, we study the case when an underlying diffusion is part of the dynamic of the system. In this special case, the optimal payoff becomes a weak solution of the HJB system of PDEs with obstacles which is of quasi-variational type. This paper is somehow a continuation of the papers [8, 17] that consider continuous costs.
2010 Mathematics Subject Classification: 60G40, 93E20, 62P20, 91B99
Keywords and phrases: Switching control, -optimal strategies, backward stochastic differential equations, viscosity solutions.
1 Introduction
Among the family of optimal control problems we can highlight those whose control is applied on the discontinuities to the dynamic. A special type of these problems is the so-called optimal multiple switching problems consisting in configuring the state of system according to doing changes of regimes (a.k.a. configurations) allowed for the controller. The times on which these changes are triggered are also part of the control, so the controller needs to apply a sequence, say such that at time , he/she changes the state from the regime to , . The objective for him/her is to find an optimal sequence like the one above that maximizes a certain total payoff.
This class of problems has been studied in the literature by several authors. For instance, Carmona and Ludkovski [5] study this kind of problems in order to find management optimal strategies with the purpose to release a power plant that converts natural gas into electricity and hence to sell this commodity in the market. Doucet and Ristic [9] apply the switching control theory to problems of target tracking that are commonly used in aerospace and electronic systems. Trigeorgis [28, 29] relates this type of problems to real option theory. Perhaps the most studied switching control problem is when only two-modes are considered. Several authors have put attention on this type of problems (see e.g., Brekke and Oksendal [3, 4], Hamadène and Jeanblanc [16], Duckworth and Zervos [10], among others).
During the last decade, the switching control problem has been extensively studied by several authors including [5, 6, 8, 16, 18, 19, 27], etc. (see also the references therein).
However all the aforementioned papers consider the cases where the switching costs are continuous. To the best of our knowledge the case where the switching costs are discontinuous has not been considered yet. This is the main objective of this work.
In this paper, as for the continuous switching costs, we show that the optimal payoffs are given by either a solution of a system of reflected BSDEs with obstacles depending on the solution or equivalently a system of processes expressed through their corresponding Snell envelopes. This solution is discontinuous in time. On the other hand, while an optimal strategy may not exist, a nearly optimal strategy of switching always exists. Finally in the Markovian framework of randomness, the previous system provides a viscosity solution in weak sense of the Hamilton-Jacobi-Bellman system of PDEs associated with the switching problem. This paper is somehow the extension of the references Djehiche et. al. [8] and Hamadène and Morlais [17] when the switching costs are of càdlàg type.
The rest of the paper is organized as follows: After this introductory part, in section 2 we introduce our switching problem and provide a verification theorem that is very common in control theory. In section 3, we present the existence and uniqueness of the solution for the system of RBSDEs with interconnected càdlàg obstacles and whose coefficients (drift and obstacles) depend on the unknown solution. Finally, in Section 4 we show that under a Markovian framework i.e. the dynamic of the system is also governed by a underlying diffusion process, our unique solution, obtained in section 3, provides a weak viscosity solution for a system of a quasi-variational inequality with interconnected obstacles.
1.1 Notation and terminology
Let be a fixed probability space and a -dimensional Brownian motion with completed natural filtration thus it satisfies the usual conditions, i.e., it is right continuous and complete. Associated to , we denote by its respective expectation.
Next let us consider the following elements:
- •
will denote the Euclidean norm in , for some appropriate .
- •
Given , is the set of random variables , -measurable and such that \mathbb{E}\big{[}\left|\xi\right|^{2}\big{]}<\infty.
- •
denotes the -algebra on of -progressively measurable sets.
- •
denotes the set of -measurable processes with values in such that \left\|w\right\|_{\mathcal{H}^{2,l}}:=\mathbb{E}\big{[}\int^{T}_{0}\left|w_{s}\right|^{2}ds\big{]}^{\frac{1}{2}}<\infty. If , then we will simply write .
- •
stands for the set of -measurable, càdlàg, -valued processes such that \left\|w\right\|_{\mathscr{S}^{2}}:=\{\mathbb{E}\big{[}\sup_{t\leq T}\left|w_{t}\right|^{2}\big{]}\}^{\frac{1}{2}}<\infty.
- •
A random variable defined on and valued in is called a stopping time with respect to the filtration , or simply an -stopping time, if for all .
- •
denotes the set of indexes so-called set of configurations, while the notation means .
- •
The notation and denote the Hessian matrix and the gradient vector of the function , respectively.
2 Model definition and preliminary results
Consider the stochastic processes , , and , and , together with a sequence
[TABLE]
of non-decreasing -stopping times , and random variables which are -measurable with values in , such that , for some .
Together with these elements, define the functional as follows:
[TABLE]
with , when ; and the same reasoning applies to , i.e., if and .
Definition 2.1**.**
A sequence defined as in (2.1) is called a strategy or switching control policy for the controller. Furthermore, we say that a strategy is admissible if it satisfies the following condition:
[TABLE]
For each , denote by the set of admissible strategies with the property of , .
The processes and are usually called the payoff rate per unit of the time and the switching cost, respectively. We will impose a condition to the processes , , that will be considered throughout this paper
Assumption A**.**
There exists a constant such that the processes -a.s.
A finite horizon switching control problem with -modes and initial configuration for , consists in finding an admissible sequence such that
[TABLE]
where is the functional defined in (2.2).
There is also a weaker formulation of what we understand for optimal strategy, namely, we say that is -optimal strategy if for all , we have
[TABLE]
We first provide an existence result of -interconnected processes, which will be useful later on.
Theorem 2.2**.**
Consider processes , and processes , , . Then, under Assumption (A), there exist valued càdlàg processes satisfying:
[TABLE]
Proof. For , and any , use the sequence defined by:
[TABLE]
and for ,
[TABLE]
First note that the process is continuous for all . Next since the process is càdlàg, is also a càdlàg process, and thus by an induction procedure we have that for all , is càdlàg too.
Let us prove now that, for , the sequence converges increasingly and pointwisely -a.s. for any and in the norm to a càdlàg process . To begin with, for any let us define , and let us prove that for fixed, can be characterized by
[TABLE]
Since the processes for are càdlàg, it is not obvious to use the same procedure as given in Djehiche. et al. [8] Proposition 3-(ii). In contrast, we shall consider the sequence of -stopping times given by follows: ,
[TABLE]
and for ,
[TABLE]
where
- •
,
and for ,
- •
Note that by (2.5) the process is a super-martingale. Hence, if its Doob-Meyer decomposition is given by (recall that is a martingale and a non-decreasing process), then by definition of , we have that for , i.e., is a martingale. Therefore,
[TABLE]
Analogously, taking:
[TABLE]
we have again that is a martingale. Arguing similarly as above, we have
[TABLE]
Plugging (2.8) into (2.7), rearranging terms and since that , we see that
[TABLE]
Repeating this procedure times, we obtain
[TABLE]
But
[TABLE]
Plugging (2.10) into (2.9), and noting that , we deduce
[TABLE]
Since belongs to , we can take essential supremum over and then sending to obtain
[TABLE]
Now we derive the inverse inequality. Let be an arbitrary strategy. Since , -a.s., and , then from (2.5) we have
[TABLE]
In the same way, since and is also - measurable, then
[TABLE]
Plugging this last inequality into (2.12), rearranging terms and using that , we see that
[TABLE]
Continuing this procedure, we have
[TABLE]
But again, since Y^{\xi_{N},0}_{\tau_{N}}=\mathbb{E}\big{[}\int^{T}_{\tau_{N}}\psi_{\xi_{n}}(s)ds\big{|}\mathcal{F}_{\tau_{N}}\big{]}, we get
[TABLE]
Thus, taking the essential supremum on , we get
[TABLE]
This last inequality together with (2.11), yield the characterization (2.6).
Since , we have , -a.s. for all . On the other hand, by Assumption (A), we obtain for each ,
[TABLE]
and hence the sequence is convergent. We now let for . Note that the process satisfies
[TABLE]
Besides, is also càdlàg process. Indeed, from (2.5) the process is a càdlàg super-martingale for all and . Thus its limit process is càdlàg as a limit of increasing sequence of càdlàg super-martingales (see Dellacherie and Meyer [[7], p. 86]), which gives the desired càdlàg property of . Moreover, from (2.13), the -properties of and by Doob’s maximal inequality, for each , we have
[TABLE]
and hence by the Lebesgue dominated convergence theorem the sequence converges to in . Thus, by Snell envelope properties (see Proposition 2-(iv) in Djehiche, et al. [8]), the càdlàg processes satisfy (2.4) since they are limits of the increasing sequence of processes , for , satisfying (2.6).
Let us show now some properties of the -strategy introduced in Theorem 2.2.
Proposition 2.3**.**
The -strategy defined as follows:
- •
, \tau^{\varepsilon}_{1}:=\inf\Big{\{}s\geq 0:Y^{i}_{s}\leq\max\limits_{k\in\mathcal{I}^{-i}}\left(Y_{s}^{k}-g_{ik}\left(s\right)\right)+\frac{\varepsilon}{2}\Big{\}}\wedge T
and, for ,
- •
\tau^{\varepsilon}_{n}:=\inf\Big{\{}s\geq\tau^{\varepsilon}_{n-1}:Y^{\xi^{\varepsilon}_{n-1}}_{s}\leq\max\limits_{k\in\mathcal{I}^{-\xi^{\varepsilon}_{n-1}}}\left(Y_{s}^{k}-g_{\xi^{\varepsilon}_{n-1}k}(s)\right)+\frac{\varepsilon}{2^{n}}\Big{\}}\wedge T**
and the sequence given by
- •
,
and for ,
- •
**
is admissible.
Proof. Suppose for contradiction that is not admissible, that is, . Then, by definition of we have
[TABLE]
If the event has positive probability, then there is a state and a loop (with ) of elements of (recall that is a finite set), and subsequence corresponding of this configuration such that
[TABLE]
Since is monotone and bounded, then we can define . Taking the limit with respect to in (2.14), we obtain
[TABLE]
But it is easy to verify that
[TABLE]
then from (2.15) we have
[TABLE]
Since -a.s., we have a contradiction. Therefore, is admissible.
Our next result has to do with a so-called verification theorem for the switching problem (2.3) in the context of càdlàg cost functions
Theorem 2.4**.**
The -processes in Theorem 2.2 are unique and they have the following relation with the switching problem (2.3):
- (i)
For each ,
[TABLE] 2. (ii)
The -strategy defined in Proposition 2.3 forms an -optimal strategy, i.e., for ,
[TABLE]
Proof.
- (i)
Assuming that at time the system is in mode , it follows by (2.4) that, for any ,
[TABLE]
Since is -measurable, it is a -a.s. constant, that is, . Now take defined in Proposition 2.3. Arguing similarly to Theorem 2.2, we can deduce
[TABLE]
The rest of the proof uses the same arguments as in the proof of Theorem 2.2. Namely, for every , we can deduce
[TABLE]
Then, from the definition of and since is a martingale, we get
[TABLE]
Plugging (2.19) into (2.18) and noting that is -measurable, it follows that:
[TABLE]
since . Repeating this procedure times, we obtain
[TABLE]
Taking liminf as we obtain
[TABLE]
By Proposition 2.3 we can take supremum over all admissible strategies , to obtain
[TABLE]
Letting , it follows that . The inverse inequality is analogous to the previous Theorem 2.2. Hence, the result follows. 2. (ii)
From part (i), specifically, (2.16) and inequality (2.20), we deduce
[TABLE]
which proves (ii).
3 Reflected Backward Stochastic Differential Systems
In this section we will provide the existence as well as uniqueness of the solution of the system of reflected backward stochastic differential equations (RBSDEs) of type
[TABLE]
in which the associated barriers are càdlàg processes. This system is connected with the previous switching problem. Actually when do not depend on , the system (3.1) is exactly the translation of the verification Theorem 2.2 in terms of reflected BSDEs as it is well-known that the Snell envelope can be expressed through reflected BSDEs (see e.g. El Karoui [12] or Hamadène [15]). On the other hand, this form of system (3.1) allows to consider switching problems when the cost functions are of risk sensitive type (utility functions) —see El Karoui and Hamadène [13].
To begin with our analysis, we will first introduce the following assumptions relate to the items involved in (3.1):
Assumption H**.**
- (H1)
: The stochastic process is in for any . 2. (H2)
: For any , the function satisfies:
- (i)
* is continuous uniformly with respect to ;* 2. (ii)
* is uniformly Lipschitz continuous with respect to , i.e., for some ,*
[TABLE] 3. (iii)
the mapping is Borel measurable and of polynomial growth. 4. (iv)
Monotonicity*: For all , for all , the mapping is non-decreasing whenever the other components are fixed.* 3. (H3)
: For each , the function is bounded from below, i.e. there exists a real constant such that, . Furthermore it is càdlàg in , continuous and of polynomial growth in . 4. (H4)
: For each , the function is continuous with polynomial growth and satisfies
[TABLE]
Note that in the (3.1) the process does not play a specific role. We consider this form of system (3.1) only in the perspective to deal with the Hamilton-Jacobi-Bellman system associated with the switching problem.
Proposition 3.1**.**
Under Assumptions (H), the system of RBSDEs (3.1) has a solution .
Proof. To begin with, we first consider the following standard BSDEs:
[TABLE]
and
[TABLE]
It is easy to verify that under (H) the data of (3.2) and (3.2) satisfy the conditions of Pardoux and Peng’s result [23], pp. 59-60 and then in virtue of Theorem 4.1 of this same reference, we claim the existence and uniqueness of solutions of both (3.2) and (3.3). To solve the system (3.1), we shall use an iterative method and regard (3.1) as a limit system. To this end, for any , we set , and for , we seek a triplet such that, for
[TABLE]
Note that for each the process is given. Then, by letting
[TABLE]
for , the data of the RBSDE associated with satisfy the assumptions in Hamadène [15], Theorem 1.4, and hence the processes do exist. Next, using the comparison theorem of solutions of BSDEs (see e.g. Theorem 2.2 in El Karoui et al. [14]) we deduce that for any , . Besides, as satisfies the monotonicity property (H2)-(H2)(iv) and using again the comparison of solutions of RBSDEs (see Theorem 1.5 in Hamadène [15]) we obtain by induction that:
[TABLE]
On the other hand, the process in (3.2), can be regarded as the triplet (i.e. ), which is solution for the system of RBSDEs with data
[TABLE]
Note that
[TABLE]
since satisfies the monotonicity property (H2)-(H2)(iv) and due that, for each (the fixed processes), . Therefore, by comparison Theorem 1.5 in Hamadène [15], we get that . In general, through an induction procedure, we can obtain for all and , and hence
[TABLE]
Arguing as in Theorem 2.2, we can see that there exists such that and \mathbb{E}\big{[}\sup_{0\leq t\leq T}\left|Y^{i}_{t}\right|^{2}\big{]}<\infty. Therefore, using Peng’s monotonic limit theorem (see Theorem 2.1 and Theorem 3.6 in Peng [24]), we deduce that for any , the limit process is a càdlàg process and there exists with non-decreasing process and such that: ,
[TABLE]
Now we claim that is, in fact, the desired solution of (3.1). Indeed, consider the RBSDEs at the -th variable and the other variables fixed, that is to say
[TABLE]
The solution of do exist by using again Theorem 1.4 in Hamadène [15]. Such a solution becomes the smallest -supermartingale that dominates (for more details on this last assertion, see Peng and Xu [25]). Whence . On the other hand, since for any and , we get
[TABLE]
Also observe that assumptions ((H2))-((H2)(iv)) yields that
[TABLE]
Then using again the comparison theorem for RBSDEs given in Theorem 1.5 in Hamadène [15], we have . This implies that , and hence . Moreover, this also implies that and for any , -a.s. This proves the existence of solution for (3.1).
We now provide a representation result for the solutions of system (3.1) and, as a by-product, we obtain the uniqueness. For later use, let us fix in and let us consider the following system of RBSDEs:
[TABLE]
Observe that does not depend on . Let be fixed, and let be the following set of strategies as in (2.1), such that:
[TABLE]
where , , is the following cumulative costs up to time , i.e.,
[TABLE]
Therefore and for any admissible strategy we have:
[TABLE]
Consider a strategy and let be the solution of the following BSDE
[TABLE]
with
[TABLE]
Making the change of variable , the equation in (3.8) is transformed in a standard BSDE. Since is adapted and , we easily deduce the existence and uniqueness of the process . We then have the following representation for the solution of (3.7).
Proposition 3.2**.**
Assume that for any :
- (i)
* satisfies (H2)-(H2)(ii),(H2)(iii);* 2. (ii)
* (resp. ) satisfies (H3) (resp. (H4)).*
Then the solution of system of RBSDEs (3.7) exists, it is unique and satisfies:
[TABLE]
Proof. Since does not depend on variables , then, it trivially satisfies (H2)-(H2)(iv). Then, by hypothesis (i) and (ii), and Proposition 3.1, the solution ( of the system (3.7) exists. Therefore, plugging an arbitrary strategy in (3.7), we obtain:
[TABLE]
with and as in (3.9), and,
[TABLE]
Adding from both sides of (3.11) and taking into account that , we have
[TABLE]
Therefore, we have
[TABLE]
Next let be the strategy defined recursively as follows (compare to the -strategy in Proposition 2.3): and for ,
[TABLE]
and
[TABLE]
In a similar manner as in the proof of Proposition 2.3, we can ensure that satisfies that . Let us prove now that \mathbb{E}\big{[}(\mathtt{C}^{\varepsilon}_{T})^{2}\big{]}<\infty and that is -optimal in for the problem (3.10). Following the strategy and since solves the RBSDE (3.7), it turns out that,
[TABLE]
since for . Taking now the limit with respect to in (3.14) we get:
[TABLE]
Taking supremum over all , and next letting and using the assumptions ((H4)) and ((H2))-((H2)(ii)),((H2)(iii)) satisfied for and respectively and since , and , we deduce from (3.15) that . It follows that . This last fact together with (3.13) yield (3.10). As a by-product, we obtain that the solution of (3.7) is unique.
Next for let us define
[TABLE]
where is the solution of system (3.7) which exists and is unique under the assumptions of Proposition 3.2. Note that the processes belong to . Hence, is a mapping from to .
We introduce the norm defined on by
[TABLE]
Note that , for all , implies that these norms are equivalent. For sake of completeness, we present the following result, established in Chassagneux et al. [6], which ensures that is a contraction on the Banach space .
Proposition 3.3**.**
Assume that for any the following hypotheses are in force:
- (i)
* verifies (H2)-(H2)(ii),(H2)(iii);* 2. (ii)
* (resp. ) verifies (H3) (resp. (H4)).*
Then, there exists such that the mapping is a contraction operator on . Therefore has a fixed point which belongs to and which provides a unique solution for system (3.1).
Proof. Let and consider the respective images under , and . Besides, let us introduce the following “auxiliary dominating” RBSDE, for :
[TABLE]
where , and and are the continuous and discontinuous parts of . Note that by Proposition 3.2 a unique solution exists for (3.16).
For fixed, and for any , denote by , and the respective solutions of the following one-dimensional BSDEs: ,
[TABLE]
We deduce from Proposition 3.2 that
[TABLE]
Besides, note that for an -optimal strategy , we have
[TABLE]
Using a comparison argument, we easily check that for any strategy , and hence, by (3.17) we get that . Therefore, taking into account the last two inequalities and (3.18), we get that
[TABLE]
This implies
[TABLE]
and by using the inequality , we have
[TABLE]
Now, applying Ito’s formula to , using the inequality and the fact that is Lipschitz, taking expectation, to obtain: ,
[TABLE]
The inequalities and also imply
[TABLE]
Rearranging terms, we obtain:
[TABLE]
Taking , we get
[TABLE]
Now, an analogous procedure to lead to similar result, namely
[TABLE]
Combining these two inequalities with (3.19), we deduce
[TABLE]
By integrating with respect to on both sides of the last inequality and taking into account the fact that such inequality holds true for any and for all , we get
[TABLE]
Finally, choosing and taking , we see that this mapping is a contraction. This gives the existence and uniqueness of the system of RBSDE (3.1).
4 The Markovian Framework
In this section we will provide more specifications to the process treated in previous sections. Namely, we will assume now that this process has a Markovian evolution described by means of a stochastic differential equation (diffusion process) as in (4.3) below. Under this framework our previous analysis can be reduced to study a system of partial differential equations with obstacles (quasi-variational system). Among the main result of this section we can highlight the characterization of both the optimal function (2.3) and the solution of the system of RBSDEs (3.1) as a viscosity solution in a weak sense (see Theorem 4.9). We will start to introduce the following functions:
[TABLE]
satisfying the following hypotheses:
The functions and are jointly continuous and Lipschitz continuous with respect to uniformly in , that is, there exists a constant such that for any and ,
[TABLE]
Note that continuity and (4.1) imply that and are of linear growth, i.e., there exists a constant such that:
[TABLE]
It is well known that under (4.1)-(4.2), there exists a unique Markov process , for , that is a (strong) solution of the following standard stochastic differential equation:
[TABLE]
satisfying the following estimates: For any , and
[TABLE]
for some constant (one can see Karatzas and Shreve [22] or Revuz and Yor [26], for more details).
Recall that the associated infinitesimal generator to is given by :
[TABLE]
for in ( is the trace of a square matrix and, is the transpose of a matrix ).
Now let be fixed and let be the unique solution of system (3.1) when the process is taken to be equal to of (4.3), i.e., the solution associated with ( are the switching costs and they satisfy the same assumptions as in Assumption (H)) with and .
Assume now that Assumptions (H) are satisfied. Since we are in the Markovian framework then there exist deterministic functions , , with polynomial growth such that for any
[TABLE]
Note that we also have
[TABLE]
On the other hand the polynomial growth of stems from the polynomial growths of the data assumed in Assumption (H) and the BSDEs (3.2), (3.3) as well.
Notation: For a sake of simplicity of notation, hereafter we sometimes denote by , for some generic function or vector .
Remark 4.1**.**
From now on we will assume that is non-decreasing w.r.t for any and not only w.r.t (as precised in (H2)-(H2)(iv)). This assumption is not really restrictive since by considering the system of RBSDEs verified by , we obtain new generators given by
[TABLE]
which have the same properties as . Moreover, with an appropriate choice of , those new generators are non-decreasing w.r.t for any , i.e., they fulfill the property we are requiring for (one can see Hamadène and Morlais [17], for more details on this transform).
Our main interest will be to show that the function is a solution in a weak viscosity sense for the Hamilton-Jacobi-Bellman system of PDEs associated with the switching problem. In the case when the functions and , , are continuous, this system reads as: for all ,
[TABLE]
and it is shown that is the unique viscosity solution of system (4.6). But in our framework the functions , , are no longer continuous w.r.t , therefore the definition should be adapted. We are going to show that is a viscosity solution in a weak sense for the HJB system of PDEs (4.6), associated with the swiching problem, and which we are going to define in what follows. This definition is inspired by Ishii’s works [21, 20], and also by the paper of Barles and Perthame [1].
To proceed for a locally bounded -valued function , (), we define its lower (resp. upper) semi-continuous envelope (resp. ) as follows: For any ,
[TABLE]
Note that the function (resp. ) can also be seen as the smallest usc (resp. lsc) function which is greater (resp. smaller) than . On the other hand, the following properties of the semi-continuous envelopes of functions will be useful later.
Lemma 4.2**.**
Let and , , be two locally bounded -valued functions. We then have:
- (i)
If is continuous then and . 2. (ii)
. 3. (iii)
* and .* 4. (iv)
If is continuous then and .
Proof. (i) Obviously we have and then since this latter is lsc. On the other hand and then since is lsc. This completes the proof of the claim as the other property can be obtained similarly.
Points (ii) and (iii) are rather obvious, we then leave their proofs to the care of the reader.
(iv) First note that . Next let be a sequence such that as . As is continuous then as . Therefore, by definition of the usc envelope, which completes the proof of the first claim. The proof of the other one is similar.
Next for , let us denote by the non-linearity which defines the equation in (4.6), i.e.,
[TABLE]
where
[TABLE]
Note that by Assumption (H) on and (4.1), the function is jointly continuous in its arguments. Therefore, taking into account the results of Lemma 4.2, for any , the semi-continuous envelopes of (in all arguments) are given by:
[TABLE]
and
[TABLE]
We are now ready to precise the definition of the viscosity solution of HJB system associated with the switching problem. As noticed previously it is inspired by the papers [1, 21, 20]. On the other hand, the discontinuities of the functions generated by the ones of make that the terminal condition at time is not the same as in (4.6), but should be adapted as well to this weak sense (see e.g. [2]).
Definition 4.3**.**
Let be a locally bounded function from into .
(1)
We say that is a viscosity subsolution of (4.6) if for any , and ,
(a)
* verifies the following inequality at point :*
[TABLE]
(b)
Moreover, at , the function is such that, for and any with and attaining its minimum at , we have
[TABLE]
(2)
In the same manner, is said to be a viscosity supersolution of (4.6) if for any , and ,
(a)
* verifies at the following:*
[TABLE]
(b)
Similarly, at , satisfies the next: for and any with and attaining its maximum at , we have
[TABLE]
(3)
We say that is viscosity solution of (4.6) if it is both a viscosity sub. and supersolution.
To proceed we are going to show that the functions is a viscosity solution of the system (4.6) in a weak sense, i.e., according to Definition 4.3. However we need some preliminary results which we give as lemmas hereafter. From now is the open ball of radius and center .
Lemma 4.4**.**
Under the Assumption ((H2)), the mapping
[TABLE]
is u.s.c. for any .
Proof. Let . Since is u.s.c for , then for all there exists such that for all , satisfying , we have
[TABLE]
Next, by monotonicity and Lipschitz properties of , for all we get
[TABLE]
where is the Lipschitz constant of . By continuity of with respect to and Lipschitz in (Assumptions ((H2))-((H2)(i)) and ((H2))-((H2)(ii))), the quantity inside the brackets goes to zero as . Therefore, taking a suitable , we obtain
[TABLE]
for all and for some other constant and the claim follows.
Lemma 4.5**.**
Let , and . If
[TABLE]
and
[TABLE]
then there exist and a ball such that for all we have:
[TABLE]
and
[TABLE]
Proof: By (4.11) and the continuity of there exist and a ball such that
[TABLE]
for all . Next, by the u.s.c property, there exists such that for all we have
[TABLE]
where in the last inequality we use that the usc envelope of a function is greater or equal to the function itself. Therefore, from (4.15), (4.16) and assuming , without loss of generality, that we have
[TABLE]
for all .
As for the second inequality we can do a similar procedure since is continuous and is u.s.c. Namely, there exist and such that for each we have
[TABLE]
Now, supposing, without loss of generality, that and , we have that inequalities (4.13) and (4.14) hold true for all .
Remark 4.6**.**
In a similar manner, it is possible to obtain a parallel result as in Lemmas 4.4 and 4.5 for in lieu of . Namely, it can be proved that under Assumption ((H2)) the mapping
[TABLE]
is l.s.c., and if
[TABLE]
then there exists and such that for all :
[TABLE]
The proofs are very similar as the proofs given in the aforementioned lemmas, so shall omit them.
Finally, we recall two comparison results for BSDE and RBSDE that we have borrowed from Lemma 4.1 and Proposition 4.2, in Dumitrescu et al. [11].
Lemma 4.7**.**
Fix and let be a stopping time with values in . Consider two random variables and and two drivers (a.k.a generators) , such that satisfies (H2) with Lipschitz constant . For , let be the solution in of the BSDE with associated data , and terminal time . In this case, and represent the driver and terminal condition, respectively. Suppose that for some we have
[TABLE]
Then we have , -a.s. for each .
Lemma 4.8** (A comparison result between a BSDE and a RBSDE).**
Fix and let be a stopping time on . Consider the random variable and a driver . Let be the associated BSDE solution with driver , terminal time and terminal condition . Consider also and let be a driver satisfying (H2) with Lipschitz constant . Assume the existence of the solution of the associated RBSDE with driver , terminal time and obstacle , and assume that
[TABLE]
where is a positive constant. Then, we have , , for each .
We now give the main result of this section.
Theorem 4.9**.**
The function , where for each , is defined as in (4.5), is a weak viscosity solution of the system (4.6).
Proof. Step 1: Viscosity sub-solution property on .
Let and be such that , for all and ). Without loss of generality, we can assume that the minimum of attained at is strict. We need to show that if
[TABLE]
then
[TABLE]
We proceed by contradiction; i.e. we shall assume
[TABLE]
then by Lemma 4.5 there exists and such that, for all , we have both
[TABLE]
since , and
[TABLE]
By definition of , there exists a sequence in , such that and . Now let us fix and take the associated state process defined in (4.3) and define the stopping time as
[TABLE]
Applying Itô’s lemma to , it can be seen that
[TABLE]
is the solution of the BSDE with coefficient , terminal time and terminal value . The idea is to compare this BSDE with the solution of the RBSDE with coefficient , barrier and terminal condition . Note that by definition of and inequality (4.23), we have
[TABLE]
for each , where to reach the last inequality we use that , the monotonicity property (H2)-(H2)(iv) and the Remark 4.1. It remains to compare the solution of the BSDE with the barrier of the RBSDE for . From inequality (4.22) and definition of we derive that
[TABLE]
On the other hand, to show that the inequality holds at , we recall that the minimum is strict and hence there exists such that
[TABLE]
In particular, we have
[TABLE]
Therefore, from (4.25), (4.26) and letting , we get
[TABLE]
for . Thus, by the comparison result in Lemma 4.8, we have
[TABLE]
where is a positive constant which only depends on and the Lipschitz constant of . In particular, for , we have
[TABLE]
Now, since and is continuous with , for sufficiently large we have both
[TABLE]
and
[TABLE]
whence , and hence
[TABLE]
But , then by comparison theorem for . Thus, for , we get that produces a contradiction with (4.29). Therefore (4.21) holds true and then also the viscosity subsolution property in .
Step 2: Viscosity super-solution property on .
Let and be such that and , for all . As stated above, we can suppose that the maximum is strict in . Since by construction , then it is easy to see that . Now, we show that
[TABLE]
Similar to the subsolution case, we shall proceed by contradiction, namely, suppose that
[TABLE]
then by Remark 4.6 there exists and such that, for all , we have
[TABLE]
Let be a sequence in such that and . We introduce the state process and define the stopping time as in (4.24). Next, we apply Itô’s formula to in order to obtain
[TABLE]
is the solution of the BSDE associated with terminal time , terminal value and driver . Then by definition of and inequality (4.30), we get
[TABLE]
for a.s., where to reach the last inequality we use the monotonicity property (H2)-(iv) and Remark 4.1 and that for . It remains to compare the terminal conditions of the BSDEs with coefficients and respectively. Since the maximum is strict, there exists (which depends on ) such that on , which implies
[TABLE]
Thus using inequality (4.31) and the comparison result for BSDEs, Lemma 4.7, we derive that
[TABLE]
and therefore, in , we have . As above mentioned, we can assume that is sufficient large so that . We thus get
[TABLE]
and hence
[TABLE]
But , then by Lemma 4.7 we get for , and thus , which is a contradiction with (4.32). Therefore the viscosity supersolution property in holds true.
Step 3: Subsolution property at .
We now show that for any ,
[TABLE]
We follow here the same idea as in Bouchard [2] (see also Theorem 1 in Hamadène and Morlais [17]). We reason by contradiction, namely, we assume that
[TABLE]
Let be a sequence in such that
[TABLE]
Since is u.s.c and of polynomial growth, we can find a sequence of functions of and neighborhood of such that , and hence from the inequality (4.33) we have
[TABLE]
for large enough. On the other hand, after possibly passing to a sub-sequence of we can assume that the previous inequality holds on for some small enough in such a way that . Since is locally bounded (recall it has polynomial growth), there exists such that on . We can then assume that on . Next we define
[TABLE]
Note that and
[TABLE]
Since as , we can choose close enough to to ensure that
[TABLE]
Next let us consider the following stopping times
[TABLE]
and
[TABLE]
where is the complement of .
First note that for a subsequence , . Actually from (4.33), we have
[TABLE]
Therefore taking into account of (4.34), at least for a subsequence, for any ,
[TABLE]
Now let us stick to this subsequence. If , then by the càdlàg property of the processes which define we have , which contradicts the previous inequality and then the claim is valid.
On the other hand the property which characterizes the jumps of in the definition (3.1), implies that on the process is continuous and for . Applying now Itô’s formula to the process and taking expectation, we obtain
[TABLE]
[TABLE]
where the last equality is due to the fact that the process , stopped at time , solves a RBSDE system of the type (3.1) with data given by , and the last inequality is obtained by monotonicity property of and since for . Besides, note that by definition of we have on . Next, we have that both and (t,x)\rightarrow\big{\|}Z^{i,t,x}_{\cdot}\big{\|}_{\mathcal{H}^{2,d}}(t,x) are of polynomial growth. Thus by Assumption (H2)-(i),(iii) and inequality (4.4) we deduce that
[TABLE]
and hence taking the limit in both hand sides of the inequality (4) as yields
[TABLE]
Therefore, taking large enough and recalling that pointwisely, we get a contradiction. Thus for any and we have
[TABLE]
which is the claim.
Step 4: Supersolution property at .
We are going to show that
[TABLE]
Let be a sequence in such that
[TABLE]
Since is deterministic, we have from the definition of that
[TABLE]
where we have used that on . Next taking the limit in both hand sides as , using that is continuous and arguing similarly to (4.41) we have
[TABLE]
that is, . On the other hand, setting , considering the RBSDE (3.1) on , taking expectation to obtain
[TABLE]
since and u^{i}(\tau_{k},X^{t_{k},x_{k}}_{\tau_{k}})\geq\max\limits_{j\in\mathcal{I}^{-i}}\big{(}u^{j}(\tau_{k},X^{t_{k},x_{k}}_{\tau_{k}})-g_{ij}(\tau_{k},X^{t_{k},x_{k}}_{\tau_{k}})\big{)}. It implies that
[TABLE]
The second inequality stems from Fatou’s Lemma while the third one is due to the fact that (\max\limits_{j\in\mathcal{I}^{-i}}\big{(}u^{j}_{*}-g_{ij})\big{)}_{*} is lower semicontinuous and by (4.45), at least for a subsequence, . Thus
[TABLE]
which is the claim. The proof is now complete.
Remark 4.10**.**
If the switching costs are continuous, conditions (4.10) and (4.9), read respectively as:
[TABLE]
and
[TABLE]
Therefore and by the non free-loop property one deduces that which implies that . For more details one can see e.g. Hamadène and Morlais [17].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Barles, G., Perthame, B: (1987) Discontinuous solutions of deterministic optimal stopping time problems. Model. Math. Anal. Num. 21 , pp. 557-579.
- 2[2] Bouchard, B.: (2009) A stochastic target formulation for optimal switching problems in finite horizon. Stochastics . 81 , pp. 171-197.
- 3[3] Brekke, K. A., Oksendal, B.: The high contact principle as a sufficiency condition for optimal stopping, in Stochastic models and Option Values , North-Holland, Amsterdam, 1991, pp. 187-208.
- 4[4] Brekke, K. A., Oksendal, B.: (1994) Optimal switching in an economic activity under uncertainty. SIAM J. Control Optim . 32 , pp. 1021-1036.
- 5[5] Carmona, R., Ludkovski, M.: (2008) Pricing asset scheduling flexibility using optimal switching. Appl. Math. Finance . 15 , pp. 405-47
- 6[6] Chassagneux, J.-F., Elie, R., Kharroubi, I.: (2011) A note on existence and uniqueness for solutions of multidimensional reflected BSD Es. Electron. Commun. Probab. 16 , pp. 120-128.
- 7[7] Dellacherie, C., Meyer, P. A: Probabilités et Potentiel , V-VIII, Hermann, Paris, 1980.
- 8[8] Djehiche, B., Hamadène, S., Popier, A.: (2009) A Finite horizon optimal multiple switching problem. SIAM J. Control Optim. 48 , pp. 2751-2770.
