Optimality conditions and local regularity of the value function for the optimal exit time problem
Luong V. Nguyen

TL;DR
This paper investigates the optimal exit time control problem, establishing conditions for the regularity and differentiability of the value function, and applying the Pontryagin maximum principle to derive optimality and sensitivity conditions.
Contribution
It provides new insights into the regularity properties of the value function for exit time problems, including conditions for continuous differentiability based on proximal subdifferentials.
Findings
The value function is semiconcave and twice differentiable almost everywhere.
Differentiability at a point does not imply continuous differentiability nearby.
Proximal subdifferential nonemptiness implies local continuous differentiability.
Abstract
We consider the control problem with \textit{exit time}. Unlike the Bolza and Mayer problems, in this problem the terminal time of the trajectories is not fixed, but it is the first time at which they reach a given closed subset - \textit{the target}. The most studied example is the \textit{optimal time problem}, where we want to steer a point to the target in minimal time. In this section, we first introduce the exit time problem, then we recall the existence of optimal controls, and some regularity results for the value function. We then use a suitable form of the \textit{Pontryagin maximum principle} to study some optimality conditions and sensitivity relations for the exit time problem. The strongest regularity property for the value function that one can expect, in fairly general cases, is \textit{semiconcavity}. In this case, the value function is twice differentiable almost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Nonlinear Partial Differential Equations · Geometric Analysis and Curvature Flows
11institutetext: Luong V. Nguyen 22institutetext: Institute of Mathematics, Polish Academy of Sciences, Warsaw, Poland
22email: [email protected]; [email protected]
Optimality conditions and local regularity of the value function for the optimal exit time problem
Luong V. Nguyen
We consider the control problem with exit time. Unlike the Bolza and Mayer problems, in this problem the terminal time of the trajectories is not fixed, but it is the first time at which they reach a given closed subset - the target. The most studied example is the optimal time problem, where we want to steer a point to the target in minimal time.
In this section, we first introduce the exit time problem, then we recall the existence of optimal controls, and some regularity results for the value function. We then use a suitable form of the Pontryagin maximum principle to study some optimality conditions and sensitivity relations for the exit time problem. The strongest regularity property for the value function that one can expect, in fairly general cases, is semiconcavity111A function is semiconcave if it can be written as a sum of a concave function and a function.. In this case, the value function is twice differentiable almost everywhere. Furthermore, in general, it fails to be differentiable at points where there are multiple optimal trajectories and its differentiability at a point does not guarantee continuous differentiability around this point. In the subsection 0.3 we shown that, under suitable assumptions, the nonemptiness of proximal subdifferential of the value function at a point implies its continuous differentiability on a neighborhood of this point.
0.1 The optimal exit time problem
We assume that a compact nonempty set and a continuous function are given. We consider the control system
[TABLE]
where is a measurable function which is called a control for the system (1). The set is called the control set. We denote by the set of all measurable control functions. We will often require the following assumptions
- (A1)
There exists such that
[TABLE]
- (A2)
exists and is continuous. Moreover, there exists such that
[TABLE]
It is well known from the ordinal differential equations theory that under assumption (A1), for each , (1) has a unique solution. In this case, we will denote by the solution of (1) and call the trajectory (starting at ) of the control system (1).
We now assume that a closed subset with compact boundary of the state space is given and is called the target. For a given trajectory of (1), we set
[TABLE]
with the convention that if for all . Then is the time at which the trajectory reaches the target for the first time, provided and we call the exit time of the trajectory . Denote by the set of all such that for some and we call the reachable set.
Given two continuous functions (called running cost) and (called terminal cost) with positive and is bounded from below, we consider the functional
[TABLE]
We are interested in minimizing , for , over all . If is such that
[TABLE]
then we call an optimal control for . In this case, is called an optimal trajectory.
The value function of the optimal exit time problem is defined by
[TABLE]
From the definition of , we have the so-called dynamic programming principle
[TABLE]
If is optimal then the equality holds.
The maximized Hamiltonian associated to the control system is defined by222Throughout the chapter, is usually a row vector in . However, in this section, and its dual space are identical. We also denote by the inner product between two vectors and .
[TABLE]
It is well-known, under some assumptions (see Theorem 8.18 in CSB04 ), that is a viscosity solution of the Hamilton - Jacobi - Bellman equation
[TABLE]
We now list some more assumptions on the cost functionals and the target which will be used in the sequel.
- (A0)
For all , the following set is convex
[TABLE]
- (A3)
There exist and such that and for all and .
- (A4)
The function is continuous in both arguments and locally Lipschitz continuous with respect to , uniformly in . Moreover, exists for all and is locally Lipschitz continuous in , uniformly in .
- (A5)
There exists a neighborhood of such that is locally semiconcave and is of class in . Moreover, denoting by the Lipschitz constant of in , we assume
[TABLE]
- (A6)
The boundary of is an -dimensional manifold of class and there exists such that for any , we have
[TABLE]
where denotes the unit outward normal to at .
Assumption (A0) is a condition to ensure the existence of optimal trajectories. More precisely, one has
Theorem 0.1
CPS00 ; CSB04 * Under assumptions (A0) - (A5), there exists a minimizer for optimal control problem for any choice of initial point . Moreover, the uniform limit of optimal trajectories is an optimal trajectory; that is, if are trajectories converging uniformly to and every is optimal for the point , then is optimal for .*
The condition in assumption (A5) can be regarded as a compatibility condition on the terminal cost . Together with other assumptions, it ensures the continuity of the value function (see Remark 2.6 in CPS00 and Proposition IV.3.7 in BCD97 ). Furthermore, we have the following regularity property of the value function.
Theorem 0.2
CPS00 ; CSB04 * Under hypothesis (A1)-(A6), the value function is locally semiconcave in .*
Note that in CPS00 ; CSB04 , the semiconcavity result is proved under weaker assumptions on the data. In fact, is only assumed to satisfy an interior sphere condidtion, while , and are assumed to be semiconcave in the -variable and is only continuous.
For the precise definition, properties and characterizations of semiconcave functions, we refer the reader to CSB04 .
0.2 Optimality conditions and sensitivity relations
We present some optimality conditions and sensitivity relations for the optimal exit time problem. One of important tools for our analysis is given by the so-called Pontryagin maximum principle. Before recalling a version of the maximum principle for the optimal control problem under consideration, we need to introduce some notation. For a given subset of , we denote by its boundary, by its complement. The distance function from is define for as
[TABLE]
and the oriented distance function from is defined by , whenever .
Let be an open subset of , be a lower semicontinuous function and . The proximal subdifferential of at is the set
[TABLE]
The Fréchet subdifferential of at is the set
[TABLE]
The Fréchet superdifferential of at is the set
[TABLE]
If is locally Lipschitz, then the reachable gradient of at is the set
[TABLE]
We now start with two technical lemmas.
Lemma 1
(see, e.g. CPS00 ) Assume (A1) - (A6). Given , let be the outer normal to at . Then there exists a unique such that .
Notice that since the boundary of the target is of class , the outer normal to at a point is . From Lemma 1, the function which satisfies is well-defined. Moreover, we have
Lemma 2
LN * Assume (A1) - (A6). The function is continuous.*
We recall the maximum principle in the following form
Theorem 0.3
Assume (A1) - (A6). Let and let be an optimal control for . Set for simplicity
[TABLE]
Let be the solution to the equation
[TABLE]
*with
Then satisfies*
[TABLE]
for a.e.
For the proof of the above maximum principle, we refer the reader to Theorem 4.3 in CPS00 where the principle is proved under weaker assumptions on and .
Given an optimal trajectory , then, by Lemma 1, there is a unique function satisfying the properties of Theorem 0.3 and we call the dual arc associated to the trajectory . Observe that the dual arc is a nonzero function and satisfies where is the exit time of . The following theorem gives a connection between the dual arcs and the Fréchet supperdifferential of the value function.
Theorem 0.4
CPS00 * Under the assumptions of Theorem 0.3, the dual arc satisfies*
[TABLE]
It is proved in CPS00 ; CSB04 that under the assumptions of Theorem 0.3 and the following assumption
- (H)
for any , if for all in a convex set , then is a singleton,
the value function is differentiable along optimal trajectories except the initial and final point points and therefore, by Theorem 0.4 if is the dual arc associated with an optimal trajectory then for all . This property plays an important role to prove an one-to-one correspondence between the number of optimal trajectories starting at a point and the number of elements of the reachable gradient of at . This implies that is differentiable at iff there is a unique optimal trajectory starting at . The following example shows that without assumption (H), may not differentiable along optimal trajectories.
Example 1
We consider the minimum time problem i.e., , for the control system
[TABLE]
with the initial conditions . Define the set
[TABLE]
The target is the set
[TABLE]
The Hamiltonian is
[TABLE]
One can easily check that all assumptions of Theorem 3.8 in CPS00 (which says that the value function is differentiable along optimal trajectories except the starting and the terminal points) are satisfied and assumption (H) is not satisfied. Let be the minimum time to reach the target.
If , then is the optimal control for and
[TABLE]
is the optimal trajectory starting at and we can easily compute that
[TABLE]
If , then is the optimal control for , the optimal trajectory is
[TABLE]
and the minimum time to reach the target from is
[TABLE]
Since is not differentiable when , fails to be differentiable at any point of optimal trajectories starting at .
Later we will see that is still differentiable at a point iff there is a unique optimal trajectory starting at even when (H) is not satisfied. In this case we may not have an one-to-one correspondence between the number of optimal trajectories starting at a point and the number of elements of the reachable gradient .
If we assume that
- (H1)
then we can compute partial derivatives of the maximized Hamiltonian (see Theorem 7.3.6 and also Remark 8.4.11 in CSB04 ).
Theorem 0.5
If (H1) holds, then for any , we have
[TABLE]
and
[TABLE]
where is any element of such that
[TABLE]
Since we are going to evaluate the Hamiltonian along dual arcs which are nonzero, the lack of differentiability of is not an obstacle. From Theorem 0.3 and Theorem 0.5, we have
Theorem 0.6
Assume (A1) - (A6) and (H1). Let be an optimal trajectory and let be the associated dual arc to . Then the pair solves the system
[TABLE]
Consequently and are of class .
The next theorem can be seen as a propagation property of the Fréchet subdifferential of the value function forward in time along optimal trajectories
Theorem 0.7
Assume (A1), (A2) and (A4). Let and let be an optimal control for . Set for simplicity
[TABLE]
Assume that and let be a solution of the equation (2) satisfying . Then for all .
For the proof of the previous Theorem, one can find in LN . Similarly, one can prove the following propagation result for the proximal subdifferential of the value function which will be used to prove the main results in the next section.
Theorem 0.8
Assume (A1), (A2) and (A4). Let and let be an optimal control for . Set for simplicity
[TABLE]
Assume that and let be a solution of the equation (2) satisfying . Then for some and for all , there exists such that, for every ,
[TABLE]
Consequently, for all .
Using above results, we can obtain the following results 9see LN for the proofs).
Theorem 0.9
Assume (A0) - (A6) and (H1). Let be such that is differentiable at . Consider the solution of (3) with initial conditions
[TABLE]
Then is an optimal trajectory for and is the dual arc associated to with for all where is the exit time of . Moreover, is the unique optimal trajectory staring at .
Theorem 0.10
Assume (A0) - (A6) and (H1). Let and . Consider the solution of (3) with initial conditions
[TABLE]
Then is an optimal trajectory for and is the dual arc associated to . Moreover for all where is the exit time of .
Theorem 0.11
Assume (A0) - (A6) and (H1). If there is only one optimal trajectory starting at a point then is differentiable at .
From Theorem 0.9 and Theorem 0.11, we have
Corollary 1
Assume (A0) - (A6) and (H1). The value function is differentiable at a point if and only if there exists a unique optimal trajectory starting at .
In Example 1, the value function is not differentiable at any point although there is a unique optimal trajectory for every with . The reasons are that the maximized Hamiltonian does not belong to and that the target is not smooth. We now give a simple example showing that the value function is differentiable at a point although there are multiple optimal trajectories starting at .
Example 2
We consider the minimum time problem for the control system
[TABLE]
with the initial condition . The target is the set
[TABLE]
The Hamiltonian is defined by . We can easily check that assumptions (A0) - (A6) are satisfied anh (H1) is not satisfied.
Observe that the minimum time function (the value function) is of class (see, e.g., CS95 ; CSB04 ). Therefore is differentiable at . However, there are multiple optimal trajectories starting at . Indeed, the trajectories corresponding to the controls and are optimal for .
We next give a class of control systems which can be applied Corollary 1.
Example 3 (see, e.g. Example 4.12 CPS00 )
We consider the control system with the dynamics given by
[TABLE]
where , and the control set is the closed ball of center zero and radius in . We also consider the running cost of the form
[TABLE]
where .
Since is affine and is convex with respect to and is convex, one can check that assumption (A0) is satisfied. If we assume that are of class , that are bounded and Lipschitz and that is bounded below by a positive constant, then assumption (A1) - (A4) are satisfied. The Hamiltonian
[TABLE]
satisfies assumption (H1). Then if the final cost function and the target satisfy assumption (A5) and (A6) then our result can be applied.
0.3 Local regularity of the value function
In this section, we provide sufficient conditions which guarantee the continuous differentiability of the value function around a given point. Local regularity of is discussed in the subsection 0.3, whereas local () regularity of is established in the subsection 0.3. In both subsections 0.3 and 0.3, the main condition to ensure the continuous differentiability of around a given point is the nonemptiness of the proximal subdifferential of at .
Local regularity
In addition, we require the following assumptions.
- (A7)
is of class in a neighborhood of bdry and bdry is of class .
- (H2)
.
Below we denote by the tangent space to the dimensional -manifold at .
Consider the Hamiltonian system
[TABLE]
on for some , with the final conditions
[TABLE]
where is in a neighborhood of and with satisfying . Note that, by (A7), is of class in a neighborhood bdry (see Proposition 3.2 in P02 ) and therefore is of class in a neighborhood of bdry.
For a given in a neighborhood of , let be the solution of (6) - (7) defined on a time interval with . Consider the so-called variational system
[TABLE]
Then the solution of (8) is defined in and depends on . Moreover
[TABLE]
Definition 1
For , the time
[TABLE]
is said to be conjugate-like for iff there exists such that
[TABLE]
In this case, the point is called conjugate-like for .
Remark 1
In the classical definition of conjugate point it is required, for some , (see e.g. CS951 ; CaF ; P02 and Section 0.3). Here, we have narrowed the set of such getting then a stronger result in Theorem 0.12 below than the one we would have with the classical definition.
Theorem 0.12
Assume (A0) - (A7) and (H2). Let be such that is differentiable at and be the optimal trajectory for . Set . If there is no conjugate-like time in for then is of class in a neighborhood of .
When the maximized Hamiltonian is strictly convex with respect to the second variable, we can progress as in CF1 ; CF2 to obtain the following result.
Theorem 0.13
Assume (A0) - (A7), (H2) and that for all . Let . If , then is of class in a neighborhood of .
Since is locally semiconcave and , is differentiable at . The idea of the proof is to absent a conjugate time for the final point of the optimal trajectory starting at and then apply Theorem 0.12.
In Example 3, if is nonsingular for all then for all . Therefore if and bdry are smooth enough then Theorem 0.13 can be applied.
When the running cost does not depend on , i.e., , the maximized Hamiltonian is never strictly convex with respect to the second variable. In this case for all whenever exists. Following the lines for the minimum time problem in FL13 , we obtain the following particular case
Theorem 0.14
Assume that (A0) - (A7), (H2) hold true, the kernel of has the dimension equal to for every and that . Let . If , then is of class in a neighborhood of .
Example 4 (see, e.g., FL13 )
Consider the control system with the dynamics given by
[TABLE]
where and the control set is the closed ball in of center zero and radius .
Since is affine with respect to , assumption (A0) is verified. Let for all . The Hamiltonian
[TABLE]
satisfies assumption (H2) whenever is also surjective for all and are of class . Furthermore, for all
[TABLE]
and for any ,
[TABLE]
Fix any ker . Then, from the above equality we get
[TABLE]
On the other hand, if , then
[TABLE]
Hence, by (9), . Let be such that . Consequently . Since is surjective, we deduce that and that .
Using the inclusion ker , we deduce that for all , i.e.,
[TABLE]
So, if the target and are of class and for any bdry , the classical inward pointing condition
[TABLE]
holds true, then Theorem 0.14 can be applied.
Local regularity
Let be an integer with . In this subsection, we require the following additional assumptions.
- (A8)
The functions and are of class in both arguments and the boundary of the target is an - manifold of class . Moreover, is of class in a neighborhood of bdry.
- (A9)
For all , there exists a unique such that
[TABLE]
and the function is of class in .
For our analysis, in assumption (A9), we only need that is of class in an open neighborhood of the set . For examples satisfying this condition, one can find in CPS00 ; P02 . It follows from (A8) and (A9) that the Hamiltonian satisfies
- (H3)
.
We next introduce the definition of conjugate times which is related to the Jacobians of solutions of the backward Hamiltonian system considered in P02 . Given , we denote by the solution the backward Hamiltonian system
[TABLE]
with the initinal conditions
[TABLE]
where with satisfying
[TABLE]
Note that and that under our assumptions the function is of class (see, e.g., P02 ).
As shown in P02 , the solution of (11) - (12) is defined for all . Moreover, are of class on .
Now let and denote respectively the Jacobians of and with respect to the pair in where solves (11) - (12). Then is the solution of the system
[TABLE]
with the initial conditions
[TABLE]
where are square matrices depending smoothly on which we can compute.
As explained in P02 , the Jacobian and are understood in the following sense. Fixed and . Since is an -dimensional manifold of class , there exist an open neighborhood and a parameterized function of class with the inverse of class , where is an open neighborhood of . Then and denote the Jacobians of and with respect to the coordinates and the time at the point i.e., and . In this case,
[TABLE]
and one can compute that for some real constant . Therefore, (see proof of Lemma 4.2 in P02 ). Then
[TABLE]
and by properties of linear systems, we have
[TABLE]
Note that this definition of the Jacobian depends on the parameterized function. For our purpose, however, this does not matter because we only focus on the ranks of the matrices and which are independent of the choice of the parameterized functions.
Definition 2
For , the time
[TABLE]
is said to be conjugate for if and only if
[TABLE]
Fixed and . Since is invetible for sufficiently small, if there exists conjugate time for then . On the other hand, if there is no conjugate time for in , then by the continuity and the fact that for all , there exist such that there is no conjugate time for any in with . In this case is an one-to-one correspondence in a neighborhood of . Using this fact, one can prove the following theorem.
Theorem 0.15
Assume (A0) - (A6) and (A8) - (A9). Let be such that is differentiable at . Let be the optimal trajectory starting at and be the exit time of . Set . If there is no conjugate time for in then is of class on an open neighborhood of .
Remark 2
Observe that if is differentiable at a point then V is differentiable along the optimal trajectory starting at except the final point. Then in Theorem 0.15 we can conclude that is of class on an open neighborhood of for all .
Following the idea used in CFS14 where the authors study the regularity of the value function for a Mayer optimal control problem, by using Theorem 0.15, we can prove the following theorem.
Theorem 0.16
AsAssume (A0) - (A6) and (A8) - (A9). Let . If then is of class on an open neighborhood of .
Remark 3
In the case the running cost does not depend on - variable, i.e., , the results of this section still hold true if we repace assumption (A9) by (H3), a weaker assumption.
Acknowledgments. This chapter was started writing when the author was a SADCO PhD fellow, position ESR4 at University of Padua which was supported by the European Union under the 7th Framework Programme “FP7-PEOPLE-2010-IT”, Grant agreement number 264735-SADCO. This work was also supported by funds allocated to the implementation of the international co-funded project in the years 2014-2018, 3038/7.PR/2014/2, and by the EU grant PCOFUND-GA-2012- 600415.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) M. Bardi and I. Capuzzo Dolcetta, Optimal control and viscosity solutions of Hamiltonian - Jacobi - Bellman equations, Birkhauser, Boston, 1997.
- 2(2) P. Cannarsa, H. Frankowska, From pointwise to local regularity for solutions of Hamilton - Jacobi equations, Calc. Var. Partial Differ. Equ. 49 (2014) 1061–1074.
- 3(3) P. Cannarsa, H. Frankowska, Local regularity of the value function in optimal control, Syst. Control Lett. 62 (2013) 791–794.
- 4(4) P. Cannarsa, H. Frankowska, T. Scarinci, Second-order sensitivity relations and regularity of value function for the Mayer problem in optimal control. SIAM J. Control Optim . 53 (2015) 3642–3672.
- 5(5) P. Cannarsa, C.Pignotti and C. Sinestrari, Semiconcavity for optimal control problems with exit time, Discrete Contin. Dynam. Systems , 6 (2000), 975 – 997.
- 6(6) P. Cannarsa and C. Sinestrari, Convexity properties of the minimun time function, Calc. Var. Partial Differential Equations , 3 (1995) 273–298.
- 7(7) P. Cannarsa and C. Sinestrari, On a class of nonlinear time optimal control problems, Discrete Contin. Dynam. Systems , 1 (1995), 285–300.
- 8(8) P. Cannarsa and C. Sinestrari, Semiconcave Functions, Hamilton - Jacobi Equations and Optimal Control, Birkhauser, Boston, 2004.
