Minimax and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations for Time-Delay Systems
Anton Plaksin

TL;DR
This paper investigates Hamilton-Jacobi-Bellman equations for time-delay systems, establishing the existence, uniqueness, and equivalence of minimax and viscosity solutions to the value functional in a delay differential control problem.
Contribution
It introduces a framework for analyzing Hamilton-Jacobi-Bellman equations with delay, proving the equivalence of minimax and viscosity solutions for such systems.
Findings
Existence of minimax and viscosity solutions
Uniqueness of these solutions
Solutions coincide with the value functional
Abstract
The paper deals with a Bolza optimal control problem for a dynamical system which motion is described by a delay differential equation under an initial condition defined by a piecewise continuous function. For the value functional in this problem, the Cauchy problem for the Hamilton-Jacobi-Bellman equation with coinvariant derivatives is considered. Minimax and viscosity solutions of this problem are studied. It is proved that both of these solutions exist, are unique and coincide with the value functional.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: Anton Plaksin 22institutetext: N.N. Krasovskii Institute of Mathematics and Mechanics (IMM UB RAS)
Ural Federal University
Yekaterinburg, Russia,
Minimax and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations for Time-Delay Systems
Anton Plaksin
(Received: date / Accepted: date)
Abstract
The paper deals with a Bolza optimal control problem for a dynamical system which motion is described by a delay differential equation under an initial condition defined by a piecewise continuous function. For the value functional in this problem, the Cauchy problem for the Hamilton-Jacobi-Bellman equation with coinvariant derivatives is considered. Minimax and viscosity solutions of this problem are studied. It is proved that both of these solutions exist, are unique and coincide with the value functional.
Keywords:
optimal control time-delay systems Hamilton-Jacobi equations coinvariant derivatives minimax solution viscosity solution
MSC:
49J25 49K25 49K35 49L20 49L25
††journal: JOTA
1 Introduction
In optimal control problems for dynamical systems which motions are described by ordinary differential equations, studies of infinitesimal properties of a value function lead to a Hamilton-Jacobi-Bellman (HJB) equation, which is a particular case of Hamilton-Jacobi (HJ) equations with partial derivatives. In the case when an optimal control problem is considered on a finite time interval and has a cost functional of Bolza type, a value function satisfies the corresponding natural terminal condition, which, together with the HJB equation, determine the Cauchy problem. Since in many cases Cauchy problems for HJ equations do not have a classical (continuously differentiable) solution, various approaches to a notion of a generalized solution were developed. The main of them are minimax and viscosity approaches. The minimax approach Subbotin_1980; Subbotin_1984; Subbotin_1995 originates in the positional differential game theory Krasovskii_Subbotin_1988; Krasovskii_Krasovskii_1995. According to this approach, a generalized (minimax) solution is a function that satisfies the pair of stability conditions with respect to characteristic differential inclusions. In infinitesimal form, these conditions reduce to the pair of inequalities for directional derivatives. In the viscosity approach Crandall_Lions_1983; Crandall_Evans_Lions_1984, a HJ equation is replaced by the pair of inequalities for sub- and supergradients, and a generalized (viscosity) solution is a function satisfying these inequalities. In investigations of minimax and viscosity solutions of Cauchy problems for HJB equations, it was shown (see, e.g., Subbotin_1995; Barbu_1986; Bardi_Capuzzo-Dolcetta_1997; Evans_1998) that both of these solutions exist, are unique and coincide with the value function in the corresponding optimal control problems. The goal of the paper is to obtain the similar result in the case when a motion of a dynamical system is described by delay differential equations.
The first investigations of control problems for time-delay systems showed (see Krasovskii_1962; Osipov_1971 and also Oguztoreli_1966; Banks_1968; Banks_Manitius_1974) that an analogue of a value function in such problems is a value functional on a space of motion histories. It raises natural questions about the suitable notion of the differentiability of such functionals, the corresponding notions of directional derivatives, sub- and supergradients, and definitions of generalized solutions of the corresponding HJB equations.
The viscosity solution theory for HJ equations with Frechet derivatives began with Crandall_Lions_1985; Crandall_Lions_1986a. In these papers, the definition of the viscosity solution in terms of inequalities for Frechet sub- and supergradients was given, and existence and uniqueness of such solution were proved. After that, a lot of investigations (see, e.g., Barbu_Barron_Jensen_1988; Soner_1988; Cannarsa_Da_Prato_1990; Cannarsa_Frankowska_1992; Li_1995) dealt with applications of the viscosity approach to control problems for abstract evolution systems in Hilbert or Banach spaces. In particular, in Soner_1988; Cannarsa_Da_Prato_1990, for Bolza optimal control problems for evolution systems, modified definitions of viscosity solutions of Cauchy problem for HJB equations were given, their existence, uniqueness and coincidence with the value functional were shown. Note that the conditions in these papers allow to interpret some class of time-delay systems as evolution systems, however, this class is not general enough, since it does not contain systems with discrete delay. The optimal control problem for systems with discrete delay was considered in Barron_1990. It was proved that the value functional is a viscosity solution of the Cauchy problem for the HJB equation, but the uniqueness question of the viscosity solution was not investigated. One could also mention papers Wolenski_1994; Clarke_Wolenski_1996 in which optimization problems for quite general delay differential inclusions (which cover the case of discrete delay) were considered and various necessary optimality conditions were given.
In Kim_1999, for the description of infinitesimal properties of a value functional in optimal control problems for time-delay systems, the notion of coinvariant derivatives was used. Note that such derivatives and their close analogues were applied later to a wide range of control problems for various functional differential systems (see, e.g., Lukoyanov_2000; Lukoyanov_2001; Lukoyanov_2010a; Lukoyanov_2010b; Aubin_Haddad_2002; Pepe_Ito_2012; Lukoyanov_Gomoyunov_Plaksin_2017; Bayraktar_Keller_2018). The theory of minimax and viscosity solutions of Cauchy problems for HJ equations with coinvariant derivatives and its application to differential games for time-delay systems were developed in Lukoyanov_2000; Lukoyanov_2001; Lukoyanov_2010a; Lukoyanov_2010b. In these papers, the class of time-delay systems under consideration is quite general and includes systems with discrete delay. In Lukoyanov_2000; Lukoyanov_2001, it was shown that the value functional is the unique minimax solution. In Lukoyanov_2010a, the description of the value functional in terms of suitable directional derivatives was given. In Lukoyanov_2010b, similar to Soner_1988, the modified definition of a viscosity solution based on a sequence of compact sets is considered. It allows to prove that the viscosity solution exists, is unique and coincides with the minimax solution, however, such definition is not reduced to the classical definition of a viscosity solution in the particular case without delay. For more natural definitions of a viscosity solution, the uniqueness questions is still open.
This paper is aimed to solve this question and to develop the theory of minimax and viscosity solutions of HJB equations for time-delay systems, which generalizes in a natural way the classical theory of both minimax and viscosity solutions of HJB equations for systems of ordinary differential equations.
In the paper, a Bolza optimal control problem for a time-delay system with discrete delay is considered. For the value functional of this problem, a HJB equation with coinvariant derivatives is investigated. Definitions of minimax and viscosity solutions (which are consistent with the classical definitions) of the Cauchy problems for this equation are studied. It is proved that both of these solutions exist, are unique and coincide with the value functional. Besides, the feedback scheme for constructing the optimal control by the minimax (viscosity) solution is given (see the proof of Theorem 2.2 ).
A principle idea for obtaining these results is to use the space of piecewise continuous functions as the state space in which the optimal control problem and the HJB equation are considered. As already noted earlier (see Wolenski_1994), the choice of a suitable state space plays an important role for an application of the viscosity approach to HJB equations for time-delay systems. The space of measurable functions can be used as the state space. But such choice significantly narrows the class of the corresponding time-delay systems and excludes important for applications systems with discrete delay (see Soner_1988). The space of continuous functions can also be used as the state space. It allows to cover the case of systems with discrete delay, but, as mentioned above, it makes it possible to prove the uniqueness only of the modified viscosity solution (see Lukoyanov_2010b). In Barron_1990; Wolenski_1994, other functional spaces were considered as the state space, but the uniqueness question of the viscosity solutions was not investigated. Presented in this paper choice of the space of piecewise continuous functions allows on the one hand to consider the case of systems with discrete delays, and on the other hand, to prove the uniqueness of the viscosity solutions in the classical sense. Note that this proof is based on Lemma LABEL:lem_MVI, which is an analogue of the theorem about ”Mean value inequality” Clarke_Ledyaev_1994; Clarke_Ledyaev_Stern_Wolenski_1998 (see also Subbotin_1993) for functionals defined on the space of piecewise continuous functions.
2 Formulation of Results
Let be the -dimensional Euclidian space with the inner product and the norm . A function is called piecewise continuous if there exist numbers such that, for each , the function is continuous on the interval and there exist a finite limit of as approaches from left. Denote by and the linear spaces of piecewise continuous and Lipschitz continuous functions .
Let and . Let us denote
[TABLE]
Define the following norms on the space :
[TABLE]
Consider a dynamical system which motion is described by the following delay differential equation:
[TABLE]
Here, is the time variable, is the state vector at the time , , is the current control action, is a compact set.
Let . Define
[TABLE]
Denote by the set of measurable functions . Let . By a motion of system (1), we mean a function that satisfies equation (1) for almost every .
Consider the following optimal control problem: for each , minimize the Bolza cost functional
[TABLE]
over all , where is the motion of system (1), is the function defined by , .
We assume that the following conditions hold:
The functions , , , , are continuous.
For every , there exists a number such that
[TABLE]
for any , and .
There exists a constant such that
[TABLE]
for any , and .
()
For every , there exists a number such that
[TABLE]
for any , where
[TABLE]
It is known that, under such conditions, for each and , there exists a unique motion of system (1).
The value functional in optimal control problem (1), (2) is defined by
[TABLE]
One can show (following, e.g., the scheme from (Evans_1998, p. 553)) that, for every and , the functional satisfies the following equation (a dynamic programming principle):
[TABLE]
where is the motion of system (1).
In order to obtain a Hamilton-Jacobi-Bellman (HJB) equation as infinitesimal form of equation (4), we will use the following definition of differentiability of functionals. Following Kim_1999; Lukoyanov_2000; Lukoyanov_2001, a functional is called coinvariantly differentiable (ci-differentiable) at a point , if there exist a number and a vector such that, for any , and , the following relation holds:
[TABLE]
where the function is defined by , , the value depends on the triplet , and as . Then is called the ci-derivative of with respect to and is the gradient of with respect to . Let us note that if does not depend on the functional variable , then the definition of ci-differentiability coincides with the definition of differentiability of functions.
Define the Hamiltonian of problem (1), (2) by
[TABLE]
Consider the following Cauchy problem for the HJB equation
[TABLE]
and the terminal condition
[TABLE]
Define the class of functionals in which we will search a solution of this problem. Denote by the set of functionals , which are continuous with respect to and satisfy the following Lipschitz condition: for every , there exists a number such that
[TABLE]
for any and . The choice of this class is motivated, in particular, the inclusion , which will be shown in Lemma LABEL:lem_rho_Phi.
The following theorem establishes the relation between problem (7), (8) and the value functional in the case when is ci-differentiable.
Theorem 2.1
The following statements hold:
If a functional is ci-differentiable at each point , , satisfies HJB equation (7) at these points and satisfies terminal condition (8), then the identity holds.
- 2.
If the value functional is ci-differentiable at a point , , then it satisfies HJB equation (7) at this point.
The proof of this theorem is described after the Theorem 2.2.
For the case when is not ci-differentiable, definitions of generalized (minimax and viscosity) solutions of problem (7), (8) are given below.
Taking the constant form , we denote
[TABLE]
Let . Denote by the set of the functions that satisfy the following delay differential inclusion:
[TABLE]
Note that the set is not empty. In particular, for each , the motion of system (1) satisfies the inclusion
[TABLE]
Definition 1
A functional is called a minimax solution of problem (7), (8) if satisfies the inclusion , terminal condition (8) and the following inequalities:
[TABLE]
for any , , and .
By analogy with Lukoyanov_2010a, lower and upper right directional derivatives of a functional along at , are defined by
[TABLE]
where and , .
The following sets are called the subdifferential and the superdifferential of the functional at , :
[TABLE]
Note that if a functional is ci-differentiable at , , then
[TABLE]
Definition 2
A functional is called a viscosity solution of problem (7), (8) if satisfies the inclusion , terminal condition (8) and the following inequalities:
[TABLE]
for any , .
Theorem 2.2
For , the following statements are equivalent:
(a)
The identity holds.
(b)
* is a minimax solution of problem (7), (8).*
(c)
* satisfies terminal condition (8) and the following inequalities:*
[TABLE]
for any , and .
(d)
* is a viscosity solution of problem (7), (8).*
In particular, this theorem establishes the existence and uniqueness of the minimax and viscosity solutions since the value functional is uniquely defined.
Note that Theorem 2.1 follows from the equivalence of statements and if we take into account (2). Below in the paper, auxiliary properties of system (1) and inclusion (11) will be given and Theorem 2.2 will be proved.
3 Properties of Time-Delay Systems
Proposition 1
For every , there exist numbers and such that
[TABLE]
for each , and .
Proof
Let . Put , and . Let , and . Then, according to (10), (11), we derive
[TABLE]
Therefore, applying Bellman-Gronwall lemma (see, e.g., (Bellman_Cooke_1963, p. 31)), we obtain , . Then, from (11), we deduce for almost every , which concludes the proof.
For and , we denote
[TABLE]
where is the motion of system (1).
Proposition 2
For every , there exists a number such that, for each , and , the motions and of system (1) satisfy the inequality
[TABLE]
[TABLE]
