On the Relation Between Two Approaches to Necessary Optimality Conditions in Problems with State Constraints
Andrei Dmitruk, Ivan Samylovskiy

TL;DR
This paper explores two methods for deriving necessary optimality conditions in control problems with state constraints, demonstrating how differentiating along boundary subarcs yields comprehensive stationarity conditions including measure sign definiteness.
Contribution
It introduces a two-stage variation approach to derive full stationarity conditions, linking Gamkrelidze's and Dubovitskii-Milyutin's methods for problems with boundary state constraints.
Findings
Full stationarity conditions include measure sign definiteness.
Differentiating along boundary subarc simplifies the problem.
Two-stage variation approach clarifies the relation between methods.
Abstract
We consider a class of optimal control problems with a state constraint and investigate a trajectory with a single boundary interval (subarc). Following R.V. Gamkrelidze, we differentiate the state constraint along the boundary subarc, thus reducing the original problem to a problem with mixed control-state constraints, and show that this way allows one to obtain the full system of stationarity conditions in the form of A.Ya. Dubovitskii and A.A. Milyutin, including the sign definiteness of the measure (state constraint multiplier), i.e., the nonnegativity of its density and atoms at junction points. The stationarity conditions are obtained by a two-stage variation approach, proposed in this paper. At the first stage, we consider only those variations, which do not affect the boundary interval, and obtain optimality conditions in the form of Gamkrelidze. At the second stage, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Relation Between Two Approaches
to Necessary Optimality Conditions in Problems
with State Constraints
Andrei Dmitruk [email protected] Central Economics and Mathematics Institute of the Russian Academy of Sciences
Lomonosov Moscow State University, Faculty of Computational Mathematics and Cybernetics
Ivan Samylovskiy [email protected] Lomonosov Moscow State University, Faculty of Computational Mathematics and Cybernetics
Abstract
We consider a class of optimal control problems with a state constraint and investigate a trajectory with a single boundary interval (subarc). Following R.V. Gamkrelidze, we differentiate the state constraint along the boundary subarc, thus reducing the original problem to a problem with mixed control-state constraints, and show that this way allows one to obtain the full system of stationarity conditions in the form of A.Ya. Dubovitskii and A.A. Milyutin, including the sign definiteness of the measure (state constraint multiplier), i.e. the non-negativity of its density and atoms at junction points. The stationarity conditions are obtained by a two-stage variation approach, proposed in this paper. At the first stage, we consider only those variations, which do not affect the boundary interval, and obtain optimality conditions in the form of Gamkrelidze. At the second stage, the variations are concentrated on the boundary interval, thus making possible to specify the stationarity conditions and obtain the sign of density and atoms of the measure.
1 Introduction
It is a well-known fact that optimality conditions in problems with state constraints are difficult for application in view of a nonstandard character of the state constraint multiplier. In their seminal work [1], A.Ya. Dubovitskii and A.A. Milyutin suggested to take this multiplier in the form of non-negative measure concentrated on the boundary set of the optimal trajectory (see also later works [2, 3]). This corresponds to the functional meaning of the state constraint, but then the adjoint equation contains a measure (more precisely, its generalized derivative 111 For a function of bounded variation, its generalized derivative is a generalized function in the sense that is the Riemann–Stieltjes measure generated by the function If is absolute continuous, then is a usual Lebesgue integrable function; if is discontinuous at a point then contains the Dirac function at ); hence, one comes to a differential equation of a new, yet uninvestigated type. Therefore, from the very beginning of studying such problems, many specialists tried to avoid somehow this difficulty in order to keep the adjoint equation as an ODE of convenient type.
If the boundary set of the trajectory is a segment, one can differentiate the state constraint and reduce it to a mixed control-state constraint, for which the stationarity conditions can be formulated with the usage of standard objects. The result can be then represented in terms of the original problem. This way was firstly suggested by R.V.Gamkrelidze in the classical book [4], earlier than paper [1], but its realization involves a nontrivial further step: one has to obtain the non-negativity of the measure (the state constraint multiplier), including the sign of the atoms of measure at junction points, which was not completely done in [4].
Thus, for the problems with state constraints there are two forms of optimality conditions (say, the maximum principle): the form of Gamkrelidze and the form of Dubovitskii–Milyutin. A natural question is how these two forms are connected. In paper [5] and then in [6], it was shown, by a simple change of the adjoint variable 222If is the adjoint variable in the Dubovitskii–Milyutin form, is the state constraint, and a monotone function generates the corresponding measure, then is the adjoint variable in the Gamkrelidze form., that one can pass from the conditions in the Dubovitskii–Milyutin form to the conditions in the form of Gamkrelidze, but the possibility of the inverse passage was not investigated.
In this paper, we consider a special class of problems and reference trajectories, in which the connection between the non-negativity of the measure and the minimization of the cost is the most transparent. In this class, one can completely fulfill Gamkrelidze’s idea and prove the non-negativity of the measure, thus showing that Gamkrelidze’s approach allows to obtain the conditions in Dubovitskii–Milyutin’s form. For simplicity, here we consider only necessary conditions of the so-called extended weak minimality (i.e., stationarity conditions), leaving the question about conditions of the strong minimality (the maximum principle) for further investigations.
2 Problem Statement
On a fixed time interval, consider the following optimal control problem with a state constraint:
[TABLE]
Here, and are state variables, is a control, the functions and are absolute continuous, is measurable and bounded. We will assume that the functions of dimensions respectively, are defined and continuous on an open subset together with their first-order partial derivatives w.r.t (The function can be formally considered as a function of variables ). Note that the state constraint is imposed only on the scalar state coordinate so it has the simplest form
Definition 2.1**.**
A triple of functions of the corresponding functional classes defined on and satisfying equations is called a process of problem A. A process is called admissible if it satisfies all the constraints of the problem.
3 The Reference Trajectory
Consider a reference process such that the trajectory touches the state boundary only on a segment where In other words, the interval is divided into parts and such that on on and on In addition, we suppose the control to be continuous on and Lipschitz continuous on (for convenience, we assume that the function at time moments has both left and right values), moreover, on for all and the following strict inequalities hold at the moments :
[TABLE]
which mean that the landing to the state boundary and the leaving it occurs with nonzero time derivatives. We also suppose that on the boundary arc i.e., that the state constraint is of order 1, and the gradients are positive independent for all (i.e., their nontrivial linear combination with non-negative coefficients cannot vanish). Here is the set of active indices.
For short, we will write the control constraints in the vector form
Throughout this paper, we assume that the above assumptions are satisfied for problem A.
Note that these assumptions are not easily verifiable a priori; however, they are often satisfied in typical real problems. As any other a priori assumptions, they can be considered, together with necessary conditions of optimality, as a united collection of conditions for the search of optimal trajectories. In the book [4], a less restrictive assumption on the reference trajectory is imposed: it may touch the state boundary not on one segment, but on a finite number of segments. The reference control is not assumed in [4] to lie in the interior of the set on instead, it is assumed that the gradient together with the active gradients are linearly independent on We do not consider here these more complicated cases in order to avoid more cumbersome technicalities, which would distract the reader’s attention from the main line of argumentation.
4 The Type of Minimum
We admit not only uniformly small variations of the control, but also small variations of its discontinuity points. This corresponds to consideration of the “extended” weak minimality. Recall its definition (see, e.g. [7]) for a problem of type A.
Definition 4.1**.**
An admissible process provides the extended weak minimumality in problem A if there exists an such that, for any Lipschitz continuous surjective mapping satisfying and and for any admissible process satisfying the conditions
[TABLE]
one has
The conditions on imply and If we take then relations (3) describe the usual uniform closeness between the processes and both in the state and control variables. However, for an arbitrary relations (3) extend the set of ”competing” processes, and thus the extended weak minimality is stronger than the classical weak minimality. The choice of arbitrary close to corresponds to a variation (deformation) of the current time within the interval in addition to the usual uniformly small variations of and for the fixed values of
If the control is continuous, the notion of extended weak minimality reduces to the usual notion of weak minimality. However, in the case of discontinuous the usual small variations of the control (corresponding to the weak minimality) leave the points of discontinuity of invariable, whereas the extended weak minimality allows for small variations of them.
5 Passage from Problem A to a Problem with Mixed Control-State Constraints
Following [8], we introduce a new time variable and consider the initial time variable on each segment as a new state variable subject to equation where the functions are additional controls.
On the segment introduce the state variables and the controls Hence, the following equations are satisfied:
[TABLE]
Thus, we “replicate” the variables of the original problem by taking their reductions to the intervals and considering all of these reductions as new variables of the new time 333This natural trick of replication of variables was first proposed, probably, in [9], and later was also used, may be independently, by many authors, e.g. in [10, 11, 12, 13, 8, 14, 15].. In terms of these new variables, we now formulate a new problem related to our problem A.
Since the original state variables are continuous at times (close to the new state variables should satisfy the junction conditions
[TABLE]
Moreover, since the time interval is fixed, the variables should satisfy the boundary conditions and
Instead of state constraint on we will consider the following pair of an endpoint and a mixed control-state constraints:
[TABLE]
while the control constraints will be now written in the form
[TABLE]
In the new problem, we will consider the “classical” weak minimality. Therefore, we do not need to consider the open constraints as well as the constraint since under our assumptions the control lies strictly in its interior.
Thus, we come to the following optimal control problem on the time interval
[TABLE]
under the following constraints:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
This problem will be called problem B. Here, are the controls and the state variables, Note that constraints (5) (included in (8) and (10)) define a smaller class of admissible trajectories than the state constraint does, so the new problem is not equivalent to the initial problem A. Later, in Sec. 8, we will also take into account nonconstant variations of i.e., of on the boundary interval. On the other hand, the new problem does not involve the state constraints and so it allows for a bigger class of admissible trajectories.
It is easy to see that, to each admissible process of problem A with on an interval one can associate a (not unique) admissible process of problem B (by choosing, e.g. ), and to each admissible process of problem B one can associate, simply by setting a unique admissible process of problem A with on
Let us establish a relation between the extended weak minimality in problem A and the “classical” weak minimality in problem B.
Lemma 5.1**.**
Let the process with the boundary arc provide the extended weak minimality in problem A. Then the corresponding process provides the weak minimality in problem B.
Proof.
Suppose that the process does not provide the weak minimality in problem B. Then, there exists a sequence of uniformly convergent processes of problem B, such that According to (2), there exist such and that
[TABLE]
Then, for sufficiently far members of the sequence, we get
[TABLE]
From here with account of , we get on Consider the segment Here hence for some Therefore, for sufficiently far members of the sequence. Thus, on the whole semi-open interval Similarly, one can prove that on the whole semi-open interval The inequality obviously holds on since and
Thus, for the corresponding processes of problem A, we get
[TABLE]
where The constraints are satisfied in view of inequalities
So, the prelimiting processes are admissible in problem A with the cost a contradiction with the extended weak minimality in problem A at the process ∎∎
6 Stationarity Conditions for Problem B
Let us agree to denote the derivatives of w.r.t. first, second, and third arguments as respectively, no matter on which variables these functions depend.
The three constraints (10) will be treated as mixed control-state ones. In order to apply the known stationarity conditions, we have to check whether these constraints are regular along the reference process
According to [16, 17, 18], mixed control-state constraints and of equality and inequality type given by smooth functions on are called regular at a point if their gradients w.r.t control are positive–linearly independent, which means that there do not exist multipliers and with and such that
[TABLE]
Applying this to the constraints (10), one can easily see that the gradients w.r.t control of these constraints are positive–linearly independent along the reference process (since they decompose into the gradients w.r.t each component hence their gradients w.r.t the “full” control vector are the more so positive–linearly independent, and thus, the mixed constraints in problem B are regular.
Assume the process := provides the weak minimality in problem B. Then it satisfies the stationarity conditions, which say the following (see, e.g. [16, 17, 18]): there exist multipliers Lipschitz functions measurable bounded functions of dimension and a measurable bounded scalar function such that the following conditions are satisfied:
nontriviality condition
[TABLE]
non-negativity condition
[TABLE]
complementary slackness condition
[TABLE]
and such that, in terms of the endpoint Lagrange function
[TABLE]
and the extended Pontryagin function
[TABLE]
the following conditions are also satisfied:
adjoint equations and transversality conditions
[TABLE]
[TABLE]
[TABLE]
stationarity conditions w.r.t controls
[TABLE]
and stationarity conditions w.r.t controls
[TABLE]
Here, are the derivatives of w.r.t the corresponding variables, taken at the point
Note that, since the function is Lipschitz continuous on the second equation in (19) and the nondegeneracy of implies that is also Lipschitz continuous.
Fist of all, let us state the following
Lemma 6.1**.**
* (hence, one can set ).*
Proof.
Suppose that Then by (16)–(17), the pair satisfies a linear system of ODEs with initial conditions whence and identically vanish. Similarly, and vanish too, hence and
In view of (19), we get and with some Lipschitz continuous functions moreover, since and we have Thus, in view of (16)–(17), and satisfy a system of linear ODEs with zero boundary values at which implies that and Therefore, and by (17) then in view of (20) we get hence Thus, the whole collection of multipliers is trivial, a contradiction with (11). ∎∎
7 Stationarity Conditions in Terms of the Original Problem A
Let us rewrite the stationarity conditions from Sec. 6 in terms of the original problem (1). To do this, define functions and on the interval as follows:
[TABLE]
Notice that function is Lipschitz continuous on the intervals By we will denote its generalized derivative. Since \dfrac{d\psi}{dt}=\dfrac{d\psi}{d\tau}\Big{/}\dfrac{dt}{d\tau}, then, getting back from the new time to the original time in equations (16)–(18), we obtain the following equations on the whole interval :
[TABLE]
Since the state variables of problem B are continuously joined at the corresponding ends of interval by the junction conditions (4), the state variables of problem A are continuous (and moreover, Lipschitz continuous). By similar arguments, the adjoint variables of problem A are also Lipschitz continuous. Consider the function .
Note first that it is Lipschitz continuous on every interval . Rewriting the transversality conditions for in terms of problem A, we get the following junction conditions:
[TABLE]
i.e., is continuous at and has the jump at the point At the ends of interval it satisfies the transversality conditions
[TABLE]
If we introduce the extended Pontryagin function for the problem with mixed control-state constrains
[TABLE]
then, in view of (22), we obtain the fulfilment of adjoint equations
[TABLE]
on the interval except the points and the fulfilment of stationarity condition w.r.t. control for all :
[TABLE]
Let us now rewrite these conditions in terms of problem A involving a state constraint. To do this, set introduce the Pontryagin function of this problem
[TABLE]
and the extended Pontryagin function
[TABLE]
with a multiplier at the state constraint. It is easy to verify that, along the interval except the points the adjoint equations
[TABLE]
and stationarity condition w.r.t. control hold.
The transversality conditions (24) are obviously still satisfied. In view of (21) and (23), the adjoint variable has the jumps
[TABLE]
Since , equation (20) rewritten in time turns into
[TABLE]
which is equivalent to the “energy conservation law”
Note that we get on while outside we get i.e., the complementary slackness condition for the state constraint holds:
[TABLE]
The definition of and condition (13) imply that the complementary slackness condition holds also for the control constraint:
[TABLE]
8 Non-negativity of Multiplier at the State Constraint
We have obtained stationarity conditions in problem A, in which the measure is absolute continuous on the interval with density and has the jumps (atoms) and at the points respectively. Our next aim is to define the sign of its density and jumps. To this end, we take into account that we have feasible variations on in our disposal.
Consider first any triple satisfying the linearized system in variations along the process on :
[TABLE]
The main technical formula to use is defined by the following
Lemma 8.1**.**
Let be given Lipschitz continuous functions and measurable bounded functions on an interval Let be also given functions Lipschitz continuous on intervals with possible jumps at the points where such that the following relations hold on every above interval:
[TABLE]
Then any solution of system (34) on satisfies the following equality:
[TABLE]
where are the jumps of at the points
Proof.
In view of (35), we have, on every interval :
[TABLE]
Integrating this equality on the whole interval (on we integrate by parts) and taking into account possible jumps of at the points we get that the left hand part of (36) is equal to
[TABLE]
which implies the required equality (36). ∎∎
Now, we introduce variations of some special type.
Lemma 8.2**.**
For any Lipschitz continuous function defined on the interval there exists a solution of system (34) on such that
Proof.
Let us set where is a scalar function to be found. Since from the second equation of system (34) we obtain v(t)=\,\big{(}\dot{\varkappa}-g^{\prime}_{z}\bar{z}-g^{\prime}_{x}\varkappa\big{)}/\,|g^{\prime}_{u}|^{2}\,. Substituting the corresponding into the first equation of system (34), we come to the following nonhomogeneous equation with respect to
[TABLE]
Setting for definiteness we get the solution of this equation, and then define and ∎∎
Consider now any on By Lemma 8.2, the system (34) has a solution on with To construct the corresponding process, which will be compared with the optimal one we have to go back to the original nonlinear system Note that (34) is the variational system for the latter one. According to the main property of variational equation, for any there exists a correction with as such that the triple satisfies the original system on It is easy to verify that this triple satisfies also conditions and on
Now, let us extend this triple, defined only on to a process defined on the whole interval To do this, on we set (i.e., ) and solve the nonlinear system with initial conditions On we again set and solve the nonlinear system with the initial conditions Thus, we get a process on the whole interval that by definition satisfies the constraint
Note that where on and on is the above function from Lemma 8.2, satisfies the linear system (34) on In particular, the pair satisfies on the system of linear equations in variations
[TABLE]
Lemma 8.3**.**
* on for small except the points *
Proof.
Define on and consider the interval On this interval, the pair satisfies the same nonlinear system as the pair but with the corrected initial conditions Then, the pair satisfies the linear system (37) with initial conditions Since there exists such that on Then on this interval for small enough
Since on we get for small Thus, on the whole The interval is considered similarly. ∎∎
Thus, the constructed process satisfies all the constraints of problem (1) and, since the process provides the weak minimality, we have
[TABLE]
Let us apply Lemma 8.1 to the constructed triple and functions defined in (21)–(24). Since on the first two integrals in (36) disappear, and since on and on the last integral disappears too. According to transversality conditions (24), the left hand part of relation (36) is exactly
[TABLE]
hence
[TABLE]
This inequality holds for any Lipschitz continuous function on Now, take any on Approximating it uniformly by functions and passing to the limit in (39), we obtain that inequality (39) holds for any Lipschitz continuous function on Considering only with zero values at the endpoints of we get which implies almost everywhere on
Consider now functions that vanish on for a small and satisfy with If the integral in the right hand side of (39) tends to zero, thus In view of equality we get Similarly, we get in view of continuity of at the point
Thus, we have proved the following
Lemma 8.4**.**
Let the process provide the extended weak minimality in problem A. Then on (i.e., decreases on moreover, and
Let us get back to the function having the jumps (30).
To “equalize” these jumps, we introduce the function
[TABLE]
Then, according to Lemma 8.4,
[TABLE]
and for The jumps of adjoint variable at junction points have now a “symmetric” form: The adjoint equation for (see (29) now looks as follows:
[TABLE]
where is the derivative in the sense of generalized functions. This equation should be regarded as an equality between measures:
[TABLE]
9 The Final Result
We now summarize our findings:
Theorem 9.1**.**
Let be an admissible process in problem A such that on on on assumption (2) holds, and let this process provide the extended weak minimality. Then there exist a Lipshitz continuous function a constant functions and Lipschitz continuous on each interval with possible jumps at and a measurable bounded function which generate the Pontryagin function
[TABLE]
and the extended Pontryagin function
[TABLE]
such that the following conditions hold:
- (a)
non-negativity conditions
[TABLE] 2. (b)
complementary slackness
[TABLE] 3. (c)
adjoint equations
[TABLE] 4. (d)
transversality conditions
[TABLE] 5. (e)
jumps conditions for the adjoint variable
[TABLE] 6. (f)
the energy conservation law
[TABLE] 7. (g)
stationarity condition w.r.t. control
[TABLE]
Remark 9.1**.**
Note again that theorem 9.1 is not new; in fact, it is the stationarity conditions in the Dubovitskii–Milyutin’s form with some refinements for our specific problem A. The novelty is only in the way of obtaining this result.
Remark 9.2**.**
If the functions are convex and the function turns out to be concave in then, as is known, stationarity condition (47) is equivalent to the maximality condition over the set
[TABLE]
i.e., the necessary conditions for the extended weak minimality and for the strong minimality are equivalent. However, if the cost is not convex, then neither strong, nor even weak minimality can be guaranteed.
Remark 9.3**.**
Note that, in the proof of theorem 9.1, the variation of the reference process are made in two stages, not in one, as usual. First, we use not the whole class of possible variations, but only those for which on the boundary interval In the second stage, we consider the stationarity conditions obtained for this reduced class, and substitute to them the “remaining” variations concentrated inside the boundary interval and near its endpoints, which makes it possible to specify these conditions. This approach might be feasible not only for the given class of problems, but also for some other problems (see, e.g. Sec. 12–14 below).
10 On the Jumps of Measure – the Multiplier at the State Constraint
Of special interest is the question, in which cases the adjoint variable and the function generating the measure do not have jumps at junction points? Studies show (see, e.g. the book [19, §6] or papers [5, 20, 15, 21, 22, 23]) that in case of strong (or at least Pontryagin type [16, 17]) minimality, the adjoint variable and measure do not have jumps under condition (2). However, this result is not, in general, valid in the case of extended weak minimality (the reason is that one cannot rely upon the maximality of Pontryagin function w.r.t. having in disposal only the stationarity of the extended Pontryagin function). Here, we specify a class of problems where the adjoint variable and measure have no jumps, and also present an example where the adjoint variable and measure corresponding to a stationary (but not optimal) trajectory do have nonzero jumps at junction points.
10.1 On the Absence of Atoms of Measure
Consider the case when the dynamics of the “free” state variable does not depend on Applying (47) to the interval we get
[TABLE]
whence, in view of assumption obtain on Thus, and so
According to the energy conservation law (46), the jump of the so-called switching function (the -dependent term of Pontryagin function) at the point is zero:
[TABLE]
where according to (2), and therefore One can similarly show that either.
Thus, in the considered case, the measure has no atoms, and the adjoint variables are continuous. In the general case, the question of presence or absence of atoms is open. We leave it for further research.
10.2 An Example Where the Measure Has Atoms
Consider the following problem:
[TABLE]
where and a parameter is arbitrary. Let Consider a trajectory generated by the control on for which on respectively. The value of is defined up to an additive constant, which does not matter.
Let this trajectory satisfy the stationarity conditions of Theorem 9.1, i.e., let there exist Lipschitz continuous function Lipschitz continuous on functions and with possible jumps at and a constant and a measurable bounded function which generate the Pontryagin function and the extended Pontryagin function
[TABLE]
such that the following condition hold:
- (a)
adjoint equations
[TABLE] 2. (b)
transversality conditions
[TABLE] 3. (c)
complementary slackness conditions
[TABLE] 4. (d)
stationarity conditions w.r.t. control
[TABLE]
that imply the adjoint variable to be as follows
[TABLE] 5. (e)
and the energy conservation law
[TABLE]
From (50)–(51) it follows that Set on The complementary slackness conditions are then obviously hold. Thus, according to (53), we get
[TABLE]
while the energy conservation law reads as follows:
[TABLE]
Conditions (56) are definitely satisfied if, e.g., is such that
[TABLE]
Then, the transversality conditions (51) hold too, and the jumps of at the points and are
[TABLE]
Now, it remains to find a smooth function satisfying conditions (57).
To this purpose one can use, e.g. the following polynomial:
[TABLE]
Thus, we get a stationary trajectory for which the adjoint variable has jumps at the points (Choosing a corresponding one can make these jumps not equal.)
Note that here the Pontryagin function is not concave in (on the contrary, it is convex), so the stationarity conditions w.r.t. control does not ensure the maximum of i.e., the reference trajectory does not satisfy the maximum principle, and hence, it is just stationary but does not provide the strong minimality.
Thus, the stationarity conditions do not guarantee the absence of atoms, while, according to [5, 20, 19, 15, 21, 22, 23]), the maximum principle does. If a trajectory is not just stationary, but provides the strong (or at least Pontryagin type) minimality, then it satisfies the maximum principle, and therefore, the corresponding measure cannot have atoms.
11 An Example Where the Measure Has a Negative Density
Let us present an example showing that the condition of non-negativity of the measure density is essential, i.e., it does not follow from other stationarity conditions. Consider the following problem:
[TABLE]
Here the parameters are fixed, while the parameters are also fixed and will be defined below. The function thus The endpoints of the trajectory are free.
Consider a trajectory with on the intervals respectively, and on Thus, for and for Check, whether stationarity conditions (43)–(47) hold.
The extended Pontryagin function is
[TABLE]
where, in view of the complementary slackness conditions, outside of and on
The adjoint equations and transversality conditions are as follows:
[TABLE]
By the first equation, hence, if we set the transversality condition for is satisfied.
From (59), we get equations for :
[TABLE]
Solving the initial value problems on and we get
[TABLE]
and, since is continuous everywhere and constant on it should satisfy the equality i.e.,
[TABLE]
Obviously, there exists such that it holds. Fix this .
Similarly, for we get from (59):
[TABLE]
Solving the initial value problems on and with we get
[TABLE]
The condition i.e., implies on According to (45), hence If then in a left neighborhood of a contradiction with Therefore, Similarly, we get i.e., has no jumps at and
The fulfillment of the obtained equalities is equivalent to the following linear relations on the parameters :
[TABLE]
Obviously, such do exist. Fix these values.
Finally, from (63) it follows that on and so on and on and then the condition implies that on these intervals. Thus, for the chosen parameters of problem and for the examined trajectory, there exists a unique collection of multipliers satisfying all the conditions of Theorem 9.1 except (41). Here, condition (63) implies that on which contradicts the condition (41). Thus, the last condition does not follow from the others, and the examined trajectory does not provide the extended weak minimality.
12 Generalization of the Obtained Result
An important feature of problem (1) is that the state constraint has the form i.e., it is imposed only on one state coordinate. Let us show how it is possible to use the above result to formulate stationarity conditions in a more general
[TABLE]
Here the state variable is absolutely continuous, and the control is measurable bounded functions. We assume that the data functions and are defined and twice continuously differentiable on an open subset
As before, we suppose that the reference process is such that the trajectory touches the state boundary only on a segment where In other words, the interval is divided into three parts and such that on on and on The control is continuous on Lipschitz continuous on and, moreover, on for all and the landing to the state boundary and the leaving it occurs with nonzero time derivatives:
[TABLE]
As before, we assume that the gradients are positive independent for all and on .
13 Reduction of Problem C to Problem A
We accept the following technical
Assumption C. There exist an open subset containing the curve and twice continuously differentiable functions such that the gradients are linearly independent at any point and, moreover, the mapping defined by
[TABLE]
is an injection. In other words, realizes a nondegenerate change of variables in :
[TABLE]
Herewith, the set is also open, and there exists a inverse mapping so that
[TABLE]
In what follows, we will always assume that satisfy the following relations
[TABLE]
Note that differentiation of (69) yields the equality
[TABLE]
where the right hand part is the identity matrix of dimension .
Remark 13.1**.**
It is sufficient to assume that contains not the entire curve but only part of it for Then, by extending the definition of the function out of one can reduce the situation to the case of containing the entire curve Here we do not dwell on the corresponding technical details. Note only that Assumption C is really satisfied in all reasonable, especially applied, problems with state constraints.
Obviously, the dynamics of state variables obeys the system
[TABLE]
therefore, problem (65) in these new variables transforms to the following problem of type (1) on the same time interval :
[TABLE]
To each process of problem C one can associate a process of problem D, and vice versa. Obviously, the process provides the extended weak minimality in problem C if and only if the corresponding process provides the extended weak minimality in problem D.
Therefore, we can use the fact that the process satisfies the stationarity conditions given in Theorem 9.1.
14 Stationarity Conditions for Problem C
In further transformations, we have to differentiate vector-valued and matrix-valued functions w.r.t a vector argument. To avoid cumbersome formulas in the coordinate form, let us accept the following notation. If is any tensor of a given rank (in particular, a vector or a matrix), every element of which is a smooth function of then its directional derivative along a vector will be denoted as The last one is still a tensor of the same rank and dimension, whose elements are the scalar directional derivatives of the corresponding elements along the vector .
According to Theorem 9.1, if the process provides the extended weak minimality in problem D, then there exist a Lipschitz continuous adjoint variable dimensional row vector) on a constant scalar functions and Lipschitz continuous on each interval such that and a measurable bounded function which generate the Pontryagin function
[TABLE]
and the extended Pontryagin function
[TABLE]
such that the following conditions hold:
complementary slackness
[TABLE]
adjoint equations
[TABLE]
[TABLE]
(these equalities hold for any “test” constant vectors and
transversality conditions
[TABLE]
jump conditions for the adjoint variable
[TABLE]
the energy conservation law
[TABLE]
and stationatity condition w.r.t. control
[TABLE]
Now, rewrite the obtained conditions in terms of problem C. First, denote
[TABLE]
This is a row vector of dimension Then, since condition (79) takes the form
[TABLE]
Further, multiplying by a test (constant) vector we get a scalar function
[TABLE]
(for short, we drop the arguments of and ), which time derivative is
[TABLE]
where denotes the time derivarive of the function in brackets.
Let us write the first two terms of this expression in view of equations (74) and (75) for :
[TABLE]
[TABLE]
The other two terms of (82) in view of identities are equal to
[TABLE]
Summing up the right parts of equalities (83)–(85), we get
[TABLE]
Note that the matrix is the second derivative of the scalar function hence it is symmetric. Therefore, the first and the last terms in the right hand part of obtained expression (which differ only in the positions of multipliers and ) cancel each other, and in view of relation equation (86) takes the form
[TABLE]
whence, since the test vector is arbitrary, we get
[TABLE]
If we introduce the Pontryagin function and the extended Pontryagin function for problem C, then equalities (81) and (87) transform to and respectively.
According to (77), the function has jumps at the points :
[TABLE]
The transversality conditions for take the form
[TABLE]
Finally, the complementary slackness conditions and the energy conservation law are rewritten automatically in terms of problem C.
Summarizing our findings, we come to the following
Theorem 14.1**.**
Let be an admissible process such that on on on assumption (66) holds, and let this process provide the extended weak minimality in problem C. Then there exist a constant functions Lipschitz continuous on every interval and a measurable bounded function which generate the Pontryagin function
[TABLE]
and the extended Pontryagin function
[TABLE]
such that the following conditions hold:
- (a)
non-negativity conditions
[TABLE] 2. (b)
complementary slackness
[TABLE] 3. (c)
adjoint equation
[TABLE] 4. (d)
transversality conditions
[TABLE] 5. (e)
jumps conditions for the adjoint variable
[TABLE] 6. (f)
energy conservation law
[TABLE] 7. (g)
and stationarity condition w.r.t. control
[TABLE]
Remark 14.1**.**
The performed transformation is a particular case of the general one-to-one change of variables under which problem C transforms to the following
[TABLE]
Clearly, the extended weak minimality at a process in problem C corresponds to that at the process in problem E. The multipliers in both problems are the same, the extended Pontryagin functions for problems C and E are, respectively,
[TABLE]
while the adjoint variables are connected by the following equality:
[TABLE]
The proof of this assertion is left to the reader as an excercise.
In the case of problem D, we have and hence
[TABLE]
i.e., we get exactly formula (80).
Remark 14.2**.**
For simplicity, we considered problem A with free endpoints of the trajectory. If they are restricted by terminal constrains
[TABLE]
then, to obtain stationarity conditions, one should replace the cost by the endpoint Lagrange function (with corresponding multipliers) and then apply Theorem 9.1. The same concerns problem C.
Remark 14.3**.**
We suppose that the state constraint in problem (1) or (65) is of first order, and the reference trajectory lands on the state boundary with a nonzero first time derivative, i.e., satisfies conditions (2) or (66), respectively. Perhaps, the same approach would also work in the case of higher order state constraints, if the reference trajectory lands on the state boundary with a nonzero time derivative of the corresponding order. Obviously, the technique would be then more complicated.
15 Conclusions
We consider a specific class of optimal control problems with a single state constraint of order 1 and a specific trajectory in it. Basing on the approach by R.V. Gamkrelidze, consisting in differentiating the state constraint along the boundary subarc and reducing the original problem to a problem with mixed control-state constraints, we obtain the full system of stationarity conditions in the form of A.Ya. Dubovitskii and A.A. Milyutin, including the sign definiteness of the measure, a multiplier at the state constraint. To obtain these conditions, we propose an approach of two-stage varying. At the first stage, we consider only those variations, which preserve a constant value of the state constraint along the boundary interval, and obtain preliminary, incomplete optimality conditions. At the second stage, we take into account the remaining variations, concentrated on the boundary interval, and obtain the sign definiteness of the measure, thus specifying the stationarity conditions. Two illustrative examples are given, one showing that the condition of non-negativity of the measure density is essential and another with nonzero atoms of the measure at the junction points.
Acknowledgements 15.1**.**
This research was partially supported by the Russian Foundation for Basic Research under grant No. 16-01-00585. The authors thank Nikolai Osmolovskii for useful discussions and the anonymous referees for valuable remarks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Dubovitskii, A. Ya., Milyutin, A. A.: Extremum problems in the presence of restrictions. USSR Comput. Math. and Math. Phys. 5(3), 1–80 (1965) · doi ↗
- 2[2] Girsanov I.V.: Lectures on Mathematical Theory of Extremum Problems. Springer-Verlag Berlin, Heidelberg (1972)
- 3[3] Ioffe, A.D., Tikhomirov, V.M.: Theory of extremal problems. North-Holland Publishing Company, Amsterdam, New Yourk, Oxford (1974)
- 4[4] Pontryagin, L. S., Boltyanskii, V. G., Gamkrelidze, R. V., Mishechenko, E. F.: The Mathematical Theory of Optimal Processes. John Wiley & Sons, New York/London (1962)
- 5[5] Hartl, F. H, Sethi, S. P., Vickson, R. G.: A survey of the maximum principles for optimal control problems with state constraints . SIAM Review 37(2), 181–218 (1995) · doi ↗
- 6[6] Arutyunov, A. V., Karamzin, D. Y., Pereira, F. L.: The Maximum Principle for Optimal Control Problems with State Constraints by R.V. Gamkrelidze: Revisited . J. Optim. Theory Appl. 149(3), 474–493 (2011) · doi ↗
- 7[7] Dmitruk, A. V., Osmolovskii, N. P.: Necessary conditions for a weak minimum in optimal control problems with integral equations on a variable time interval . Discrete and Continuous Dynamical Systems 35(9), 4323–4343 (2015) · doi ↗
- 8[8] Dmitruk, A. V., Kaganovich A. M.: The Hybrid Maximum Principle is a consequence of Pontryagin Maximum Principle . Systems & Control Letters 57(11), 964–970 (2008) · doi ↗
