A sharp error analysis for the discontinuous Galerkin method of optimal control problems
Woocheol Choi, Young-Pil Choi

TL;DR
This paper provides precise error estimates for the discontinuous Galerkin method applied to nonlinear optimal control problems governed by ordinary differential equations, supported by numerical validation.
Contribution
It offers sharp error bounds for the DG method of arbitrary order in nonlinear optimal control, under regularity assumptions, advancing the theoretical understanding of discretization accuracy.
Findings
Sharp error estimates for the DG method of arbitrary order
Numerical experiments confirm theoretical error bounds
Enhanced understanding of discretization errors in optimal control
Abstract
In this paper, we are concerned with a nonlinear optimal control problem of ordinary differential equations. We consider a discretization of the problem with the discontinuous Galerkin method with arbitrary order . Under suitable regularity assumptions on the cost functional and solutions of the state equations, we provide sharp estimates for the error of the approximate solutions. Numerical experiments are presented supporting the theoretical results.
| 1.9455e-03 | 6.2543e-04 | ||||
| 4.8861e-04 | 1.6088e-04 | 2.00 | 1.96 | ||
| 1.2240e-04 | 4.0780e-05 | 2.00 | 1.98 | ||
| 3.0629e-05 | 1.0264e-05 | 2.00 | 1.99 | ||
| 7.6607e-06 | 2.5748e-06 | 2.00 | 2.00 | ||
| 1.9156e-06 | 6.4477e-07 | 2.00 | 2.00 | ||
| 2.6708e-05 | 1.3269e-05 | ||||
| 3.3523e-06 | 1.6837e-06 | 2.99 | 2.98 | ||
| 4.1979e-07 | 2.1202e-07 | 3.00 | 2.99 | ||
| 5.2518e-08 | 2.6599e-08 | 3.00 | 3.00 | ||
| 6.5673e-09 | 3.3308e-09 | 3.00 | 3.00 | ||
| 8.2108e-10 | 4.1672e-10 | 3.00 | 3.00 | ||
| 2.8964e-07 | 9.5564e-08 | ||||
| 1.8172e-08 | 6.0617e-09 | 4.00 | 3.98 | ||
| 1.1377e-09 | 3.8151e-10 | 4.00 | 3.99 | ||
| 7.1152e-11 | 2.3918e-11 | 4.00 | 4.00 | ||
| 4.4370e-12 | 1.4871e-12 | 4.00 | 4.01 | ||
| 2.7555e-13 | 8.4657e-14 | 4.01 | 4.13 |
| 1.3006e-02 | 2.6587e-03 | ||||
| 4.5715e-03 | 6.8872e-04 | 1.51 | 1.95 | ||
| 1.3286e-03 | 1.7024e-04 | 1.78 | 2.02 | ||
| 3.5677e-04 | 4.2187e-05 | 1.90 | 2.01 | ||
| 9.2305e-05 | 1.0492e-05 | 1.95 | 2.01 | ||
| 2.3420e-05 | 2.6101e-06 | 1.98 | 2.01 | ||
| 7.9288e-04 | 7.1751e-05 | ||||
| 1.6928e-04 | 6.8412e-06 | 2.23 | 3.40 | ||
| 2.7566e-05 | 7.2059e-07 | 2.62 | 3.25 | ||
| 3.9391e-06 | 8.4373e-08 | 2.81 | 3.10 | ||
| 5.2676e-07 | 1.0332e-08 | 2.90 | 3.03 | ||
| 6.8107e-08 | 1.2833e-09 | 2.95 | 3.01 | ||
| 4.8978e-05 | 2.3326e-06 | ||||
| 5.8217e-06 | 2.0158e-07 | 3.07 | 3.53 | ||
| 5.0236e-07 | 1.3655e-08 | 3.53 | 3.88 | ||
| 3.6929e-08 | 8.7619e-10 | 3.77 | 3.96 | ||
| 2.5037e-09 | 5.5551e-11 | 3.88 | 3.98 | ||
| 1.6329e-10 | 3.6858e-12 | 3.94 | 3.91 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Methods in Computational Mathematics · Differential Equations and Numerical Methods · Numerical methods for differential equations
A sharp error analysis for the discontinuous Galerkin method of optimal control problems
Woocheol Choi
Department of Mathematics
Sungkyunkwan University, Suwon 16419, Korea (Republic of)
and
Young-Pil Choi
Department of Mathematics
Yonsei University, Seoul 03722, Korea (Republic of)
Abstract.
In this paper, we are concerned with a nonlinear optimal control problem of ordinary differential equations. We consider a discretization of the problem with the discontinuous Galerkin method with arbitrary order . Under suitable regularity assumptions on the cost functional and solutions of the state equations, we provide sharp estimates for the error of the approximate solutions. Numerical experiments are presented supporting the theoretical results.
2000 Mathematics Subject Classification:
65L05, 49J15, 49M25, 65L60
Contents
-
5 Existence and Convergence results for the semi-discrete case
-
6 Existence and Convergence results for the fully discrete case
-
B Gronwall-type inequality for the DG discretization of ODEs
-
D Derivations of the first order derivative of cost functionals
-
E Derivations of the second order derivative of cost functionals
1. Introduction
In the present work, we discuss discontinuous Galerkin (DG) approximations to a nonlinear optimal control problem (OCP) of ordinary differential equations (ODEs). More precisely, we consider the following optimal control problem:
[TABLE]
subject to
[TABLE]
Here is the control, and is the state of the system at time . Further, and are given, and the set of admissible controls is given by
[TABLE]
for some .
There have been a lot of study on the numerical computation for the above problem. The numerical schemes need a discretization of the ODEs, for example, the Euler discretization for the OCPs of ODEs are well studied for sufficiently smooth optimal controls based on strong second-order optimality conditions [1, 5, 6]. For optimal control problems with control appearing linearly, the optimal control may be discontinuous, for an instance, bang-bang controller, and such conditions are not satisfied. In that respect, there have been many studies to develop new second-order optimality conditions for the optimal control problems with control appearing linearly [2, 9, 12, 13].
The Pseudo-spectral method is also popularly used for the discretization due to its capability of high-order accuracy for smooth solutions to the OCPs [7, 14]. However, the high-order accuracy of the Pseudo-spectral method is known to be often lost for bang-bang OCPs, where the solutions may not be smooth enough. To handle this issue, Henriques et al. [10] proposed a mesh refinement method based on a high-order DG method for the OCPs of ODEs. The DG method discretizes the time interval in small time subintervals, in which the weak formulation is employed. The test functions are usually taken as piecewise polynomials which can be discontinuous at boundaries of the time interval, see Section 2 for more detailed discussion. We refer to [3, 8, 15] and references therein for DG methods for ODEs.
In this paper, we provide a rigorous analysis for the DG discretization applied to the nonlinear OCP (1.1)-(1.2) with arbitrary order for general functions and with suitable smoothness. Motivated from a recent work by Neitzel and Vexler [11], we impose the non-degeneracy condition (2.4) on an optimal control of the OCP (1.1)-(1.2). We obtain the existence and convergence results for the semi-discretized case and the fully discretized case. The rates of the convergence results depend on the regularity of the optimal solution and its adjoint state with the degree of piecewise polynomials mentioned above, see Section 2 for details.
It is worth noticing that the control is not required to be linear in the state equations (1.2), and the control space allows to take into account discontinuous controls. The constraints for controls are defined by lower and upper bounds. Moreover, the cost functional is also given in a general form, it may not be quadratic.
For notational simplicity, we denote by , , and . We also use simplified notations:
[TABLE]
for . Throughout this paper, for any compact set , we assume that satisfy
[TABLE]
for some .
We next introduce the control-to-state mapping , , with solving (1.2). It induces the cost functional , . This makes the optimal control problem (1.1)-(1.2) equivalent to
[TABLE]
Definition 1.1**.**
A control is a local solution of (1.4) if there exists a constant such that holds for all with .
In the proof of the existence and convergence results, the main task is to show that the strong convexity of induced by the second order condition (2.4) is preserved near the optimal control and also for its DG discretized version . It will be achieved using the second-order analysis in Section 4. As a preliminary, we also justify that and are twice differentiable, by showing the differentiability of the control-to-state mapping and its discretized version in the appendix.
In Section 2, we explain the DG discretization of the ODEs and the OCP. Then we present the main results for the semi-discretized case and provide some preliminary results. In Section 3, the adjoint problems are studied. Section 4 is devoted to study the second order analysis of the cost functionals and . In Section 5, we prove the existence of the local solution and obtain the convergence rate for the semi-discretized case. Section 6 is devoted to establish the existence and convergence results for the fully discretized case. Finally, in Section 7, we perform several numerical experiments for linear and nonlinear OCPs. In Appendix A, we obtain first and second order derivatives of the control-to-state mapping . Appendix B is devoted to prove a Gronwall-type inequality for the discretization of the ODEs (1.2) involving the control variable. It is used in Appendix C to establish the differentiability of the discrete control-to-state mapping and obtain the derivatives. In Appendix D, we prove Lemma 3.2 and Lemma 3.4, which reformulate the first derivatives of the cost functionals in terms of the adjoint states. In Appendix E, we derive the formulas on the second order derivatives of the cost functionals.
2. DG formulation
In this section, we describe the approximation of the OCP (1.1)-(1.2) with the DG method, and then we state the main results on the semi-discrete case. First we consider the discretization of the following ODEs:
[TABLE]
where , is uniformly Lipschitz continuous with respect to , i.e.,
[TABLE]
with a constant . By the Cauchy Lipschitz theorem, we have the existence and uniqueness of classical solution of (2.1).
Given an integer we consider a partition of into -intervals given by with nodes . Let be the length of , i.e., , and we set . For a piecewise continuous function , we also define
[TABLE]
We also denote the jumps across the nodes by for . For we define
[TABLE]
where represents the set of all polynomials of up to order defined on with coefficients in . Then the DG approximate solution of (2.1) is given as
[TABLE]
for all . Here denotes the inner product in , and
[TABLE]
for integrable functions .
We recall the error estimate for the DG approximation of (2.1) from [15, Corollary 3.15 & Theorem 2.6].
Theorem 2.1**.**
Let be the solution of (2.1) such that for some . Suppose that . Then there exists a unique DG approximate solution to (2.2) of order . Furthermore, we have
[TABLE]
where is determined by , , and .
Now, for given , we consider the approximate solution of the control problem (1.2) satisfying
[TABLE]
for all .
Throughout the paper, we will consider local solutions to (1.4) satisfying the following non-degeneracy condition.
Assumption 1. Let be the local solution of (1.1). We assume that it satisfies
[TABLE]
for some .
In addition, we assume that has bounded total variation, i.e., for a fixed value . Here the total variation for is defined as
[TABLE]
where is any partition .
Considering a discrete control-to-state mapping , , where is the solution of (2.3), we introduce the discrete cost functional . We consider the following discretized version of (1.1):
[TABLE]
where
[TABLE]
We now define the local solution to (2.5) as follows.
Definition 2.2**.**
A control is called a local solution of (2.5) if there exists an such that holds for all with .
In the first main result, we prove the existence of the local solution to the approximate problem (2.5).
Theorem 2.3**.**
Let be a local solution of (1.1) satisfying Assumption 1. Then, there are constants and such that for the approximate problem (2.5) has a local solution satisfying .
The second main result is the following convergence estimate of the approximate solutions.
Theorem 2.4**.**
Let be a local solution of (1.4) satisfying Assumption 1, let be the approximate solution found in Theorem 2.3, and let be the adjoint state defined in Definition 3.1 below. Assume that the state belongs to and the adjoint state belongs to for some . Then we have
[TABLE]
The above result establishes the error estimate concerning the discretization of the ODEs in the OCPs. On the other hand, to implement a numerical computation to the OCP (1.4), one need also consider an approximation of the control space with a finite dimensional space. In Section 6, we will see that the proof of Theorem 2.4 can be extended to obtain the error analysis incorporating the discretization of the control space.
3. Adjoint states
This section is devoted to study the adjoint states to the OCP (1.1) and its discretized version (2.5).
We introduce a bilinear form for and by
[TABLE]
Then, for a fixed control and initial data , a weak formulation of (1.2) can be written as
[TABLE]
for all with .
Definition 3.1**.**
For a control , we define the adjoint state as the solution to
[TABLE]
with . It satisfies the weak formulation
[TABLE]
for all with .
For , the derivative of at in the direction is defined by
[TABLE]
It is well-known that the derivative of the cost functional can be calculated with the adjoint state, as described below.
Lemma 3.2**.**
We have
[TABLE]
for all , where .
Proof.
For the completeness of the paper, we give the proof in Appendix D. ∎
Next we describe the adjoint problem for the approximate problem (2.5). For , we define
[TABLE]
For approximate solution , the equation (2.3) with control can be written as
[TABLE]
Now we define the adjoint equation for the approximate problem (2.5).
Definition 3.3**.**
The adjoint state is defined as the solution of the following discrete adjoint equation:
[TABLE]
In Appendix D, we briefly explain how the adjoint equation (3.8) can be derived from the Lagrangian related to (2.5). We also have an analogous result to Lemma 3.2.
Lemma 3.4**.**
We have
[TABLE]
where .
Proof.
The proof is given in Appendix D. ∎
In order to prove the main results in Section 2, we shall use the following lemma.
Lemma 3.5**.**
Let . Suppose that and for some . Then we have
[TABLE]
Proof.
We recall from (3.4) and (3.8) that solves
[TABLE]
and solves
[TABLE]
Here and . The estimate of is induced from Theorem 2.1 as follows:
[TABLE]
As an auxiliary function, we consider solving
[TABLE]
which is the DG discretization of (3.11) in a backward way (see Lemma 3.6 below). Then, by Theorem 2.1, we have
[TABLE]
By (3.13), we obtain
[TABLE]
and
[TABLE]
Combining these estimates with (3.12) and (3.14) we find
[TABLE]
where satisfies . This, together with Lemma B.4, yields
[TABLE]
Combining this estimate with (3.15), we find that
[TABLE]
which completes the proof. ∎
With abusing a notation for simplicity, let us define as the interval given a partition with . Also we set as the DG space with the new partition. Then we have the following lemma.
Lemma 3.6**.**
Assume that is a solution to
[TABLE]
Then defined by for satisfies
[TABLE]
Proof.
By an integraion by parts, we have
[TABLE]
which leads to
[TABLE]
We now observe that satisfies and . We also set . Then and we have . Considering , it holds that for . Using these notations, we write (3.16) as
[TABLE]
Rearranging this, we get
[TABLE]
which is the desired equation . The proof is finished. ∎
4. Second order analysis
In this section, we analyze the second order condition of the functions and , which are essential in the existence and convergence estimates in the next sections.
4.1. Second order condition for
We defined the solution mapping in the previous section. Here we present Lipschitz estimates for the solution mapping , its derivative , and the solution to the adjoint equation (3.4).
Lemma 4.1**.**
There there exists such that for all and we have
[TABLE]
and
[TABLE]
Proof.
Let us denote by and . Then it follows from (3.2) that
[TABLE]
By (1.3), there exists a constant such that
[TABLE]
Using this estimate and applying the Gronwall inequality in (4.1), we get the inequality
[TABLE]
This gives the first inequality. For the second one, if we set and , then we find from Lemma B.1 that
[TABLE]
This together with the first assertion above yields
[TABLE]
For notational simplicity, we denote by and . Then, we get
[TABLE]
with . By applying the Gronwall inequality in a backward way, we obtain
[TABLE]
where we used
[TABLE]
due to (3.3) and . This completes the proof. ∎
We now show that the second order condition of holds near the optimal local solution .
Lemma 4.2**.**
Suppose that satisfies Assumption 1. Then there exists such that
[TABLE]
holds for all and all with . Here is given in (2.4).
Proof.
Let and . By using Lemma E.1, we find
[TABLE]
where we denoted by , , , and . On the other hand, it follows from Lemma 4.1 that
[TABLE]
This together with the following estimate
[TABLE]
yields
[TABLE]
Combining this with (2.4) we have
[TABLE]
By choosing here, we obtain the desired result.
∎
As a consequence of this lemma, we have the following result.
Theorem 4.3**.**
Let satisfy the first optimality condition and Assumption 1. Then, there exist a constant such that
[TABLE]
for any with .
Proof.
Choose as in Lemma 4.2. By Taylor’s theorem, we get
[TABLE]
where for some . On the other hand, by the first optimality condition, we have
[TABLE]
Moreover, we also find
[TABLE]
Using these observations and Lemma 4.2, we conclude
[TABLE]
∎
4.2. Second order condition for
In this part, we investigate the second order condition for the discrete cost functional . Similarly as in the previous subsection, we first provide the Lipschitz estimates for and the discrete adjoint state.
Lemma 4.4**.**
Let and be given. Then, there exists , independent of , such that
[TABLE]
and
[TABLE]
Proof.
The first and the third assertions are proved in Lemma B.5. The second estimate is proved in Lemma C.2. ∎
Lemma 4.5**.**
For , let be given by the solution of the state equation (1.2), and let for . Let be the solution of the discrete state equation (3.7), and let . Then we have
[TABLE]
Proof.
Define by the solution to
[TABLE]
Recall from Lemma B.1 that satisfies
[TABLE]
Combining these two equations, we get
[TABLE]
Using the Gronwall inequality here with (4.2) and (3.13), we find that
[TABLE]
On the other hand, satisfies
[TABLE]
which is the DG discretization of (4.4) in a backward way in view of Lemma 3.6. Thus, we may use Theorem 2.1 to obtain the following error estimate:
[TABLE]
This, together with (4.5) gives us the estimate
[TABLE]
The proof is finished. ∎
Lemma 4.6**.**
For given in Lemma 4.2, there exists such that for we have the following inequality
[TABLE]
for any satisfying .
Proof.
We first claim that
[TABLE]
for small enough, where is independent of . Let , , , and . Also we let and . It follows from Lemmas E.1 and E.2 that
[TABLE]
In order to show (4.6), by using a similar argument as in the proof of Lemma 4.2, it suffices to show that there exists , independent of , such that
[TABLE]
[TABLE]
and
[TABLE]
The first and second inequalites in (4.7) hold due to Theorem 2.1 and Lemma 4.5. For the third one in (4.7) is proved in (C.2). By Lemma 3.5, the second inequality in (4.8) holds. We also find
[TABLE]
which asserts the first inequality in (4.8). Finally, we obtain
[TABLE]
due to (4.7). All of the above estimates enable us to prove the claim (4.6). This together with Lemma 4.2 yields
[TABLE]
for . The proof is finished. ∎
5. Existence and Convergence results for the semi-discrete case
We first prove the existence of the local solution to the approximate problem (2.5).
Proof of Theorem 2.3.
Choose as in Theorem 4.3. We consider the following set
[TABLE]
and recall the space We will find a minimizer of in the space , and then show that . It will imply that is a local solution to (2.5).
Since is lower bounded on , there exists a sequence such that
[TABLE]
Moreover, since is compactly embedded in for any , up to a subsequence, converges to a function in and converges a.e. to . By definition, the function satisfies
[TABLE]
for all . Note that is a bounded set in the finite dimensional space by Theorem 2.4 (see also Lemma B.4). Therefore we can find a subsequence such that converges uniformly to a function . We claim that . Indeed, since converges a.e. to for and is Lipsichtiz continuous, we may take a limit to infinity in (5.2) to deduce
[TABLE]
for all . This yields that , which enables us to derive
[TABLE]
This together with (5.1) implies that satisfies
[TABLE]
It remains to show that the minimizer is achieved in the interior of . To show this, we recall that
[TABLE]
and
[TABLE]
Since for all , we see from Theorem 2.1 that
[TABLE]
Combining this with the Lipshitz continuity of yields that
[TABLE]
Taking . Using this and the estimate
[TABLE]
from Theorem 4.3, it follows that for we have
[TABLE]
Thus, the minimizer is achieved in . It gives that for all with . ∎
We now provide the details of the convergence estimate of the approximate solutions.
Proof of Theorem 2.4.
Analogous to (4.3), the discrete first order necessary optimality condition for reads
[TABLE]
Inserting here and summing it with (4.3), we get
[TABLE]
Now, by applying the mean value theorem with a value , we have
[TABLE]
where we used Lemma 4.6 in the first inequality and (5.4) in the second inequality. For our aim, it only remains to estimate the right hand side. Let us express it using the adjoint states. From (3.5), we have
[TABLE]
and it follows from (3.9) that
[TABLE]
Here we remind that denotes the solution to (2.3) with control and initial data . Combining (5.6) and (5.7) we find
[TABLE]
Applying Hölder’s inequality here and using (1.3), we deduce
[TABLE]
Now we apply (3.10) and (3.13) to get
[TABLE]
Combining this with (5.5), we finally obtain
[TABLE]
This completes the proof. ∎
6. Existence and Convergence results for the fully discrete case
This section is devoted to the existence and convergence results for the fully discrete case. We consider a finite dimensional space which discretizes the control space , for example, the space of step functions
[TABLE]
or the high-order DG space with .
We say that is a local solution to
[TABLE]
if there is a value such that for all with .
We provide the existence result of local solution in the following theorem.
Theorem 6.1**.**
Choose as in Theorem 4.3. Let be a local solution of (1.4) satisfying Assumption 1. Fix any . Then there exists such that for problem (6.1) has a local solution such that .
Proof.
By compactness and continuity, has a minimizer in
[TABLE]
since is finite dimensional. Next we aim to show that the minimizer satisfies
[TABLE]
To show this, we recall from (5.3) that there is a value such that for we have
[TABLE]
Combining this with the minimality of for in , we find that . It then yields that
[TABLE]
Thus is a local solution of (6.1). ∎
We establish the convergence result in the following theorem.
Theorem 6.2**.**
Assume the same statements for and in Theorem 2.4. In addition, suppose that there exists a projection operator and a value such that
[TABLE]
Let be a local solution to (6.1) constructed in Theorem 6.1. Then the following estimate holds:
[TABLE]
If we further assume that , then the above estimate can be improved to
[TABLE]
Proof.
In this case, by the first optimality conditions on and , we have
[TABLE]
The latter condition can be written as
[TABLE]
where . Summing up the above two inequalities, we get
[TABLE]
i.e.,
[TABLE]
By the assumption of the theorem, we have
[TABLE]
On the other hand, by applying the mean value theorem and Lemma 4.6, we obtain
[TABLE]
Combining this with (6.2) yields
[TABLE]
Applying here the estimate (5.9) in the previous proof, we have
[TABLE]
which together with (6.3) gives the desired estimate
[TABLE]
When we further assume , we have
[TABLE]
Using this and the estimates in (5.8), we find
[TABLE]
Inserting this into (6.4) we obtain
[TABLE]
It gives the desired estimate
[TABLE]
This concludes the desired result. ∎
7. Numerical experiments
In this section, we present several numerical experiments which validate our theoretical results. We employed the forward-backward DG methods [4] to solve the examples of the OCPs.
7.1. Linear problem
Let us consider the following simple one dimensional OCP, which has been used as an example [16], that consists of maximizing the functional
[TABLE]
subject to the state equation
[TABLE]
and . Using a similar idea as in Section 3 based on the maximum principle, we can derive the adjoint equation to the above optimal control problem:
[TABLE]
Furthermore, we also find that the optimal solutions and satisfies (7.2). Thus we have the solution
[TABLE]
and
[TABLE]
For fixed , we use for the approximate space of . In Table 1, we report the discrete error between optimal solutions and its approximations for the above optimal control problem. Here is the number of grid points on each time interval , and we used the equidistant points for our numerical computations. The numerical result confirms that the error is of order as proved in Theorem 2.4.
7.2. Nonlinear problem
In this part, we consider the following nonlinear optimal control problem:
[TABLE]
subject to the state equation
[TABLE]
In this case, the corresponding adjoint equation and optimal control are given as follows.
[TABLE]
and thus the optimal solution solves
[TABLE]
In this case, since we have no explicit form of the actual solutions, we take the reference solutions (resp., ) with instead of (resp., ). In Table 2, we arrange the discrete error between reference solutions and its approximations.
Acknowledgments
Acknowledgments The work of W. Choi is supported by NRF grant (No. 2017R1C1B5076348) and Faculty Research Fund, Sungkyunkwan University. The work of Y.-P. Choi is supported by NRF grant (No. 2017R1C1B2012918) and Yonsei University Research Fund of 2019-22-021.
Appendix A Differentiability of the control-to-state mapping
In this section, we show that the control-tostate mapping is twice differentiable, and obtain the derivatives.
Lemma A.1**.**
Let and be the solution of
[TABLE]
Then we have
[TABLE]
Proof.
Recall that and satisfy
[TABLE]
respectively. Using this, we find that satisfies
[TABLE]
where
[TABLE]
and
[TABLE]
Given that and (1.3), an elementary calculus shows that and . With these bounds, we may apply the Grönwall’s lemma for (C.3) to deduce for . From this we find
[TABLE]
which yields that
[TABLE]
∎
Next we show the twice differentiablity of the mapping at .
Lemma A.2**.**
Let be the solution of
[TABLE]
Then we have
[TABLE]
Proof.
Let
[TABLE]
Then we get
[TABLE]
where
[TABLE]
[TABLE]
By Lemma 4.1 we have . Given this estimate and that
[TABLE]
from Lemma B.1, an elementary calculus shows that and . Inserting this estimate into (C.5) and applying the Grönwall’s lemma, we find
[TABLE]
It proves that
[TABLE]
This implies that
[TABLE]
since
[TABLE]
This completes the proof. ∎
Appendix B Gronwall-type inequality for the DG discretization of ODEs
In this section, we provide a Gronwall-type inequality for the DG discretization of ODEs with inputs. It will be used in Section C to establish the differentiability of the discrete control-to-state mapping .
We begin with recalling from [15, Lemma 2.4] the following lemma.
Lemma B.1**.**
Let and . Then we have
[TABLE]
for all , , where
[TABLE]
The next result is from [15, Lemma 3.1].
Lemma B.2**.**
For and , we have
[TABLE]
for all . Here is independent of , , , and .
We shall use the following Grönwall inequality.
Lemma B.3**.**
Let and be sequences of non-negative numbers satisfying and . Assume that for a value we have
[TABLE]
for . Then there exists a constant independent of and such that
[TABLE]
for any with .
Proof.
The proof can be obtained by induction. ∎
Now we obtain the Grönwall-type inequality.
Lemma B.4**.**
Suppose that
[TABLE]
for all . Then there exists a constant independent of such that
[TABLE]
for all and small enough.
Proof.
From the condition (B.1) we have
[TABLE]
for all . To obtain the desired estimates, for each we shall take the following test functions supported on given as
[TABLE]
where denotes the indicator function, that is, for and for . First we take for . Then,
[TABLE]
where for we abuse a notation to mean . Notice that
[TABLE]
where for the above is understood as Using this in (B.2), we find
[TABLE]
By applying Cauchy-Schwarz inequality, we obtain
[TABLE]
Secondly, we take to have
[TABLE]
By using Hölder’s inequality, we get
[TABLE]
Notice that
[TABLE]
Thus, choosing gives
[TABLE]
and subsequently, this yields
[TABLE]
where . This together with Lemma B.1 asserts
[TABLE]
for small enough. Combining (B.3) and (B.4), we find
[TABLE]
where we applied Lemma B.1 in the second inequality. This, together with (B.5), we obtain
[TABLE]
for small enough, where for one has . This inequality trivially gives
[TABLE]
for . Now, by applying Lemma B.3 to find an estimate of and inserting it into (B.6), we achieve
[TABLE]
Finally, by applying Lemma B.2 to the above, we obtain the desired estimate. ∎
As a corollary, we have the following Lipshitz estimates.
Lemma B.5**.**
For we have
[TABLE]
and
[TABLE]
Proof.
Let us denote by and . Then it follows from (2.3) that
[TABLE]
By (1.3), there exists a constant such that
[TABLE]
By applying Lemma B.4, we get the inequality
[TABLE]
This gives the first inequality. For the second one, we denote by and . Then, we see from Lemma 3.8 that
[TABLE]
By applying Lemma B.4 again in a backward way (see Lemma 3.6), we obtain
[TABLE]
where we used
[TABLE]
due to Lemma B.4. This completes the proof. ∎
Appendix C Differentiability of discrete control-to-state mapping
This section is devoted to prove that the discrete control-tostate mapping is twice differentiable. We also obtain the first and second derivatives of .
Theorem C.1**.**
We denote and set be the solution of the following discretized equation:
[TABLE]
where . Then we have .
Proof.
By Theorem 2.1 there exists a solution to
[TABLE]
By Lemma B.4 we get
[TABLE]
Recall that and satisfy
[TABLE]
Using this, we find that satisfies
[TABLE]
for all , where
[TABLE]
and
[TABLE]
Given that and (1.3), an elementary calculus shows that and . With these bounds, we may apply Lemma B.4 to deduce for . From this we find that
[TABLE]
which yields that
[TABLE]
This completes the proof. ∎
Lemma C.2**.**
The following holds.
[TABLE]
Proof.
Let and . Then we obtain
[TABLE]
and
[TABLE]
for all . Combining these equalities, we have
[TABLE]
for all . On the other hand, the following two inequalities hold:
[TABLE]
and
[TABLE]
Given these estimates, by applying Lemma B.4 to (C.4), we obtain
[TABLE]
where we used Lemma B.5 in the second inequality. ∎
Lemma C.3**.**
Let be the solution of the following discretized equation:
[TABLE]
for any , where is the solution of (C.1). Then we have
[TABLE]
Proof.
Let
[TABLE]
It then follows that
[TABLE]
where
[TABLE]
[TABLE]
We obtain from Lemma C.2 the estimate . Upon this estimate and that from Lemma C.1, an elementary calculus reveals that and . Putting this estimate into (C.5) and using Lemma B.4, we find
[TABLE]
This yields that
[TABLE]
and so we have
[TABLE]
since
[TABLE]
The proof is done. ∎
Appendix D Derivations of the first order derivative of cost functionals
In this part, we give the proofs of Lemma 3.2 and Lemma 3.4. Before presenting it, we shall explain how to derive the discrete adjoint equation (3.8) from the Lagrangian associated to (2.5).
Let us first write the Lagrangian of the problem (1.1) and (3.7) as follows:
[TABLE]
where the bilinear operator is given by (3.7). If we compute the functional derivatives of the above Lagrangian (D.1) with respect to the adjoint state , then leads (3.7). We now derive the equation of discrete adjoint state. Using the integration by parts, we find
[TABLE]
This enables us to rewrite the Lagrangian (D.1) as
[TABLE]
and this further implies
[TABLE]
for all , where we applied the integraion by parts for to derive the second equality. The above equality corresponds to the adjoint equation (3.8).
Proof of Lemma 3.2.
In order to compute the functional derivative of with respect to , we consider with and . If we set it follows from Lemma B.1 that satisfies
[TABLE]
with the initial condition . Recall from (3.4) that the adjoint state satisfies
[TABLE]
Then we have
[TABLE]
where we used
[TABLE]
due to (D.3), (D.4), , and . ∎
Proof of Lemma 3.4.
The proof is very similar to Lemma 3.2. We consider with and . We recall from Lemma C.1 that the function is differentiable at with
[TABLE]
where satisfies the following equation:
[TABLE]
Using this, we obtain
[TABLE]
We then take in (D.2) to get
[TABLE]
On the other hand, by using the integration by parts, we find
[TABLE]
where is appeared in (3.6). This yields
[TABLE]
due to (D.5). This together with (D.6) concludes
[TABLE]
where . ∎
Appendix E Derivations of the second order derivative of cost functionals
In this appendix, we provide details of the derivation of the second order derivative of cost functional and its discrete version .
Lemma E.1**.**
Let be the cost functional for the optimal control problem (1.1)-(1.2). Then, for and , we have
[TABLE]
Proof.
Similarly as in Appendix D, we consider with and and set . By Lemma B.1 and Lemma A.2 it follows that
[TABLE]
where is given as in (D.3) and is the solution to
[TABLE]
with the initial condition . Then we obtain
[TABLE]
On the other hand, we use (D.4) to get
[TABLE]
where we used and . By combining the above with (E.1), we have
[TABLE]
This completes the proof. ∎
Next we proceed the similar calculation for the approximate solution.
Lemma E.2**.**
Let be the discrete cost functional for the optimal control problem (1.1)-(1.2). Then, for and , we have
[TABLE]
Proof.
Similarly as in the proof of Lemma 3.4, we consider with and and set . We recall from Theorem C.1 and Theorem C.3 that
[TABLE]
where satisfies
[TABLE]
Now a straightforward computation gives
[TABLE]
Note that the discrete adjoint state satisfies
[TABLE]
for all . Thus by considering , we find
[TABLE]
Combining the above equalities, we have
[TABLE]
This completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. Alt, On the approximation of infinite optimization problems with an application to optimal control problems, Appl. Math. Optim., 12, (1984), 15–27.
- 2[2] W. Alt, U. Felgenhauer, and M. Seydenschwanz, Euler discretization for a class of nonlinear optimal control problems with control appearing linearly, Comput. Optim. Appl., 69, (2018), 825–856.
- 3[3] M. Baccouch, Analysis of a posteriori error estimates of the discontinuous Galerkin method for nonlinear ordinary differential equations, Appl. Numer. Math., 106, (2016), 129–153.
- 4[4] M. Delfour, W. Hager, and F. Trochu, Discontinuous Galerkin methods for ordinary differential equations, Math. Comp., 36, (1981), 455–473.
- 5[5] A. L. Dontchev and W. W. Hager, Lipschitzian stability in nonlinear control and optimization, SIAM J. Control Optim., 31, (1993), 569–603.
- 6[6] A. L. Dontchev and W. W. Hager, The Euler approximation in state constrained optimal control, Math. Comp., 70, (2000), 173–203.
- 7[7] G. Elnagar, M. A. Kazemi, M. Razzaghi, The pseudospectral Legendre method for discretizing optimal control problems, IEEE T. Automat. Contr., 40, (1995), 1793–1796.
- 8[8] D. Estep, A posteriori error bounds and global error control for approximation of ordinary differential equations, SIAM J. Numer. Anal., 32, (1995), 1–48.
