The regular indefinite linear quadratic optimal control problem: stabilizable case
Marijan Vukosavljev, Angela P. Schoellig, and Mireille E. Broucke

TL;DR
This paper extends the theory of indefinite linear quadratic optimal control to stabilizable systems, providing explicit solutions and conditions for optimal control existence in a more general setting.
Contribution
It generalizes previous controllable case results to stabilizable systems, explicitly characterizing the unique algebraic Riccati equation solution.
Findings
Explicit characterization of the unique Riccati solution.
Necessary and sufficient conditions for optimal control existence.
Extension of control theory to stabilizable systems.
Abstract
This paper addresses an open problem in the area of linear quadratic optimal control. We consider the regular, infinite-horizon, stability-modulo-a-subspace, indefinite linear quadratic problem under the assumption that the dynamics are stabilizable. Our result generalizes previous works dealing with the same problem in the case of controllable dynamics. We explicitly characterize the unique solution of the algebraic Riccati equation that gives the optimal cost and optimal feedback control, as well as necessary and sufficient conditions for the existence of optimal controls.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStability and Controllability of Differential Equations · Numerical methods for differential equations · Stability and Control of Uncertain Systems
\newsiamthm
problemProblem \newsiamthmdefnDefinition \newsiamthmassumAssumption \newsiamthmremarkRemark
\headersIndefinite linear quadratic optimal control Marijan Vukosavljev, Angela P. Schoellig, and Mireille E. Broucke
The regular indefinite linear quadratic optimal control problem: stabilizable case††thanks: Published electronically Feb. 20 2018.
\fundingSupported by the Natural Sciences and Engineering Research Council of Canada (NSERC).
Marijan Vukosavljev Dept. of Electrical and Computer Engineering, University of Toronto, Canada (, ). [email protected]
Angela P. Schoellig University of Toronto Institute for Aerospace Studies, University of Toronto, Canada (). [email protected]
Mireille E. Broucke22footnotemark: 2
Abstract
This paper addresses an open problem in the area of linear quadratic optimal control. We consider the regular, infinite-horizon, stability-modulo-a-subspace, indefinite linear quadratic problem under the assumption that the dynamics are stabilizable. Our result generalizes previous works dealing with the same problem in the case of controllable dynamics. We explicitly characterize the unique solution of the algebraic Riccati equation that gives the optimal cost and optimal feedback control, as well as necessary and sufficient conditions for the existence of optimal controls.
keywords:
linear quadratic optimal control, indefinite cost functional, stability-modulo-a-subspace, stabilizability
{AMS}
93C05, 93C35
1 Introduction
In this paper we consider the regular, infinite-horizon linear quadratic optimal control problem in which the cost functional is the integral of an indefinite quadratic form. The regular linear quadratic (LQ) problem, when the quadratic form in the cost functional is positive definite in the control variables, has been studied extensively in the literature [3, 22, 2]. It has been especially well studied under the standard assumption, the so-called positive semidefinite case, when the quadratic form in the cost functional is positive semidefinite in the control and state variables simultaneously. The more general indefinite case imposes no definiteness condition in the control and state variables simultaneously [19, 17]. The LQ problem is termed infinite-horizon if the cost functional is integrated over time from zero to infinity. Finally, the most typical treatment of the LQ problem is the fixed-endpoint problem where the state is required to converge to zero as time tends to infinity. The case when no such condition is imposed has also been studied and is referred to as the free-endpoint problem [17, 16, 8]. In fact, an entire family of LQ problems can be obtained by requiring that the state converges to a subspace. This so-called stability-modulo-a-subspace family of LQ problems includes the fixed- and free-endpoint problems as special cases [16, 8]. For the remainder of the paper, we restrict our attention to the regular and infinite-horizon versions of the problem, for otherwise the optimization problem may yield optimal controllers that are not static linear state feedbacks [20, 2]. Also, we focus on stability-modulo-a-subspace, since it is the more general case.
Traditionally, a complete solution of any variant of the LQ problem requires to find necessary and sufficient conditions for the existence of a finite optimal cost and optimal controls. Existence of a finite optimal cost is called well-posedness, while existence of an optimal control is called attainability. Further, when they exist, a complete solution involves determining the optimal cost and an optimal control. Both should be expressed in terms of the given problem data; that is, the system matrices, the instantaneous cost matrices, and the desired subspace.
In the regular, infinite-horizon, fixed-endpoint, positive semidefinite case, the LQ problem was fully resolved in 1968 by Wonham [21, 22], resulting in the well known necessary and sufficient conditions involving stabilizability and detectability. The corresponding free-endpoint LQ problem was fully characterized much later [6, 18], resulting in conditions involving output stabilizability, a condition less strict than stabilizability [6, 18]. In the regular, infinite-horizon, indefinite case, the fixed-endpoint problem was solved in 1971 by Willems [19], while the free-endpoint problem and general stability-modulo-a-subspace were addressed in 1989 by Trentelman [17, 16]. Importantly, all of the indefinite cases made use of the assumption that the dynamics are controllable. Moreover the solutions are incomplete in that only sufficient conditions for the existence of a finite optimal cost were given (except for the fixed-endpoint problem). The main contribution of this paper is to extend the above results for the regular, infinite-horizon, stability-modulo-a-subspace, indefinite case of the LQ problem. Rather than assuming controllability, we only require stabilizability.
It is well known that in both the positive semidefinite and indefinite cases of the regular, infinite-horizon, stability-modulo-a-subspace LQ problem, the optimal cost and optimal controls are given in terms of a particular solution of the algebraic Riccati equation (ARE) [18, 17]. In the treatment of the regular, infinite-horizon, indefinite LQ problem, the controllability assumption is crucial in order to utilize the geometry of the set of all real symmetric solutions of the ARE [19, 11]. In particular, if this solution set is nonempty, there exist a maximal and minimal solution of the ARE [11]. The regular, infinite-horizon, fixed-endpoint LQ problem, both definite and indefinite cases, has always been easier in the sense that the optimal cost and feedback control law are given in terms of the maximal solution, which is the only solution that can stabilize the closed-loop system [19, 21]. For the regular, infinite-horizon, stability-modulo-a-subspace, indefinite case and under the assumption of controllability, the optimal cost and feedback control law are given by a real symmetric solution to the ARE that depends on both its maximal and minimal solutions [16]. In contrast, under the stabilizability assumption, it is unclear which solution of the ARE to select because the geometry of the set of all real symmetric ARE solutions is less well-behaved. In particular, the minimal solution may no longer exist [10, 11]. This ambiguity of the correct choice of ARE solution for the regular, infinite-horizon, stability-modulo-a-subspace, indefinite LQ problem under merely stabilizable dynamics was discussed by Geerts [7, 8], but it has remained elusive.
In this paper we give the exact form of the optimal feedback that solves the regular, infinite-horizon, stability-modulo-a-subspace, indefinite LQ problem under stabilizable dynamics. Thus we resolve the ambiguity regarding which solution of the ARE to take. Our result requires two assumptions, which are precisely our sufficient conditions for well-posedness: existence of a negative semidefinite solution to the algebraic Riccati inequality (ARI) and stabilizability of the system dynamics. These assumptions may be compared to the sufficient conditions for well-posedness in [17]: existence of a negative semidefinite solution to the ARE and controllability of the system dynamics. The first assumption on existence of a negative semidefinite solution of the ARE or ARI provides for a lower bound on the value function, based on a result of Molinari [12]. Our generalization to the ARI is based on an observation by Geerts [7]. The generalization to the case when the dynamics are stabilizable proves to be the more difficult challenge, as discussed above. This extension constitutes the central contribution of the paper. Finally, we give necessary and sufficient conditions for optimal controls to exist, which, as pointed out in [17], are nontrivial for regular, infinite-horizon, non-fixed-endpoint, indefinite LQ problems.
As a further validation of the correctness of our results, we recover known results for other variants of the regular, infinite-horizon LQ problem by adding assumptions to match those problems. In the regular, infinite-horizon, stability-modulo-a-subspace, indefinite case, if we assume controllable dynamics, we obtain the same necessary and sufficient conditions for the existence of optimal controls, the same form of the optimal cost, and the same form of the optimal control as stated in [19, 17, 16]. In the regular, infinite-horizon, positive semidefinite LQ problem, for both the fixed- and free-endpoint cases, if we assume positive semidefineness, then we again obtain the same necessary and sufficient conditions for the existence of optimal controls, the same form of the optimal cost, and the same form of the optimal control as stated in [18].
Our resolution of the gap in the LQ literature provides more than just an answer to an academic question. Recently, the work in [13] considered a linear term in the state of the cost functional and a free-endpoint objective, albeit over the finite-horizon; with a transformation, this cost can be converted to an indefinite problem with stabilizable but not controllable dynamics. The gap was also recently discussed in [4], which deals with the cooperative indefinite LQ problem. As such, our result has application to game theoretic formulations and economics.
The outline of this paper is as follows. In the remainder of this section we will introduce most of the notational conventions that will be used. In Section 2 we present the problem statement. In Section 3 we summarize the key ingredients needed regarding the geometry of the ARE solutions. In Section 4 we state and prove our main results. In Section 5 we compare our main result to existing results in the literature.
Notation. We use the following notation. Let be the identity matrix (the subscript is omitted if the dimension is clear from the context). Let denote the (unique) pseudo-inverse of . The set of eigenvalues of is denoted by . A subspace is invariant if . We use the following subsets of the complex plane: , , and . Given a real monic polynomial there is a unique factorization into real monic polynomials with , , and having all roots in , , and , respectively. Then if and if is its characteristic polynomial, then we define the spectral subspaces , , and . Each of these subspaces are invariant and the restriction of to has characteristic polynomial . For two subspaces and , let denote their direct sum and let denote that they are isomorphic. For an arbitrary matrix and subspace we define the subspace , and by further writing for some we also define . For a linear time-invariant system, , the controllable subspace will be denoted in the usual way . If there is an output , then denotes the unobservable subspace of . If is a real matrix and is a subspace of , then . If is a subspace of then denotes its orthogonal complement with respect to the standard Euclidean inner product.
Let and . Additionally, given a function , the statement that exists in means that is either equal to a real number, , or in the usual sense.
We denote the space of all measurable vector-valued functions on that are locally square integrable as . Let denote the function giving the minimum Euclidean distance from a point to a set .
Given a quadratic form on , , it is said to be positive definite if for all , , and if and only if ; positive semidefinite if for all , ; negative definite if is positive definite; negative semidefinite if is positive semidefinite; and indefinite if is neither positive semidefinite nor negative semidefinite. Writing for some symmetric matrix , we say that the matrix is positive definite if the quadratic form is positive definite and so on. We write , , , and if the matrix is positive definite, positive semidefinite, negative definite, and negative semidefinite, respectively. Given symmetric matrices , we write if , and likewise for the other inequalities. Let denote a subset of the set of all symmetric matrices in . We say that * () is the maximal (minimal) element on * if () and for all , (). The maximal and minimal elements, which are called the extremal elements on , are unique if they exist since forms a partially ordered set.
2 Problem Statement
We consider the linear control system
[TABLE]
where and . For a control function , let denote the state trajectory of (1) starting at . Then for , the cost function is
[TABLE]
with a quadratic instantaneous cost
[TABLE]
We allow to be indefinite, whereas . More general quadratic cost functions can be considered, but they can be converted via a feedback transformation to the form we use here, as in Chapter 10 of [18]. This feedback transformation does not affect solvability of the problem; hence, there is no loss of generality in our choice of .
Because may be indefinite, we define the set of control inputs that yield a cost that is either finite, , or :
[TABLE]
Let be a subspace. The set of permissible control inputs such that the state asymptotically converges to is
[TABLE]
For , we define
[TABLE]
We define the optimal cost or value function to be
[TABLE]
Now we define the linear quadratic optimal control problem with stability-modulo- .
Problem 2.1** ().**
*Consider the system (1) with the quadratic cost criterion (2). Let be a given subspace. For all , find the optimal cost and an optimal control such that . *
The is called regular (as opposed to singular) if . It is called positive semidefinite if is positive semidefinite on , and indefinite otherwise. If , the is called a free-endpoint problem, and if , it is called a fixed-endpoint problem. We are particularly interested in characterizing two properties of the .
Definition 1**.**
*We say the is well-posed if for all , . We say the is attainable if for all , there exists a control such that . Such an input is called optimal. We say the is solvable if it is both well-posed and attainable. *
3 Preliminaries
The main results on the are centered on the algebraic Riccati equation (ARE):
[TABLE]
The algebraic Riccati inequality (ARI) is given by . For convenience, we define
[TABLE]
Also we define the following solution sets:
[TABLE]
The geometry of the solutions to the ARE can be studied in both the controllable and stabilizable cases; see, in particular, Chapters 7 and 8 of [11] and also [17]. First we consider the case when is controllable. The next result summarizes what is known about the extremal solutions in and in .
Theorem 3.1**.**
Suppose is controllable.
- (i)
If , then the maximal and minimal solutions in exist, , its maximal and minimal solutions exist, and they are identical to the maximal and minimal solutions in .
- (ii)
If , then its maximal and minimal solutions satisfy: , . Moreover, they are the unique solutions in such that and .
Proof 3.2**.**
*The first statement is Theorem 14(b) in [14]. The second statement was proved in [19]. See also Theorem 7.5.1, p. 168, in [11]. *
If , define the gap of the ARE to be . Let denote the set of all invariant subspaces contained in . The following theorem was first proven by Willems [19]; see also [11].
Theorem 3.3** (Theorem 3.1, [17]).**
Let be controllable and suppose . If , then . There exists a bijection defined by
[TABLE]
*where is the projection onto along . If , then , , and . *
An application of Theorem 3.3 is the main result of [16], which provides a solution of the when is controllable. To state the sufficient condition for well-posedness, an additional definition is needed from [16]: for a given subspace and symmetric matrix , is said to be negative semidefinite on if for all , , and if and only if . Notice that implies that for all , is negative semidefinite on . To see this, fix and note that implies that there exists for some such that . Then for all , obviously , implies , and
[TABLE]
Theorem 3.4** (Theorem 4.1, [16]).**
Let be controllable. Assume and is negative semidefinite on . Then we have
- (i)
For all , is finite. 2. (ii)
For all , , where and . 3. (iii)
For all , there exists an optimal input if and only if . 4. (iv)
If , then for each , there exists exactly one optimal input , and it is given by the feedback .
This paper can be regarded as a generalization of the previous result to the stabilizable case. That is, we require weaker assumptions for the sufficient condition of well-posedness to be able to provide the form of the value function, necessary and sufficient conditions for attainability, and the form of the optimal control. Our new assumptions involve the stabilizability of rather than controllability, and the existence of a negative semidefinite solution to the ARI rather than imposing that specifically , a solution to the ARE, is negative semidefinite on . Because necessary and sufficient conditions for well-posedness are still an open problem, note that we have not attempted to generalize our second condition in terms of the existence of an ARI solution that is negative semidefinite on . Regardless, the main technical obstacle is that there is no direct generalization of Theorem 3.3 to the stabilizable case; indeed the minimal solution may not exist in this case.
Now supposing that is stabilizable, we can write the system (1) in the Kalman controllability decomposition. Let be the controllable subspace with dimension . Also, let be any complement such that
[TABLE]
Then the system matrices have the block form:
[TABLE]
It can be shown that coordinate transformations only affect the solutions of the (in any endpoint case) up to a congruent transformation, so there is no loss of generality to assume that already has the form (13). If we write the symmetric matrices and in block form
[TABLE]
then also can be decomposed in block form:
[TABLE]
We note that is symmetric, and is defined below in (17). Let
[TABLE]
Then gives rise to three equations
[TABLE]
The first equation (17) is a quadratic equation with controllable. Its solutions are decoupled from and , so this lower order () ARE equation can be solved first. The relevant solution sets are denoted as:
[TABLE]
Using any solution , if it exists, (18) is a linear (Sylvester) equation for which may have no solutions, infinitely many solutions, or a unique solution. The third equation (19) is also a linear (Sylvester) equation. Using any solution , if it exists, gives a unique solution to . To see this, recall that if , , and are given matrices, then the Sylvester equation has a unique solution exactly when [5]. Because stabilizability of implies , then by applying the Sylvester solvability criteria to (19), we have that , and so is unique for any given .
In preparation for characterizing the existence and form of the value function analogously to Theorem 3.4 (i) and (ii), we consider existence of extremal solutions in . It is known that when is stabilizable, then the maximal solution exists, whereas the minimal solution may not exist.
Theorem 3.5** (Theorem 2.1, [10]; Theorem 7.9.3, p. 195, [11]).**
*Suppose is stabilizable and . Then the unique maximal solution exists. Moreover, . *
To obtain a generalization of Theorem 3.4 to the stabilizable case, one of the major steps in the sequel is to apply Theorem 3.4 to the controllable subsystem and its ARE (17). Theorem 3.4 requires that the minimal solution of (17) exists and is negative semidefinite on within the controllable subspace. The following lemma provides for the existence of this minimal, negative semidefinite solution.
Lemma 3.6**.**
*Suppose is stabilizable, , and the state space is decomposed as in (12). Then the minimal solution exists. *
Proof 3.7**.**
*Let so that and . Consider , , and in block form (14)-(15). Applying Theorem .1 to both and , we obtain and , which implies . Since also is controllable, we can apply Theorem 3.1(i) to conclude , the maximal and minimal solutions, exist. Moreover and its maximal and minimal elements are precisely and . Because , is minimal, and , we have that . That is, , as desired. *
4 Solution of the
In this section we present the solution of the . That is, we give sufficient conditions for well-posedness, the form of the value function, necessary and sufficient conditions for attainability, and form of the optimal control. We assume that is a given subspace. Well-posedness and the form of the value function are addressed through the following sufficient condition, which are also found in [7, 8].
Assumption 2**.**
*We assume that is stabilizable and . *
The following theorem states that the value function is given in terms of a quadratic form of a particular solution to the ARE.
Theorem 4.1** (Theorem 2.1 [7], Lemma 5 [12]).**
*Consider the and suppose Assumption 2 holds. Then there exists a unique such that for all , . *
Next we turn to the form of . Our approach is to choose a suitable basis based on the Kalman controllability decomposition (12) and on Theorem 3.3, following the same method in [17]. Then we systematically determine each of the blocks of . First we determine using results from [16]; second, we compute assuming is known; finally, we compute assuming is known. Now we give a more detailed roadmap on how the technical results are obtained.
The choice of is resolved by applying Theorem 3.4 to the controllable subsystem. We construct a smaller optimal control problem on the controllable subsystem. Intuitively, the smaller optimal control problem should be equivalent to the original for initial conditions in the controllable subspace. After proving this equivalence, we apply Theorem 3.4 to obtain , where is defined in (22) below. Next, we fix the choice of that solves (17) and turn to the solution set of (18). Generally, this linear Sylvester equation may have an infinite number of solutions, making the choice of nontrivial to determine. However, once is determined, then is uniquely determined from the linear Sylvester equation (19), since is stabilizable. Thus is the main obstacle. Interestingly, under a restrictive regularity assumption introduced in [10], the solution set of (18) collapses to a single element. On the other hand, Theorem 4.1 states that exists without the regularity assumption. We forego the assumption and search for a more general principle that can resolve the choice of .
Our approach involves exploiting the structure within the Kalman controllability decomposition, similarly as in [17]. Based on a modal decomposition of , the Sylvester equation (18) with splits into three decoupled linear Sylvester equations (34)-(36). The problematic part of , denoted is then isolated to (34) only. Regarding the solution of (34), it is well known (see Theorem 10.13 of [18]) that for stabilizable systems with positive semidefinite cost in the free endpoint case, the solution of the ARE is given by the smallest positive semidefinite solution in . Also, if and only if (see for example equation (1.16) of [8]) and so and gives a lower bound on the value function. Using the previous two observations, we find through repeated trials that in the positive semidefinite case. At this point we make a guess that the same form of would arise in the indefinite case. Finally, we unambiguously deduce that .
Once we have fully characterized the form of , obtaining necessary and sufficient conditions for attainability follows analogously to the proof presented in [17, 16]. We require only a few augmentations to account for the uncontrollable (but stable) dynamics. Now we proceed to the actual development.
The first step is to fix a suitable basis so that the blocks of can be computed. Consider the Kalman controllability decomposition (12), and suppose Assumption 2 holds. Then by Lemma 3.6, the unique minimal solution exists and . Similarly, because is controllable and , we can apply Theorem 3.1 to obtain the unique maximal solution . Let be the gap associated with (17), the ARE in the controllable subspace. Following [17, 16], we can further decompose the controllable subspace based on Theorem 3.3. To that end, define the following subspaces of :
[TABLE]
Here and for the remainder of this section, for simplicity we do not notationally differentiate a subspace that can belong to various vector spaces of different dimensions. For example, although technically , we can view as a subspace of .
Let be the projection onto along . Because is an -invariant subspace contained in for any , we can apply Theorem 3.3 to obtain a solution of the ARE of the form
[TABLE]
Following Theorem 3.3, define the following subspaces in :
[TABLE]
Then the state space decomposition (12) splits further into
[TABLE]
Let for so that . Without loss of generality (after a change of coordinates), the system matrices have the block form
[TABLE]
The cost matrix and each have the block form
[TABLE]
Our goal is to compute all of the blocks in (28) for . First we resolve the choice of .
Theorem 4.2**.**
*Consider the and suppose Assumption 2 holds. Then in the state space decomposition (12), , as given in (22). *
Proof 4.3**.**
Since is stabilizable, without loss of generality, has the form (13), and and have the block form (14). Defining , the Kalman controllability decomposition is
[TABLE]
The controllable subspace is . If , then for all , and . Thus, we can define a new on with dynamics , , and is controllable. The cost function is with . Let be the terminal subspace and let be the distance function. The input spaces are
[TABLE]
The optimal cost is . The ARE for the is as in (17) with solution set . Consider any initial condition and any control . Then and , so for all , . Consequently, we have . Also, is equivalent to . Thus . With all the above, we conclude that for .
*Since is controllable, we can apply the results of [16] to solve the . Since , we can apply Lemma 3.6 to get that the minimal solution exists. Since , from (11) it follows that is negative semidefinite on . By Theorem 3.4(ii), with given in (22). Since we have already shown that for , it can be easily shown that . *
To resolve the remaining blocks of , we recall some results from [17]. For this to apply, we continue to assume that the state space is decomposed according to (26). It was shown in (5.5) and (5.7) of [17] that in (22) and the closed-loop system matrix using have the form
[TABLE]
where , , and . For the choice of and substituting (27), (28), and (33), the second ARE equation (18) splits into three linear Sylvester equations:
[TABLE]
Using these facts, we can now resolve the remaining blocks of . The main difficulty is that (34) may have an infinite number of solutions for the block since is not necessarily empty. The key insight is that can be unambiguously determined by invoking Theorem 4.6(ii) given below, that any negative semidefinite solution to the ARI provides a lower bound to the value function. In order to utilize this property to resolve the choice of , the next lemma describes the block structure of any .
Lemma 4.4**.**
Suppose Assumption 2 holds and the state space is decomposed as in (26). Then for all , has the block form
[TABLE]
Proof 4.5**.**
Let have the block form in (28). Since and is stabilizable, we can apply Lemma 3.6 to obtain that the minimal solution exists. Also . Because , by Theorem .1 we establish that its upper left block satisfies . Since is controllable and , we can apply Theorem 3.1(i) to get that the maximal solution also exists. Moreover, Theorem 3.1(i) also implies that , and consequently . Since , it has been shown (see equation (5.6) in [17] and equation (5.4) in [16]) that , , and have the block form
[TABLE]
where , , and . Now consider in block form, assuming the decomposition of in (37). We have
[TABLE]
Using Theorem .1, we find . Since by assumption, . Applying Theorem .1 to , we get . Thus . Now consider again with the information that :
[TABLE]
where we have and as in (37). We claim that , , , and . First, we have
[TABLE]
Applying Theorem .1 again, we get , so that and . Then reduces to
[TABLE]
which implies by Theorem .1 that
[TABLE]
Similarly, gives
[TABLE]
Applying Theorem .1 to the previous two statements, we get , so . Then rewriting the previous inequality (38)
[TABLE]
Applying Theorem .1, we get , so . So far we have for
[TABLE]
Then has the block form:
[TABLE]
*Applying Lemma .2, we get . *
In the next result we completely characterize the form of . Before proceeding with this result, we collect some well known results about the cost function.
Theorem 4.6**.**
Consider the system (1) with the cost function (2) - (3). Let , , and .
- (i)
Let . Then , where . 2. (ii)
For all and , . 3. (iii)
Suppose Assumption 2 holds. If , then and .
Proof 4.7**.**
*Statement (i) is standard. See for instance [19] or [17]. Statement (ii) is Proposition 1.8 of [7]. See also Lemma 4.4 of [17]. Statement (iii) is Theorem 2.8(c) of [7]. See also the proof of Theorem 5.1(iii) in [17]. *
Theorem 4.8**.**
Consider the and suppose Assumption 2 holds. Then in the state space decomposition (26), has the form
[TABLE]
*where is the unique solution to (35), is the unique solution to (36), and is the unique solution to (19) with . *
Proof 4.9**.**
By Theorem 4.2, with the form of given in (22). By Theorem 4.1, . Next we consider (18). Using the decompositions above and with the choice , the second ARE equation (18) splits into (34), (35), and (36). Since , , and , (35) and (36) have unique solutions and , respectively [5]. Similarly, (19) has a unique solution , assuming . At this point we know that has the block form:
[TABLE]
Comparing to (39), it remains only to show that . By Theorem 4.1, . Let . By Theorem 4.6(ii), for all , ; that is, . Using the block form of in (40) and the block form of in Lemma 4.4, we have
[TABLE]
*Applying Lemma .2 yields that , as desired. *
Remark 4.10**.**
We observe from the form of that . If we substitute into (34), we get that . One can derive the fact that via a separate argument, and this provides an independent validation of our result that . Suppose Assumption 2 holds. Take any symmetric with the special form:
[TABLE]
We decompose and as in (27). Using a result analogous to equation (5.2) in [17], it can be shown that is -invariant, and this implies . Then by direct computation has the form:
[TABLE]
Now choose the upper left block of the above to be . By (37) this choice is consistent with the form of above. Since the upper left block of is written as and we know that , it immediately follows that , , and . Next, since , let . By Lemma 4.4, has the special form above. Then we have
[TABLE]
*By applying Lemma .2, we conclude that . *
We conclude this section by applying Theorem 4.8 to obtain necessary and sufficient conditions for attainability of the . Remarkably, the attainability result depends only on the controllable subspace.
Theorem 4.11**.**
*Suppose Assumption 2 holds and the state space is decomposed as in (12). Then the is attainable if and only if . *
Proof 4.12**.**
Due to Assumption 2, we may further assume that the state space is decomposed according to (26). Let be a matrix such that and let be the distance function in to . Since , we have . We claim
[TABLE]
Proof of Claim: Let . Then . Also since , in coordinates it has the form . Then . Since is arbitrary, we get , as desired.
* Suppose the is attainable. Let . By definition there exists such that . By Theorem 4.1 we know where , and by Theorem 4.8, is given in (39). Now we can apply Theorem 4.6(iii) to get . The closed-loop dynamics are . Let according to the decomposition (26). Then using the block form of in (33), we have*
[TABLE]
where , , , and by stabilizability, . Using the variation of constants formula we get that at
[TABLE]
Since , . Using (43) for , , and the fact that , we also get . Now using (39), the block form of given in (37), and the fact that , we have
[TABLE]
Using this expression combined with the fact that , , and from Theorem 4.6(iii), we get
[TABLE]
Now we observe that because and . Returning to (45), this implies that also .
We have assumed that and is stabilizable. Therefore, , and thus within the controllable subspace . Since , . Meanwhile by Lemma 3.6, . Since , by taking the limit in (11) we have that . Overall, we have . Using (41), this gives . We already know that , so we get . However, and is arbitrary, so . Finally, we observe that if , then since . That is, . In sum, we have
[TABLE]
* Suppose that . Let . To show attainability, we must find an optimal control. Consider the candidate , where is given in (39). We must show and . The closed-loop dynamics using are given in (42). Following the same arguments as above we have that and . By assumption, . From above, . We claim that . To see this, let . Then . Since we have . Since is arbitrary, . Using the block form of in (37), we have*
[TABLE]
This implies . Now we observe by Theorem 4.1 and for any fixed . Therefore, we can apply Theorem 4.6(i) with and to get
[TABLE]
We claim that . Using the expansion of given in (44), and the fact that and , we get . Using the available information about the block form of and that , we find
[TABLE]
Returning to (47), we have , as desired.
Finally, we must show , and particularly . Since , we have that
[TABLE]
*Thus, , which implies . Since and , we have . Thus, , as desired. *
We collect all of the previous results to obtain the culminating result on the solution of the . It is a generalization of Theorem 3.4 for the case of controllable to the case when is stabilizable.
Theorem 4.13**.**
Consider the . Suppose Assumption 2 holds and the state space is decomposed as in (12). Then we have
- (i)
The problem is well-posed. 2. (ii)
For all , . 3. (iii)
For all , the problem is attainable if and only if . 4. (iv)
If the problem is attainable, then for each , there exists exactly one optimal input , and it is given by .
Proof 4.14**.**
*Statements (i) and (ii) follow from Theorem 4.1. The form of follows from Theorem 4.8. Statement (iii) is an immediate consequence of Theorem 4.11. Statement (iv) follows from Theorem 4.6 (iii). *
5 Discussion
In this section we discuss several special cases of our main result. This includes a comparison with classical results in the positive semidefinite case. First, we consider the special case when which was also treated in Theorem 6.1 of [17]. From our experience it is only in exceptional cases that . The following result shows that when , then , the maximal solution in . This result has practical significance because there are many powerful algorithms for numerically finding the maximal solution of the ARE.
Theorem 5.1**.**
*Consider the , suppose that Assumption 2 holds, and that the state space decomposed as in (12). Then if and only if , where is the maximal solution. *
Proof 5.2**.**
(Only if) Suppose . By Theorem 4.8, , where . By assumption, , and then (22) gives , where is the maximal solution in . By Theorem 3.1(ii), we also know is the unique maximal solution such that . Furthermore, by stabilizability, . Therefore, , so is the unique solution of the Sylvester equation (18). Similarly, since , is the unique solution of the Sylvester equation (19).
Meanwhile, since , by Theorem 3.5, the maximal solution exists and satisfies . We claim . Let in block form. Since , we have that . Using (13), . Then since , we have . However, by Theorem 3.1(ii), and together imply , the unique maximal solution in . It immediately follows that and , as desired.
*(If) Suppose , the maximal solution in . By writing in block form, , we have . We also have that is the maximal solution in using an argument analogous to the one above. That is, using (13), . By Theorem 3.5, . Since , we get . Then by Theorem 3.1(ii), . Meanwhile by Theorem 4.2, . Putting this altogether, we have that . Finally, using in (22) gives that , so . *
Next we discuss how Theorem 4.13 recovers well known results for the free-endpoint and fixed-endpoint problems when is positive semidefinite and is stabilizable. First, we observe that when , then so . Therefore, Assumption 2 holds. We also assume that the state space is decomposed as in (26) wherever needed.
The main results on the free endpoint problem are summarized in Theorem 10.13 in [18]. In particular, when , , where is the smallest positive semidefinite solution to the ARE, and the optimal control is . We would like to verify that our Theorem 4.13 recovers these results. We will show that when , given in (39) satisfies . To aid in this endeavor, we invoke a result from [17]. Let .
Theorem 5.3** (Theorem 6.3 [17]).**
*Assume is controllable and . Then the following hold: if , then (i) and (ii) implies . *
Lemma 5.4**.**
*Consider the . Suppose is stabilizable, , and . Then . *
Proof 5.5**.**
We begin by applying Theorem 5.3 to show that is the smallest solution in . To that end, we must show that and . First, since Assumption 2 holds, we can apply Lemma 3.6 to get exists, so . Second, because , we know , so . By Theorem 4.1, . Applying Theorem 4.6(ii) with , we get , for all , so . That is, . By Theorem .1, this implies , so . Now we can apply Theorem 5.3 to get is the smallest solution in .
It remains to show that is the smallest solution in . To arrive at a contradiction, suppose there exists such that and . There are two cases. First, suppose with such that , where is the upper left block of . Since , , so , implying . By Theorem .1, implies , so . Again by Theorem .1, implies . Thus, we have such that , which contradicts that is the smallest solution in .
For the second case, suppose with such that . By (33), has the form
[TABLE]
*Since , we can apply Lemma .2 to find that . Then since , , and , the solutions for and are unique and match and , respectively. Thus , a contradiction. We conclude that is the smallest solution in . This proves that for the free endpoint case when that . Also, Theorem 4.13 (iv) gives the optimal control since . *
Next we consider attainability in the free endpoint case. Since Assumption 2 holds, we can apply Theorem 4.13(iii). In the free endpoint problem, , so by Theorem 4.13(iii), the problem is attainable if and only if . By Proposition 6.4 of [17], the latter condition always holds. Thus, we recover the well-known fact that for the free endpoint case in the positive semidefinite case, the problem is always attainable.
Now we discuss the fixed endpoint problem. The main results are summarized in Theorem 10.18 in [18]. In particular, when , , where is the largest positive semidefinite solution to the ARE, and the optimal control is . We would like to verify that our Theorem 4.13 recovers these results. We must show that when , then . For the fixed endpoint problem, , so . The desired result is then immediately obtained from Theorem 5.1.
Now we consider attainability in the fixed endpoint case. The well-known necessary and sufficient conditions for attainability in the positive semidefinite case, stated in Theorem 10.18(iii) of [18], is that every eigenvalue of on the imaginary axis is observable. We must show that this statement is equivalent to our attainability result in Theorem 4.13(iii), which for the fixed-endpoint case requires that , or equivalently, . This connection is resolved by the following result, whose proof is found in the Appendix.
Theorem 5.6**.**
*Suppose is stabilizable and . Then every eigenvalue of on the imaginary axis is observable if and only if . *
The final verification of our result in the fixed endpoint case is to show that the closed-loop system, , is asymptotically stable, thereby recovering Theorem 10.18(v) in [18]. Note that so that . By Theorem 5 in [19], we have that if and only if . Since by stabilizability and by attainability, we have , as desired.
6 Conclusion
In this paper we address a problem in the area of linear quadratic optimal control which has been open for the last 20 years. Specifically, we consider the regular, infinite-horizon, stability-modulo-a-subspace, indefinite LQ problem when the dynamics are stabilizable. Previous works have also addressed this problem, but under the restrictive assumption that the dynamics are controllable. The generalization from controllable to stabilizable dynamics is significant in that there is a lack of structure in the solutions of the algebraic Riccati equation in the stabilizable case. Consequently the connection between the ARE solution set and the LQ problem under consideration has remained elusive. We resolved this gap by combining a suitable sufficient condition for a finite optimal cost with a specific decomposition to unambiguously deduce the correct form of the optimal cost and control. The determination of necessary and sufficient conditions for a finite value function in the regular, infinite-horizon, stability-modulo-a-subspace, indefinite LQ problem is still open. As future work, we are also interested in applying our result to reachability problems, namely by employing an indefinite cost functional on a stabilizable linear system to characterize the convergence of trajectories to a nontrivial subspace over the infinite time horizon.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Albert. Conditions for Positive and Nonnegative Definiteness in Terms of Pseudoinverses. SIAM J. Applied Mathematics . vol. 17, no. 2, pp. 434-440, 1969.
- 2[2] B. D. O. Anderson and J. B. Moore. Optimal Control: Linear Quadratic Methods . Prentice-Hall International, Inc., 1989.
- 3[3] R. W. Brockett. Finite Dimensional Linear Systems . 1970.
- 4[4] J. Engwerda. The Regular Convex Cooperative Linear Quadratic Control Problem. Automatica . vol. 44, no. 9, pp. 2453-2457, 2008.
- 5[5] F. R. Gantmacher. The Theory of Matrices . vol. 1. Chelsea Publishing, 1959.
- 6[6] T. Geerts. A Necessary and Sufficient Condition for Solvability of the Linear-Quadratic Control Problem without Stability. Systems and Control Letters . vol. 11, no. 1, pp. 47-51, 1988.
- 7[7] T. Geerts. Structure of Linear-Quadratic Control . Ph. D. Thesis, Eindhoven University of Technology, Eindhoven 1989.
- 8[8] T. Geerts. A Priori Results in Linear-Quadratic Optimal Control Theory. Kybernetika . vol. 27, no. 5, pp. 446-457, 1991.
