On a frame theoretic measure of quality of LTI systems
Mohammed Rayyan Sheriff, Debasish Chatterjee

TL;DR
This paper introduces a novel measure of control system quality based on frame theory, showing that optimal controllability corresponds to tight frames in the controllability matrix.
Contribution
It links classical controllability measures to frame theory, proposing a new quality measure for LTI systems rooted in tight frame properties.
Findings
Optimal control measures occur when the controllability matrix columns form a tight frame.
The proposed measure aligns with classical measures like trace, eigenvalues, and determinant.
Provides a new perspective connecting control theory and frame theory.
Abstract
It is of practical significance to define the notion of a measure of quality of a control system, i.e., a quantitative extension of the classical notion of controllability. In this article we demonstrate that the three standard measures of quality involving the trace, minimum eigenvalue, and the determinant of the controllability grammian achieve their optimum values when the columns of the controllability matrix from a tight frame. Motivated by this, and in view of some recent developments in frame theoretic signal processing, we provide a measure of quality for LTI systems based on a measure of tightness of the columns of the reachability matrix .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On a frame theoretic measure of quality of LTI systems
Mohammed Rayyan Sheriff, Debasish Chatterjee D Chatterjee was supported in part by the grant 12IRCCSG005 from IRCC, IIT Bombay.M Rayyan and Debasish Chatterjee are with the interdisciplinary program in Systems and Control Engineering, IIT Bombay, India. Emails: [email protected], [email protected]
Abstract
It is of practical significance to define the notion of a measure of quality of a control system, i.e., a quantitative extension of the classical notion of controllability. In this article we demonstrate that the three standard measures of quality involving the trace, minimum eigenvalue, and the determinant of the controllability grammian achieve their optimum values when the columns of the controllability matrix from a tight frame. Motivated by this, and in view of some recent developments in frame theoretic signal processing, we provide a measure of quality for LTI systems based on a measure of tightness of the columns of the reachability matrix .
I Introduction
Let us consider a linear time invariant controlled system
[TABLE]
where and , with and some positive integers 111The set of real symmetric positive definite and positive semi definite matrices are denoted by and respectively, is the column space of the matrix , is the trace of the matrix , is the transpose of the vector , is the -norm of the vector .. For , let . Then the reachable space and the reachability index of the LTI system (1) are defined by
[TABLE]
We say that the control system (1) is controllable in the classical sense if it admits a control sequence \big{(}u(0),\ldots,u(T-1)\big{)} that can transfer the states of the system (1) from to for preassigned values of whenever . It is common knowledge that the LTI system (1) is controllable if and only if the matrix is of rank .
However, in most of the practical scenarios, information about the system behavior provided by the classical notion of controllability is limited. In particular, no assessment of “how controllable” is a given LTI system is provided by the classical ideas. Naturally, together with knowing whether a control system is controllable or not, one would also like to know how controllable or how good is the control system. In other words, there is a clear need to quantify the notion of controllability.
For instance, let us consider two second order LTI systems described by the pairs and such that for the columns of the matrices , are as shown in the following figure.
We see that for the system , since the vectors form small angles with each other, intuitively, the transfer of the system state to final states that is in approximately orthogonal directions to these vectors will demand control with higher magnitudes. In contrast, for the system the vectors are spread out in space and will in general require lower control magnitude. Such a collection of vectors are called as tight frames. It turns out that tight frames have many practical advantages over generic collection of vectors [1], [2], [3], [4], [5] from signal processing perspective. In this article we shall establish that if the columns of constitute a tight frame, one also gets several control theoretic advantages.
In particular, we provide a notion of quantitative controllability, which we shall refer to as the Measure Of Quality (MOQ) of a control system. Intuitively speaking, any measure of quality should relate to some important characteristics of the system like average control energy / control effort required, robustness to input / noise, ability to control the systems with sparse controls etc We shall see below that such intuitive ideas are indeed justified.
Let us define the control effort of a control sequence \big{(}u(0),\ldots,u(T-1)\big{)}, defined by
[TABLE]
is a quantity of practical significance. For example, the value of the control effort in an electronic circuit system would involve information of the amount of power drawn to control the system. One of the primary objectives in controlling a dynamical system efficiently could be to minimize the required control effort. Therefore, let us consider the following optimal control problem:
[TABLE]
Let the grammian be defined by
[TABLE]
We know that the optimal control problem (2) admits an unique optimal control sequence \big{(}u^{\ast}(0),\ldots,u^{\ast}(T-1)\big{)} with the optimum control effort given by:
[TABLE]
We briefly review some of the most well known MOQ’s studied in the literature [6], [7], [8]; they capture someinformation pertaining to optimum control effort needed to control the system:
- (i)
\operatorname{tr}\big{(}G_{(A,B,T)}^{-1}\big{)}: This is proportional to the average optimal control effort needed to transfer the system state from origin (i.e., ,) to a random point that is uniformly distributed on the unit sphere. 2. (ii)
\lambda_{\min}^{-1}\big{(}G_{(A,B,T)}\big{)}: This gives the maximum control effort needed to transfer the system state from origin to any point on the unit sphere. 3. (iii)
\det\big{(}G_{(A,B,T)}\big{)}: This proportional to the volume of the ellipsoid containing points to which the system state can be transferred to from the origin using at most unit control effort.
A finite value of the MOQ in (i) and (ii), and a nonzero value in (iii), implies that the system (1) is controllable in the classical sense. In addition, since these quantities also contain some information about the optimal control effort, they can be considered as valid candidates for an MOQ. However, these MOQ’s are increasingly difficult to compute as the size of the system becomes larger, and since they arise primarily in the context of the optimal control problem (2), they do not provide meaningful insights about properties such as immunity to noise, robustness, ability to generate sparse controls etc.
For the dynamical system (1), we shall motivate the notion of an MOQ from the perspective of the orientation of the columns of the matrix . We shall prove in this article that the three MOQ’s discussed above achieve their optimum value when the columns of the matrix constitute a tight frame. We provide an MOQ for the system (1) as a measure of the tightness of the columns of , and provide a sufficient condition for the classical controllability of the system (1) in terms of the proposed MOQ.
II Review of tight frames
Definition 1**.**
For an dimensional Hilbert space with an inner product , a sequence of vectors is said to be a frame of if it spans ; a frame is said to be tight, if there exists a number such that for every , the equality holds.
A classical example of a tight frame is an orthonormal basis of that satisfies the definition with and . Some other examples of the tight frames of are shown in figure below.
The collection of vectors pointing towards the corners of platonic solids also constitute tight frames; several other examples may be found in [9].
The study of frames was started by Duffin and Schaeffer [10] and improvised greatly by work of Daubechies et al. in [11]. Recent work on characterization of tight frames as minimizers of a certain potential function was done in [9] and [12]. The growing interest in developing the theory of tight frames is mainly because, tight frames possess several desirable properties. For example, the representation of signals using tight frames exhibits better resilience to noise and quantization [2], [3]. Tight frames are also considered to be good for representing signals sparsely, and are the -optimal dictionaries for representing vectors that are uniformly distributed over spheres [1, Proposition 2.13, p. 10]. These are just a few among the plethora of useful properties of tight frames that make them relevant and important for many applications.
We now formally discuss the concept of tight frames, their properties, and their characterization.
Definition 2**.**
The fame operator of a sequence of vectors is defined by
[TABLE]
The following result provides the necessary and sufficient conditions for tightness of a frame.
Theorem 3**.**
[12*, Proposition 1, p. 2, ]** [9, Theorem 2.1, p. 4]
A sequence of vectors is a tight frame to with some constant , if and only if*
- •
The lengths of the frame vectors satisfy
[TABLE]
- •
The frame operator satisfies , where is the identity operator on .
II-A Characterization of Tight Frames
We observe that the definition of a tight frame does not give a direct method to compute them. One of the ways to compute tight frames is by minimizing a certain potential function that is motivated from physical examples briefly outlined in [9], [12]. The following definitions are relevant for this article:
Definition 4**.**
The frame force experienced by a point due to another point is defined by
[TABLE]
Definition 5**.**
The frame potential function of a sequence of vectors is defined by
[TABLE]
Observe that the frame force between any two points in pushes them towards orthogonality. In this sense we can say that a tight frame is the “most orthogonal” collection of vectors in ; alternatively, a tight frame is a collection of vectors that are maximally spread out in the space. It was established in [12], [9] that finite tight frames for are in an equilibrium configuration under the frame force, and hence are the local minimizers of the associated frame potential. Thus, tight frames are characterized as the minimizers of the frame potential, and can be computed by variational techniques. This particular fact is the assertion of the following theorem.
Theorem 6**.**
[12*, Proposition 4, p. 8]**
For any sequence of vectors , the following holds*
[TABLE]
The preceding inequality is satisfied with equality if and only if is a tight frame of .
From Theorem 6 we conclude that tight frames are not only local, but are global minimizers of the frame potential, subject to the constraint that the length of each vector in the sequence is fixed. Thus, we can say that the frame potential of a given sequence gives us a measure of its tightness. However, the lower bound of the frame potential given by Theorem 6 depends on the sequence itself. This restricts the use of the frame potential of a sequence as a measure of its tightness when comparing two generic frames. For example, let us consider the frames F_{1}\coloneqq\big{(}(1,0)^{\top},(1,1)^{\top}\big{)} and F_{2}\coloneqq\big{(}(10,0)^{\top},(-5,-5\sqrt{3})^{\top},(-5,5\sqrt{3})^{\top}\big{)} of . From Definition 5, the frame potential of the frames and are and respectively. Even though , from the second assertion of Theorem 3 we see that the frame is not tight, where as is a tight frame. To address this issue, we normalize the frame potential by appropriately scaling the vectors so that the frame potential calculated this way is a constant for every tight frame.
Let be any sequence of vectors, we define the Normalized Frame Potential ( ) of the sequence using its frame operator as
[TABLE]
We get the following analogue of Theorem 6.
Proposition 7**.**
For any sequence of vectors , we have
[TABLE]
The preceding inequality is satisfied with the equality if and only if is a tight frame of .
Proof.
From [13, Lemma 1, p. 7]
[TABLE]
Then
[TABLE]
and the assertion follows immediately from Theorem 6. ∎
The of the frames and are and respectively, which clearly indicates that the frame is tighter than the frame . Our (4) allows us to compare the tightness of any two arbitrary sequences of vectors in and therefore can be regarded as a valid measure of tightness of the given sequence of vectors.
III Main Results
We would like to find the optimal orientations of the columns of the matrix , that optimize the three measure of qualities discussed in Section I, namely, \operatorname{tr}\big{(}G_{(A,B,T)}^{-1}\big{)}, and . Let us consider a generic sequence of vectors. Then the three MOQ’s stated above evaluated for the generic sequence are given in terms of the corresponding frame operator by \operatorname{tr}\big{(}G_{(v_{1},\ldots,v_{K})}^{-1}\big{)}, \lambda_{\min}^{-1}\big{(}G_{(v_{1},\ldots,v_{K})}\big{)} and \det\big{(}G_{(v_{1},\ldots,v_{K})}\big{)} respectively. Let \big{(}\alpha_{1},\ldots,\alpha_{K}\big{)} be a non increasing sequence of positive real numbers such that .
In order to find the optimal orientation of the vectors, we optimize the three objective functions \operatorname{tr}\big{(}G_{(v_{1},\ldots,v_{K})}^{-1}\big{)}, \lambda_{\min}^{-1}\big{(}G_{(v_{1},\ldots,v_{K})}\big{)} and \det\big{(}G_{(v_{1},\ldots,v_{K})}\big{)}, subject to the constraint that the lengths of the vectors are fixed, i.e., for all . Thus, we have the following three optimization problems:
[TABLE]
[TABLE]
[TABLE]
Note that the optimization problems (5), (6) and (7) are optimization problems over sequences of vectors. The next Theorem characterizes and provides solutions of (5), (6) and (7); in its proof we get equivalent optimization problems with non negative definite matrices as variables.
Theorem 8**.**
A sequence of vectors , that is feasible for the optimization problems (5), (6) and (7) is an optimal solution if and only if it is a tight frame of .
Proof.
For the non increasing finite sequences , and of positive real numbers we define the relation if the following two conditions hold:
[TABLE]
The conditions of (8) are analogue of the standard majorization conditions [14, Chapter 1].
For a symmetric non negative definite matrix , we define \lambda(G)\coloneqq\big{(}\lambda_{1}(G),\ldots,\lambda_{n}(G)\big{)} to be the non increasing sequence of the eigenvalues of the matrix . Let \textbf{a}\coloneqq\big{(}\frac{1}{n}\sum_{i=1}^{K}\alpha_{i}\big{)}.
We will use the following result related to the decomposition of non-negative definite matrices.
Lemma 9**.**
[15*, Theorem 4.6]** [13, Theorem 2.8, p. 4]
For any given sequence of positive real numbers and a non negative definite matrix with , the following statements are equivalent:*
- •
There exists a sequence of vectors such that and for all .
- •
.
On the one hand, for any sequence of vectors that are feasible for the optimization problems (5), (6) and (7), from Lemma 9 it follows that and . On the other hand, for any symmetric positive definite matrix such that , we conclude again from Lemma 9 that there exists a sequence of vectors such that , , and for all . Therefore, under the mapping
[TABLE]
the optimization problems:
[TABLE]
[TABLE]
[TABLE]
are equivalent to (5), (6) and (7) respectively.
Claim 10**.**
* is the unique (corresponding to a given ) optimal solution of the optimization problem (10).*
Proof.
Let us consider the following optimization problem:
[TABLE]
The optimization problem (13) is the same as (22) in [1, p. 17] with , whose solution is given in (25) of the same article. Therefore, from [1] we conclude that is the unique optimal solution to the problem (13).
It is easy to see that for a symmetric non negative definite matrix , the condition that is sufficient for the equality to hold. Therefore, the optimum value in the optimization problem (13) is a lower bound of the optimal value (if it exists) of the problem (10).
However, it can be easily verified that ; thus, is feasible for the optimization problem (10), and the objective function evaluated at is equal to , which is a lower bound of the optimal value of (10). Therefore, the optimization problem (10) admits an optimal solution. Since is the unique solution to (13), we conclude that is the unique optimal solution to (10) as well. ∎
Claim 11**.**
* is the unique (corresponding to a given ) optimal solution of the optimization problem (11).*
Proof.
The objective function and the constraint in problem (11) can be explicitly characterized in terms of the eigenvalues of the variable matrix . Then under the map
[TABLE]
we get the following optimization problem equivalent to (11).
[TABLE]
For every sequence that is feasible for the optimization problem (15), we see that . Therefore,
[TABLE]
Therefore, the value is a lower bound of the optimal value (if it exists) of the optimization problem (15). Let us define for all . It can be easily verified that
[TABLE]
From (16) it follows that the sequence is feasible for the optimization problem (15), and that the objective function evaluated at is equal to the lower bound . Therefore, we conclude that the optimization problem (15) admits an optimal solution, and that the sequence is an optimal solution. It should also be noted that is the unique sequence of positive real numbers that satisfies (16). Therefore, the sequence is the unique optimal solution of the problem (15).
Due to the equivalence of the optimization problems (11) and (15), we conclude that the optimization problem (11) also admits an optimal solution. A non negative definite symmetric matrix is an optimal solution of the problem (11) if and only if it satisfies
[TABLE]
and we know for a fact that is the only matrix that satisfies (17) that is also feasible for the problem (11). Thus, is the unique optimal solution of the problem (11). ∎
Claim 12**.**
* is the unique (corresponding to a given ) optimal solution to the optimization problem (12).*
Proof.
Writing the objective function and the constraints of the optimization problem (12) in terms of the eigenvalues of the matrix , we get the following optimization problem equivalent to (12).
[TABLE]
For every sequence that is feasible for the problem (18), it follows that
[TABLE]
Therefore, the value is an upper bound to the optimal value (if it exists) of the optimization problem (18). Let us define for all , it follows that
[TABLE]
It should also be noted that is the unique sequence of positive real numbers that satisfies (20). Therefore, the optimization problem (18) admits an unique optimal solution and the sequence is the unique optimizer.
Equivalently, as we have seen in the proof of Claim 11, is the unique optimal solution to the optimization problem (12). ∎
Due to the equivalence of the optimization problem pairs (5)-(10), (6)-(11) and (7)-(12), we conclude from the Claims 10, 11, and 12 that a sequence of vectors is an optimal solution if and only if its frame operator satisfies , which in turn is true if and only if is a tight frame (from Theorem 3). ∎
In addition to the fact that tight frames are optimal solutions of the optimization problems (5), (6) and (7), we have previously listed some desirable properties of tight frames for representation of generic vectors in Section II. Most of the reachability(both ballistic and servomechanism [16, p. 74-75]) control problems arising in practice, involve solving the following linear equation for various values of and :
[TABLE]
either directly or indirectly. This is equivalent to expressing the vector as a linear combination of the columns of . We would like to have the the columns of to be oriented as tightly as possible in order to inherit those advantages. Thus, a measure of quality of the LTI system (1) should be the measure of tightness of the columns of the matrix ; this is a key proposal of this article.
Definition 13**.**
We define a measure of quality for the LTI system (1) as a measure of the tightness of the columns of ; defined by
[TABLE]
From Proposition 7, for any LTI system given by the pair and , we have
[TABLE]
and equality holds only when the columns of matrix constitute a tight frame for . A direct application of the Proposition 7 gives us the following sufficient condition for the controllability of the LTI system in terms of the MOQ .
Proposition 14**.**
For the LTI system (1), if for some , then the LTI system is controllable in the classical sense.
Proof.
Suppose not, i.e., the system is not controllable but satisfies for some . Let be the column of the matrix for . Then is a dimensional subspace of , where . In particular, each belongs to this subspace, which is a Hilbert space in its own right. From Proposition 7 we conclude that
[TABLE]
which is a contradiction. The assertion follows. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. R. Sheriff and D. Chatterjee, “Optimal dictionary for least squares representation,” ar Xiv preprint ar Xiv:1603.02074 , 2016.
- 2[2] L. Shen, M. Papadakis, I. A. Kakadiaris, I. Konstantinidis, D. Kouri, and D. Hoffman, “Image denoising using a tight frame,” IEEE transactions on image processing , vol. 15, no. 5, pp. 1254–1263, 2006.
- 3[3] W. Zhou, S. Yang, C. Zhang, and S. Fu, “Adaptive tight frame based multiplicative noise removal,” Springer Plus , vol. 5, no. 1, p. 122, 2016.
- 4[4] T. Strohmer and R. W. Heath, “Grassmannian frames with applications to coding and communication,” Applied and computational harmonic analysis , vol. 14, no. 3, pp. 257–275, 2003.
- 5[5] R. B. Holmes and V. I. Paulsen, “Optimal frames for erasures,” Linear Algebra and its Applications , vol. 377, pp. 31–51, 2004.
- 6[6] P. Müller and H. Weber, “Analysis and optimization of certain qualities of controllability and observability for linear dynamical systems,” Automatica , vol. 8, no. 3, pp. 237–246, 1972.
- 7[7] C. Johnson, “Optimization of a certain quality of complete controllability and 0bservability for linear dynamical systems’,” 1969.
- 8[8] F. Pasqualetti, S. Zampieri, and F. Bullo, “Controllability metrics, limitations and algorithms for complex networks,” IEEE Transactions on Control of Network Systems , vol. 1, no. 1, pp. 40–52, 2014.
