Expression of a Real Matrix as a Difference of a Matrix and its Transpose Inverse
Mil Mascaras, Jeffrey Uhlmann

TL;DR
This paper presents a novel way to express any real matrix as the difference between a matrix and the transpose of its inverse, potentially aiding control theory and matrix analysis applications.
Contribution
It introduces a new representation of real matrices as a difference involving transpose-inverse terms, addressing existing obstacles in control and computational applications.
Findings
Provides a general formula for the representation
Facilitates analysis in control theory
Potentially simplifies matrix computations
Abstract
In this paper we derive a representation of an arbitrary real matrix M as the difference of a real matrix A and the transpose of its inverse. This expression may prove useful for progressing beyond known results for which the appearance of transpose-inverse terms prove to be obstacles, particularly in control theory and related applications such as computational simulation and analysis of matrix representations of articulated figures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Expression of a Real Matrix as a Difference
of a Matrix and its Transpose Inverse
Mil Mascaras111Work initiated while honorary visiting professor.
Paper submitted to Journal de Ciencia e Ingenieria on 28 Dec 2018. Accepted with revisions on 2 Apr 2019. To be published: Vol. 11, No. 1, 2019. and Jeffrey Uhlmann
Dept. of Electrical Engineering and Computer Science
University of Missouri-Columbia
Abstract
In this paper we derive a representation of an arbitrary real matrix as the difference of a real matrix and the transpose of its inverse. This expression may prove useful for progressing beyond known results for which the appearance of transpose-inverse terms prove to be obstacles, particularly in control theory and related applications such as computational simulation and analysis of matrix representations of articulated figures.
Keywords: Articulated figure analysis, algebraic Riccati equation, control systems, matrix analysis, matrix decompositions, matrix splitting, octonions, quaternions, real matrices, relative gain array, RGA, singular value decomposition, SVD.
AMS Code: 15A09
1 Introduction
A common way to glean information about a given matrix, e.g., to reveal opportunities for manipulating or transforming it, is to express it as the sum or difference of matrices with particular structure or properties. This may be a simple expression of a singular matrix as a sum of nonsingular ones [7] or a given matrix expressed as the sum of a symmetric matrix and a skew-symmetric matrix [3]. Matrix splitting is an example of a widely-used technique that relies on the expression of a matrix as the difference of two matrices with special properties, e.g., for solving systems of differential equations [16].
Transpose-inverse terms {(\mbox{{\bf A}^{!\mbox{\tiny-1}}})}^{\mbox{\tiny T}} \left(\mbox{or, equivalently,\,}{(\mbox{{\bf A}^{!\mbox{\tiny T}}})}^{\mbox{\tiny-1}}\right), commonly abbreviated as , arise naturally in a variety of control system contexts, e.g., the relative gain array (RGA) [1] and formulations of the controllability Gramian [5]. As an example, classical solution methods for the discrete-time algebraic Riccati equation involve the symplectic form [6]:
[TABLE]
Unfortunately, there are very few matrix factorizations or decompositions involving transpose-inverse terms to aid in the manipulation of equations for theoretical analysis or practical implementation. In this paper we provide an incremental improvement to this state of affairs with a representation of an arbitrary nonsingular real matrix as the difference of a real matrix and its transpose inverse:
[TABLE]
We begin with a derivation of this result in the next section and then develop related results for singular and complex matrices.
2 The Real Nonsingular Case
The main result can be derived from an application of the singular value decomposition (SVD) of the real nonsingular matrix as
[TABLE]
where and are real orthonormal and is a positive diagonal matrix of the singular values of . It can be observed that defining with diagonal as
[TABLE]
satisfies
[TABLE]
only if each diagonal element \mbox{{\bf E}}_{ii} satisfies
[TABLE]
This defines a quadratic constraint on each \mbox{{\bf E}}_{ii} that can be verified to admit solutions
[TABLE]
which can be taken as real for all \mbox{{\bf D}}_{ii} by positivity of the singular values of . Furthermore, the nonnegative solution can be taken for each \mbox{{\bf E}}_{ii} so that is determined completely up to the uniqueness (or lack thereof) provided by the SVD evaluation method.
The case of singular can be handled in two ways. As it stands, the previous construction can be restricted to only the nonzero singular values to give
[TABLE]
where now has the same rank as and its inverse is replaced with a pseudoinverse. Alternatively, the zero singular values of above can be replaced with unity222This works because the unit singular values will cancel in the difference with their corresponding ones in the transpose inverse, and in fact is the positive solution to Eq. (11) for \mbox{{\bf D}}_{ii}=0. so that singular is expressed as the difference of nonsingular matrices in the form of Eq. (2). In summary, the expression of Eq. (2) can be obtained for all real square matrices whereas that of Eq. (12) extends to real of any shape.
3 Variations and Generalizations
The results from the previous section can be verified to generalize directly to the case of complex if conjugate-transpose is used in place of the transpose operator
[TABLE]
However, variants in which the matrix difference of Eq. (2) is replaced with a sum, or the transpose operator is maintained for complex , cannot generally be obtained in a similar form.
The challenge posed by the form
[TABLE]
is that the resulting analog of Eq. (11) becomes
[TABLE]
which admits real solutions only for \mbox{{\bf D}}_{ii}\geq 2. In other words, the form of Eq. (14) can only be obtained with real if the smallest singular value of satisfies \mbox{\sigma_{\mbox{\tiny min}}}\geq 2. On the other hand, from this we can conclude that the form of Eq. (14) can always be obtained for \frac{2}{\mbox{\sigma_{\mbox{\tiny min}}}}\mbox{{\bf M}}, i.e., when real nonsingular is scaled to ensure its smallest nonzero singular value is 2. This yields the slightly less pleasing form
[TABLE]
which can be obtained for all real nonsingular by letting c=2/\mbox{\sigma_{\mbox{\tiny min}}}.
For completeness we note that Eq. (11) & Eq. (15) can be applied with matrix arguments in place of diagonal elements to obtain, respectively, the non-transpose forms:
[TABLE]
and
[TABLE]
which are not relevant to the focus of this paper, e.g., because they are not applicable to rectangular matrices, but may be of independent interest.
4 Application to Articulated Figure Analysis
For purposes of kinematic simulation and analysis of articulated figures, e.g., for motion prediction and/or animation, there are multiple mathematical representations. Most commonly, a human figure is represented with segments and joints (see Fig. 1) with the mobility constraints expressed using Euler angles, quaternions (or double quaternions), and/or exponential maps [15, 11, 2].
More generally, the structure of the figure may be represented simply as a matrix. For example, compositions of rigid rotations involving a heirarchy of coordinate axes can be represented in the form of a real orthonormal rotation matrix, , such that the transformation of a given matrix representing an articulated figure can be expressed as {\bf R}$${\bf M}$${\bf R}^{\mbox{\tiny T}}. The price paid for the generality of representing figures and shapes using matrices is the challenge of how to manipulate and analyze such expressions, e.g., for operations such as graph matching or simply to gain enhanced intuitive insights. As indicated in [9]: “it is not yet clear how to choose and characterize the group of transformations under which such shapes should be studied.”
A key property that clearly must be maintained is consistency with respect to real orthonormal transformations. It can easily be verified that this property is maintained by the transformation f(\mbox{{\bf M}})\rightarrow\mbox{{\bf A}}, \mbox{{\bf M}}=\mbox{{\bf A}}-\mbox{{\bf A}^{!\mbox{\tiny-T}}}, by virtue of its derivation via the SVD, or explicitly as:
[TABLE]
where the fact that {\mbox{{\bf R}}}^{\mbox{\tiny-T}}=\mbox{{\bf R}} for real orthonormal has been exploited. This therefore demonstrates the desired consistency property:
[TABLE]
In the previous section we discussed the generalization from transpose to conjugate-transpose, which can support consistency with respect to unitary transformations, and it is natural to consider a further generalization to the nonassociative octonions. This would permit forces to be more flexibly incorporated [17], and it would permit temporal sequences of non-compositional operators to be expressed uniquely based on a specified associativity rule [8]. More specifically, a temporal sequence of non-associative operators applied at times could be derived as a solution to a given problem and expressed in directional time-assymetric form as
[TABLE]
Unfortunately, the proposed matrix decomposition does not appear to practically accommodate unitary transformations over the octonions because there presently does not exist an efficiently computable octonion analog of the SVD333However, the recent approach of [10] is of potential relevance in this regard..
5 Discussion
The main result of this paper is that every real matrix can be represented as the difference of two real matrices in the form
[TABLE]
A measure of the potential utility of a given factorization or decomposition is the extent to which it can be used to obtain nontrivial derivative results. Eq. (24) admits a variety of trivial ones, e.g., multiplication of by or gives a symmetric difference involving a positive semidefinite matrix and the identity matrix. However, additional structural properties can be derived based on known results involving transpose inverses, e.g., the relative gain array (RGA) mentioned in the introduction. The RGA of a square nonsingular matrix is defined [1] as
[TABLE]
where represents the elementwise Hadamard matrix product. Because of its practical importance in control applications the RGA has been shown [4] to have a variety of interesting mathematical properties, and those properties therefore carry over to the Hadamard product of the matrix terms of Eq. (24). For example, it is known that \mbox{{\mathcal{R}GA}}(\mbox{{\bf G}}) is invariant with respect to diagonal scalings of and that its rows and columns have unit sum, i.e., \mbox{{\mathcal{R}GA}}(\mbox{{\bf G}}) is generalized doubly stochastic. This implies that by letting \mbox{{\bf B}}=\mbox{{\bf A}^{!\mbox{\tiny-T}}} in Eq. (24) one can obtain an expression with significantly less explicit structural information
[TABLE]
which would still be of interest based solely on the property that \mbox{{\bf A}}\circ\mbox{{\bf B}} is generalized doubly stochastic444It has recently been shown that a new generalized inverse [12] can be used to preserve some RGA properties in the case of singular [13, 14], thus conferring related properties to the expression of Eq. (12).. In summary, the main result of this paper is a specialized form of matrix splitting that offers a diverse set of interesting mathematical properties that derive from and thus may prove useful to control theory and related applications.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Bristol, “On a new measure of interaction for multivariable process control,” IEEE Transactions on Automatic Control , vol. 11, no. 1, pp. 133-134, 1966.
- 2[2] F.S. Grassia, “Practical Parameterization of Rotations Using the Exponential Map,” The Journal of Graphics Tools , volume 3.3, 1998.
- 3[3] R.A. Horn and C.R. Johnson, Matrix Analysis , Cambridge University Press, 1999.
- 4[4] C.R. Johnson and H.M. Shapiro, “Mathematical Aspects of the Relative Gain Array ( A ∘ A − T ) 𝐴 superscript 𝐴 𝑇 (A\circ A^{-T}) ,” SIAM. J. on Algebraic and Discrete Methods , 7(4), 627-644, 1985.
- 5[5] I. Klickstein, A. Shirin, and F. Sorrentino, “Energy scaling of targeted optimal control of complex networks,” Nature Communications , 8:15145, 2017.
- 6[6] A.J. Laub, “A Schur Method for Solving Algebraic Riccati Equations,” MIT Technical Report , LIDS-R-859, 1978.
- 7[7] N.L. Lord, “Matrices as Sums of Invertible Matrices,” Mathematics Magazine , February, 1987.
- 8[8] M. Mascaras and J. Uhlmann, Unpublished open-forum discussion during Lucharama-06 (filming) , 2006.
