Principal bundle structure of matrix manifolds
M. Billaud-Friess ,
A. Falcó11footnotemark: 1 , A. Nouy11footnotemark: 1
Department of Computer Science and Mathematics,
GeM, Ecole Centrale de Nantes,
1 rue de la Noë, BP 92101, 44321 Nantes Cedex 3, France. Email: [marie.billaud-friess,anthony.nouy]@ec-nantes.fr.Departamento de Matemáticas, Física y Ciencias Tecnológicas, Universidad CEU Cardenal Herrera, San Bartolomé 55, 46115 Alfara del Patriarca (Valencia), Spain. E-mail: [email protected].
Abstract
In this paper, we introduce a new geometric description of the manifolds of matrices of fixed rank.
The starting point is a geometric description of the Grassmann manifold Gr(Rk) of linear subspaces of dimension r<k in Rk which avoids the use of equivalence classes. The set Gr(Rk) is equipped with an atlas which provides it with the structure of an analytic manifold modelled on R(k−r)×r.
Then we define an atlas for the set Mr(Rk×r) of full rank matrices and prove that the resulting manifold is an analytic principal bundle with base Gr(Rk) and typical fibre GLr, the general linear group of invertible matrices in Rk×k. Finally, we define an atlas for the set Mr(Rn×m) of non-full rank matrices and prove that the resulting
manifold is an analytic principal bundle with base Gr(Rn)×Gr(Rm) and typical fibre GLr.
The atlas of Mr(Rn×m) is indexed on the manifold itself, which allows a natural definition of a neighbourhood for a given matrix, this neighbourhood being proved to possess the structure of a Lie group.
Moreover, the set Mr(Rn×m) equipped with the topology induced by the atlas is
proven to be an embedded submanifold of the matrix space Rn×m
equipped with the subspace topology. The proposed geometric description then results in a description of the matrix space Rn×m, seen as the union of manifolds Mr(Rn×m), as an analytic manifold equipped with a topology for which the matrix rank is a continuous map.
Keywords: Matrix manifolds, Low-rank matrices, Grassmann manifold, Principal bundles.
1 Introduction
Low-rank matrices appear in many applications involving high-dimensional data. Low-rank models are commonly used in statistics, machine learning or data analysis (see [18] for a recent survey). Also, low-rank approximation of matrices is the cornerstone of many modern numerical methods for high-dimensional problems in computational science, such as model order reduction methods for dynamical systems, or parameter-dependent or stochastic equations [4, 5, 14, 6].
These applications yield problems of approximation or optimization in the sets of matrices with fixed rank
[TABLE]
A usual geometric approach is to endow the set
Mr(Rn×m) with the structure of a Riemannian manifold [16, 3], which is seen as an embedded submanifold of
Rn×m equipped with the topology τRn×m given by matrix norms. Standard algorithms then work in the ambient matrix space Rn×m and do not rely on an explicit geometric description of the manifold using local charts (see, e.g., [17, 12, 13, 8]). However, the matrix rank considered as a map is not continuous for the topology τRn×m, which can yield undesirable numerical issues.
The purpose of this paper is to propose a new geometric description of
the sets of matrices with fixed rank which is amenable for numerical use, and which relies on the natural
parametrization of matrices in Mr(Rn×m) given by
[TABLE]
where U∈Rn×r and V∈Rm×r are matrices with full rank r<min{n,m}, and
G∈Rr×r is a non singular matrix.
The set Mr(Rn×m) is here endowed with the structure of analytic principal bundle, with an explicit description of local charts. This results in a description of the matrix space Rn×m as an analytic manifold with a topology induced by local charts which is different from τRn×m and for which the rank is a continuous map. Note that the representation (1) of a matrix Z is not unique because
Z=(UP)(P−1GPT)(VP−1)T
holds for every invertible matrix P in Rr×r. An argument used to dodge this undesirable property is the possibility to uniquely define a tangent space (see for example Section 2.1 in [8]), which is a prerequisite for standard algorithms on differentiable manifolds. The geometric description proposed in this paper avoids this undesirable property. Indeed, the system of local charts for the set Mr(Rn×m) is indexed on the set itself. This allows a natural definition of a neighbourhood for a matrix where all matrices
admit a unique representation.
The present work opens the route for new numerical methods for optimization or dynamical low-rank approximation, with algorithms working in local coordinates and avoiding the use of a Riemannian structure, such as in
[10], where a framework is introduced
for generalising iterative methods from Euclidean space to manifolds which ensures that local convergence rates are preserved.
The introduction of a principal bundle representation of matrix manifolds
is also motivated by the importance of this geometric structure in the concept of gauge potential in physics [11].
We would point out that the
proposed geometric description has a natural extension to the case of fixed-rank operators on infinite dimensional spaces
and is consistent with the geometric description of manifolds of tensors with fixed rank
proposed by Falcó, Hackbush and Nouy [7], in a tensor
Banach space framework.
Before introducing the main results and outline of the paper, we recall some elements of geometry.
1.1 Elements of geometry
In this paper, we follow the approach of Serge Lang [9] for the definition of a manifold M.
In this framework, a set M is equipped with an atlas
which gives M the structure of a topological space, with a topology induced by local charts, and the structure of differentiable manifold compatible with this topology.
More precisely, the starting point is the definition of
a collection of non-empty subsets Uα⊂M, with α in a set A, such that {Uα}α∈A is a covering of M.
The next step is the explicit construction for any α∈A of a local chart φα which is
a bijection from Uα to an open set Xα of the finite dimensional space RNα such that
for any pair α,α′∈M such that Uα∩Uα′=∅, the following properties hold:
- (i)
φα(Uα∩Uα′) and φα′(Uα∩Uα′) are open sets in Xα and Xα′ respectively, and
2. (ii)
the map
[TABLE]
is a Cp differentiable diffeomorphism, with p∈N∪{∞} or p=ω when the map is analytic.
Under the above assumptions, the set A:={(Uα,φα):α∈A}
is an atlas which endows M with a structure of Cp manifold. Then
we say that (M,A) is a Cp manifold, or an analytic manifold when p=ω. A consequence of the condition (ii) is that when Uα∩Uα′=∅ holds for α,α′∈A, then
Nα=Nα′.
In the particular case where Nα=N for all α∈A,
we say that (M,A) is a Cp manifold
modelled on RN. Otherwise, we say that it is a manifold
not modelled on a particular finite-dimensional space. A paradigmatic example is the Grassmann manifold G(Rk) of all
linear subspaces of Rk, such that
[TABLE]
where G0(Rk)={0} and Gk(Rk)={Rk} are trivial manifolds
and Gr(Rk) is a manifold modelled on the linear space R(k−r)×r
for 0<r<k. In consequence, G(Rk) is a manifold not modelled on a
particular finite-dimensional space.
The atlas also endows M with a topology given by
[TABLE]
which makes (M,τA) a topological space where
each local chart
[TABLE]
considered as a map between topological spaces,
is a homeomorphism.111Here (X,τ) denotes a topological space and if X′⊂X, then
τ∣X′ denotes the subspace topology.
1.2 Main results and outline
Our first remark is that the matrix space Rn×m is an analytic manifold modelled on itself and its geometric structure is fully compatible with the topology τRn×m induced by a matrix norm. In this paper, we define an atlas on
Mr(Rn×m) which gives this set the structure of an
analytic manifold, with a topology induced by the atlas fully
compatible with the subspace topology
τRn×m∣Mr(Rn×m). This implies that
Mr(Rn×m) is an embedded submanifold of the matrix manifold Rn×m modelled on itself222Note that the set M0(Rn×m)={0}
is a trivial manifold, which is trivially embedded in Rn×m..
For the topology τRn×m, the matrix rank considered as
a map is not continuous but only lower semi-continuous.
However, if Rn×m is seen as the disjoint union of sets of matrices with fixed rank,
[TABLE]
then Rn×m has the structure of an analytic manifold not modelled
on a particular finite-dimensional space equipped with a topology
[TABLE]
which is not equivalent to τRn×m, and
for which the matrix rank is a continuous map.
Note that in the case when r=n=m, the set Mn(Rn×n) coincides with the general linear group GLn of invertible matrices in
Rn×n, which is an analytic manifold trivially embedded in Rn×n. In all other cases, which are addressed in this paper,
our geometric description of Mr(Rn×m) relies on a geometric description of the Grassmann manifold Gr(Rk), with k=n or m.
Therefore, we start in Section 2 by introducing a geometric description of Gr(Rk).
A classical approach consists of describing Gr(Rk) as the quotient manifold Mr(Rk×r)/GLr of equivalent classes of full-rank matrices Z in Mr(Rk×r) having the same column space colk,r(Z). Here, we avoid the use of equivalent classes and provide
an explicit description of an atlas Ak,r={(UZ,φZ)}Z∈Mr(Rk×r) for Gr(Rk), with local chart
[TABLE]
where Z⊥∈Rk×(k−r) is such that Z⊥TZ=0 and
colk,r(A) denotes the column space of a matrix A∈Rk×r, and we prove that the neighbourhood UZ have the structure of a Lie group. This parametrization of the Grassmann manifold is introduced in [2, Section 2] but the authors do not elaborate on it.
Then in Section 3, we consider the particular case of full-rank matrices. We introduce an atlas Bk,r={(VZ,ξZ)}Z∈Mr(Rk×r)
for the manifold Mr(Rk×r) of matrices with full rank r<k, with local chart
[TABLE]
and prove that Mr(Rk×r) is an analytic principal bundle with base Gr(Rk) and typical fibre GLr. Moreover, we prove that Mr(Rk×r) is an embedded submanifold of (Rk×r,τRk×r∗), and that each of the neighbourhoods VZ have the structure of a Lie group.
Finally, in Section 4, we provide an analytic atlas Bn,m,r={(UZ,θZ)}Z∈Mr(Rn×m) for the set Mr(Rn×m)
of matrices Z=UGVT with rank r<min{n,m}, with local chart
[TABLE]
and we prove that Mr(Rn×m) is an analytic principal bundle with base Gr(Rn)×Gr(Rm) and typical fibre GLr. Then we prove that Mr(Rn×m) is an embedded submanifold of (Rn×m,τRn×m∗), and that each of the neighbourhoods UZ have the structure of a Lie group.
2 The Grassmann manifold Gr(Rk)
In this section, we present a geometric description of the Grassmann manifold Gr(Rk) of all subspaces of dimension r in Rk, 0<r<k,
[TABLE]
with an explicit description of local charts.
We first introduce the surjective map
[TABLE]
where colk,r(Z) is the column space of the matrix Z, which is the subspace spanned by the column vectors of Z.
Given V∈Gr(Rk), there are infinitely many matrices Z such that
colk,r(Z)=V.
Given a matrix Z∈Mr(Rk×r), the set of matrices in Mr(Rk×r) having the same column space as Z is
[TABLE]
2.1 An atlas for Gr(Rk)
For a given matrix Z in Mr(Rk×r), we let Z⊥∈Mk−r(Rk×(k−r)) be a matrix
such that ZTZ⊥=0 and we introduce an affine cross section
[TABLE]
which has the following equivalent characterization.
Lemma 2.1**.**
The affine cross section SZ is characterized by
[TABLE]
and the map
[TABLE]
is bijective.
Proof.
We first observe that
ZT(Z+Z⊥XZ)=ZTZ for all X∈R(k−r)×r,
which implies that {Z+Z⊥X:X∈R(k−r)×r}⊂SZ. For the other inclusion, we observe that if W∈SZ, then ZTW=ZTZ and hence W−Z∈colk,r(Z)⊥, the orthogonal subspace to colk,r(Z) in Rk.
Since colk,r(Z)⊥=colk,k−r(Z⊥), there exists
X∈R(k−r)×r such that
W−Z=Z⊥X. Proving that ηZ is bijective is straightforward.
∎
Proposition 2.2**.**
For each W∈Mr(Rk×r) such that det(ZTW)=0, there exists a unique GW∈GLr such that
[TABLE]
holds, which means that the set of matrices with the same column space as W intersects SZ
at the single point WGW−1.
Furthermore, GW=idr
if and only if W∈SZ.
Proof.
By Lemma 2.1, a matrix A∈WGLr∩SZ is such that
A=WGW−1=Z+Z⊥X for a certain GW∈GLr and a certain X∈R(k−r)×r. Then ZTWGW−1=ZTZ and GW is uniquely defined by GW=(ZTZ)−1(ZTW), which proves that WGLr∩SZ is the singleton {WGW−1}, and
GW=idr
if and only if W∈SZ.
∎
Corollary 2.3**.**
For each Z∈Mr(Rk×r),
the map colk,r:SZ⟶Gr(Rk)
is injective.
Proof.
Let us assume the existence of W,W~∈SZ
such that colk,r(W)=colk,r(W~). Then W=W~ by Proposition 2.2.
∎
Lemma 2.1 and Corollary 2.3 allow us
to construct a system of local charts for Gr(Rk) by
defining for each Z∈Mr(Rk×r) a neighbourhood of colk,r(Z) by
[TABLE]
together with the bijective map
[TABLE]
such that
[TABLE]
for X∈R(k−r)×r.
We denote by Z+ the Moore-Penrose pseudo-inverse of the full rank matrix Z∈Mr(Rr×k), defined by
[TABLE]
It satisfies Z+Z=idr and Z+Z⊥=0. Moreover, ZZ+∈Rk×k is the projection onto colk,r(Z) parallel to colk,r(Z)⊥.
Finally, we have the following result.
Theorem 2.4**.**
The collection Ak,r:={(UZ,φZ):Z∈Mr(Rk×r)} is
an analytic atlas for Gr(Rk) and hence (Gr(Rk),Ak,r)
is an analytic r(k−r)-dimensional manifold modelled on R(k−r)×r.
Proof.
Clearly {UZ}Z∈Mr(Rk×r) is a covering of Gr(Rk). Now let
Z and Z~ be such that UZ∩UZ~=∅.
Let V∈UZ such that V=φZ−1(X)=colk,r(Z+Z⊥X), with X∈Rk×(k−r). We can write Z+Z⊥X=(Z~+Z~⊥X~)G with G=Z~+(Z+Z⊥X) and X~=Z~⊥+(Z+Z⊥X)G−1. Therefore, V=colk,r((Z~+Z~⊥X~)G)=colk,r(Z~+Z~⊥X~)=φZ~−1(X~)∈UZ~, which implies that
UZ=UZ∩UZ~. Therefore, φZ(UZ∩UZ~)=φZ(UZ)=Rk×(n−k) is an open set. In the same way, we show that UZ~=UZ∩UZ~ and φZ~(UZ)=Rk×(n−k) is an open set. Finally, the map φZ~∘φZ−1 from R(k−r)×r to R(k−r)×r
is given by φZ~∘φZ−1(X)=Z~⊥+(Z+Z⊥X)G−1, with G=Z~+(Z+Z⊥XZ), which is clearly an analytic map.
∎
2.2 Lie group structure of neighbourhoods UZ
Here we prove that each neighbourhood UZ of Gr(Rk) is a Lie group.
For that, we first note that
a neighbourhood UZ of Gr(Rk) can be identified with the set SZ through the application colk,r:SZ→UZ. The next step is to
identify SZ with
a closed Lie subgroup of GLk, denoted by GZ, with associated Lie algebra gZ isomorphic to Rr×(k−r), and such that the exponential map333We recall that the matrix exponential exp:Rk×k→GLk
is defined by
exp(A)=∑n=0∞n!An.
exp:gZ⟶GZ is a diffeomorphism. To this end, for a given
Z∈Mr(Rk×r), we introduce the vector space
[TABLE]
The following proposition proves that gZ is a
commutative subalgebra of Rk×k.
Proposition 2.5**.**
For all X,X~∈R(k−r)×r,
[TABLE]
holds, and gZ is a
commutative subalgebra of Rk×k. Moreover,
[TABLE]
[TABLE]
and
[TABLE]
hold for all X∈R(k−r)×r.
Proof.
Since
(Z⊥XZ+)(Z⊥X~Z+)=0
holds for all X,X~∈R(k−r)×r, the vector space
gZ is a closed subalgebra of
the matrix unitary algebra Rk×k. As a consequence, (Z⊥XZ+)p=0 holds for all X∈R(k−r)×r and all p≥2, which proves
(6).
We directly deduce (7) using ZZ+=idr, and (8) using Z+Z⊥=0.
∎
From Proposition 2.5 and the definition of
SZ, we obtain
the following results.
Corollary 2.6**.**
The affine cross section SZ satisfies
[TABLE]
and
[TABLE]
for all X∈R(k−r)×r, where the brackets [⋅∣⋅] are used for matrix concatenation.
Proof.
From Proposition 2.5 and (4), we obtain (9) and we can write
[TABLE]
Since exp(Z⊥XZ+),[Z∣Z⊥]∈GLk, (10) follows.
∎
Now we need to introduce the following definition and proposition (see [15, p.80]).
Definition 2.7**.**
Let (K,+,⋅) be a ring and let (K,+) be its additive group. A subset I⊂K
is called a two-sided ideal (or simply an ideal) of K if it is an additive subgroup of K such that
I⋅K:={r⋅x:r∈I and x∈K}⊂I and
K⋅I:={x⋅r:r∈I and x∈K}⊂I.
Proposition 2.8**.**
If g⊂h is a two-sided ideal of the Lie algebra h of a group H, then the subgroup G⊂H generated by
exp(g)={exp(G):G∈g} is normal and closed, with Lie algebra h.
From the above proposition, we deduce the following result.
Lemma 2.9**.**
Let Z∈Mr(Rk×r) and
Z⊥∈Mk−r(Rk×(k−r)) be such that ZTZ⊥=0. Then gZ⊂Rk×k
is a two-sided ideal of the Lie algebra Rk×k and hence
[TABLE]
is a closed Lie group
with Lie algebra gZ. Furthermore, the map
exp:gZ⟶GZ
is bijective.
Proof.
Consider Z⊥XZ+∈gZ and
A∈Rk×k. Noting that Z+Z=idr and (Z⊥)+Z⊥=idk−r, we have that
[TABLE]
which proves that
gZ⋅Rk×k⊂gZ.
Similarly, we have that
[TABLE]
which proves that Rk×k⋅gZ⊂gZ. This proves that gZ is a
two-sided ideal. The map exp is clearly
surjective. To prove that it is injective, we assume
exp(Z⊥XZ+)=exp(Z⊥X~Z+) for X,X~∈R(k−r)×r.
Then from (6), we obtain Z+Z⊥X=Z+Z⊥X~
and hence X=X~, i.e. Z⊥XZ+=Z⊥X~Z+ in gZ.
∎
Finally, we can prove the following result.
Theorem 2.10**.**
The set SZ together with the group operation ×Z defined by
[TABLE]
for X,X~∈R(k−r)×r
is a Lie group.
Proof.
To prove that it is a Lie group, we simply note that the multiplication and inversion maps
[TABLE]
and
[TABLE]
are analytic.
∎
It follows that UZ can be identified with a Lie group through the map φZ.
Theorem 2.11**.**
Each neighbourhood UZ of Gr(Rk) together with the group operation ∘Z defined by
[TABLE]
for V,V′∈UZ, is a Lie group and the map
γZ:UZ⟶GZ
given by
[TABLE]
is a Lie group isomorphism.
3 The non-compact Stiefel principal bundle Mr(Rk×r)
In this section, we give a new geometric description of the set Mr(Rk×r) of matrices with full rank r<k, which is based on the geometric description of the Grassmann manifold given in Section 2.
3.1 Principal bundle structure of Mr(Rk×r)
For Z∈Mr(Rk×r), we define a neighbourhood of Z as
[TABLE]
From Proposition 2.2, we know that for a given matrix W∈VZ,
there exists a unique pair of matrices (X,G)∈R(k−r)×r×GLr such that
W=(Z+Z⊥X)G. Therefore,
[TABLE]
It allows us to introduce a parametrisation ξZ−1 (see Figure 1) defined through the bijection
[TABLE]
such that
[TABLE]
for (X,G)∈R(k−r)×r×GLr, and
[TABLE]
for W∈VZ.
In particular,
[TABLE]
Theorem 3.1**.**
The collection Bk,r:={(VZ,ξZ):Z∈Mr(Rk×r)} is
an analytic atlas for Mr(Rk×r),
and hence (Mr(Rk×r),Bk,r) it is an analytic kr-dimensional
manifold modelled on R(k−r)×r×Rr×r.
Proof.
{VZ}Z∈Mr(Rk×r) is clearly a covering
of Mr(Rk×r). Moreover,
since ξZ is bijective from VZ to R(k−r)×r×GLr we claim that if VZ∩VZ~=∅ for Z,Z~∈Mr(Rk×r), then the following statements hold:
- i)
ξZ(VZ∩VZ~) and ξZ~(VZ∩VZ~) are open sets
in R(k−r)×r×GLr and
2. ii)
the map ξZ~∘ξZ−1 is analytic from ξZ(VZ∩VZ~)⊂R(k−r)×r×GLr to ξZ~(VZ∩VZ~)⊂R(k−r)×r×GLr.
In this proof, we equip Rk×r with the topology τRk×r induced by matrix norms. For any Z∈Mr(Rk×r), VZ={W∈Rk×r:det(ZTW)=0} is the inverse image of the open set R∖{0} by the continuous map W↦det(ZTW) from Rk×r to R, and therefore, VZ is an open set of Rk×r.
Since VZ and VZ~ are open sets in Rk×r, VZ∩VZ~ is also an open set in Rk×r and since ξZ−1 is a continuous map from R(k−r)×r×GLr to Rk×r, the set ξZ(VZ∩VZ~), as the inverse image of an open set by a continuous map, is an open set in R(k−r)×r×GLr. Similarly, ξZ~(VZ∩VZ~) is an open set.
Now let (X,G)∈R(k−r)×r×GLr such that
ξZ−1(X,G)∈VZ∩VZ~. From the expressions of ξZ−1 and ξZ~, the map ξZ~∘ξZ−1 is defined by
[TABLE]
with ξZ−1(X,G)=(Z+Z⊥X)G,
which is clearly an analytic map.
∎
Before stating the next result, we recall the definition of a morphism between manifolds and of
a fibre bundle. We introduce notions of Cp maps and Cp manifolds,
with p∈N∪{∞} or p=ω. In the latter case, Cω means analytic.
Definition 3.2**.**
Let (M,A) and (N,B)
be two Cp manifolds. Let F:M→N be a map.
We say that F is a Cp morphism between (M,A) and (N,B) if given m∈M, there exists a chart (U,φ)∈A
such that m∈U and a chart (W,ψ)∈B such that F(m)∈W where F(U)⊂W, and the map
[TABLE]
is a map of class Cp. If it is
a Cp diffeomorphism, then we
say that F is a Cp diffeomorphism between manifolds.
We say that ψ∘F∘φ−1
is a representation of F using a system of local coordinates given by the charts
(U,φ) and (W,ψ).
Definition 3.3**.**
Let B be a Cp manifold with atlas A={(Ub,φb):b∈B}, and let F be a manifold.
A Cp
fibre bundle E with base B and typical fibre F
is a Cp manifold which is locally a product manifold, that is, there exists a
surjective morphism π:E⟶B such that for each b∈B there is a Cp diffeomorphism between manifolds
[TABLE]
such that pb∘χb=π where pb:Ub×F⟶Ub is the projection. For each b∈B, π−1(b)=Eb is called the fibre over b.
The Cp diffeomorphisms χb are called fibre
bundle charts. If p=0, E,B and F are only required to be topological spaces and {Ub:b∈B}
an open covering of B. In the case where F is a Lie group, we say that
E is a Cp principal bundle, and if
F is a vector space, we say that it is a Cp vector bundle.
Theorem 3.4**.**
*The set Mr(Rk×r) is an analytic principal
bundle with typical fibre GLr and base Gr(Rk), with a surjective
morphism between Mr(Rk×r) and Gr(Rk) given by the map colk,r.
*
Proof.
To show that it is an analytic principal bundle, we first observe that
[TABLE]
is a surjective morphism. Indeed, let Z∈Mr(Rk×r) and (VZ,ξZ)∈Bk,r and (UZ,φZ)∈Ak,r.
Noting
that
colk,r(YG)=colk,r(Y) for all Y∈SZ, we obtain that
colk,r(VZ)=UZ.
Moreover, a representation of colk,r
by using a system of local coordinates given by the charts is
[TABLE]
which is clearly an analytic map from R(k−r)×r×GLr
to R(k−r)×r such that
colk,r−1(UZ)=VZ.
Now, a representation of the morphism
[TABLE]
using the system of local coordinates given by the charts is
[TABLE]
defined by
[TABLE]
which is clearly an analytic diffeomorphism. To conclude, consider the projection
[TABLE]
and observe that (pZ∘χZ)(W)=colk,r(W) holds for all W∈VZ.
∎
3.2 Mr(Rk×r) as a submanifold and its tangent space
Here, we prove that the non-compact Stiefel
manifold Mr(Rk×r)
equipped with the topology given by the atlas Bk,r
is an embedded submanifold in Rk×r. For that, we have to prove that
the standard inclusion map
[TABLE]
as a morphism is an embedding. To see this we need to recall some definitions and results.
Definition 3.5**.**
Let F:(M,A)→(N,B)
be a morphism between Cp manifolds and let m∈M. We say that F is an immersion at m if there exists an
open neighbourhood Um of m in M such that the restriction of F to
Um induces an isomorphism from Um onto a submanifold of N. We
say that F is an immersion if it is an immersion at each point of M.
The next step is to recall the definition of the differential as a morphism
which gives a linear map between the tangent spaces of the manifolds (in local coordinates)
involved with the morphism. Let us recall that for any m∈M, we denote by TmM the tangent space of M at m (in local coordinates).
Definition 3.6**.**
Let (M,A) and (N,B) be two Cp manifolds.
Let F:(M,A)→(N,B) be a morphism of class Cp, i.e., for any m∈M,
[TABLE]
is a map of class Cp, where (U,φ)∈A
is a chart in M containing m and (W,ψ)∈B is a chart in N containing F(m). Then we define
[TABLE]
For finite dimensional manifolds we have the following criterion for immersions (see
Theorem 3.5.7 in [1]).
Proposition 3.7**.**
Let (M,A) and (N,B)
be Cp manifolds. Let
[TABLE]
be a Cp morphism
and m∈M. Then F is an immersion at m if and only if TmF is injective.
A concept related to an immersion between manifolds is given in the
following definition.
Definition 3.8**.**
Let (M,A) and (N,B)
be Cp manifolds and let f:(M,A)⟶(N,B)
be a Cp morphism. If f is an injective immersion, then f(M) is called an immersed submanifold of N.
Finally, we give the definition of embedding.
Definition 3.9**.**
Let (M,A) and (N,B)
be Cp manifolds and let f:(M,A)⟶(N,B)
be a Cp morphism. If f is an injective immersion, and
f:(M,τA)⟶(f(M),τB∣f(M))
is a topological homeomorphism, then we say that f is an embedding and
f(M) is called an embedded submanifold of N.
We first note that the representation of the inclusion map i using the system of local coordinates given by the charts (VZ,ξZ)∈Bk,r
in Mr(Rk×r) and (Rk×r,idRk×r) in Rk×r
is
[TABLE]
Then the tangent map TZi at Z=ξZ−1(0,idr), defined by TZi=D(i∘ξZ−1)(0,idr), is
[TABLE]
Proposition 3.10**.**
The tangent map TZi:R(k−r)×r×Rr×r→Rk×r at Z∈Mr(Rk×r) is a linear isomorphism, with inverse (TZi)−1 given by
[TABLE]
for Z˙∈Rk×r.
Furthermore, the standard inclusion map i is an embedding from Mr(Rk×r) to Rk×r.
Proof.
Let us assume that
TZi(X˙,G˙)=Z⊥X˙+ZG˙=0.
Multiplying this equality by Z+ and Z⊥+ on the left, we obtain G˙=0 and X˙=0 respectively, which implies that
TZi is injective. To prove that it is
also surjective, we consider a matrix Z˙∈Rk×r and observe that X˙=Z⊥+Z˙∈R(k−r)×r and G˙=Z+Z˙∈Rr×r is such that
TZi(X˙,G˙)=Z˙.
Since TZi is injective, the inclusion map i is an immersion.
To prove that it is an
embedding we equip Mr(Rk×r) with the topology τBk,r given by the atlas and we equip Rk×r with the topology τRk×r induced by matrix norms.
We need to check that
[TABLE]
is a topological homeomorphism. Since the topology in (Mr(Rk×r),τBk,r)
has the property that each local chart ξZ is indeed a homeomorphism from VZ in Mr(Rk×r) to ξZ(VZ)=R(k−r)×r×GLr (see Section 1.1), we only need to show that the bijection
(i∘ξZ−1):R(k−r)×r×GLr→VZ⊂Rk×r given by
[TABLE]
is a topological homeomorphism for all Z∈Mr(Rk×r).
Observe that D(i∘ξZ−1)(X,G)∈L(R(k−r)×r×Rr×r,Rk×r) is given by
[TABLE]
Assume that Z⊥X˙G+(Z+Z⊥X)G˙=0. Multiplying
this equality by Z+ on the left we obtain G˙=0, and hence
Z⊥X˙G=0. Multiplying by Z⊥+ on the left
we obtain X˙G=0. Thus X˙=0 and
as a consequence D(i∘ξZ−1)(X,G) is a linear isomorphism
for each (X,G)∈R(k−r)×r×GLr. The inverse function
theorem says us that (i∘ξZ−1) is a diffeomorphism, in particular a homeomorphism,
and hence i is an embedding.
∎
The tangent space to Mr(Rk×r) at Z is the image through TZi of the tangent space at Z in local coordinates TZMr(Rk×r)=R(k−r)×r×Rr×r, i.e.
[TABLE]
and can be decomposed into a vertical tangent space
[TABLE]
and an horizontal tangent space
[TABLE]
3.3 Lie group structure of neighbourhoods VZ
We here prove that each neighbourhood VZ of Mr(Rk×r) has the structure of a Lie group. For that, we first note that
VZ can be identified with
SZ×GLr, with SZ given by (9). Noting that SZ can be identified with the Lie group GZ defined in (11), we then have that VZ can be identified with a product of two Lie groups GZ×GLr, which is a Lie group with the
group operation ⊙Z given by
[TABLE]
for X,X′∈R(k−r)×r and G,G′∈GLr.
It allows us to define a group operation ⋆Z over VZ defined
for W=ξZ−1(X,G) and W′=ξZ−1(X′,G′) by
[TABLE]
and to state the following result.
Theorem 3.11**.**
The set VZ together with the group operation ⋆Z defined by (15)
is a Lie group and the map
ηZ:VZ⟶GZ×GLr
given by
[TABLE]
is a Lie group isomorphism.
4 The principal bundle Mr(Rn×m) for 0<r<min(m,n)
In this section, we give a geometric description of the set of matrices Mr(Rn×m) with rank r<min(m,n).
4.1 Mr(Rn×m) as a principal bundle
For Z∈Mr(Rn×m), there exists
U∈Mr(Rn×r), V∈Mr(Rm×r),
and G∈GLr such that
[TABLE]
where the column space of Z is coln,r(U) and the row space of Z is colm,r(V).
Let us first introduce the surjective map
[TABLE]
The set
[TABLE]
can be identified with GLr. Let us consider
U⊥∈Mn−r(Rn×(n−r)) such that UTU⊥=0 and
V⊥∈Mm−r(Rm×(m−r)) such that VTV⊥=0. Then we define a neighbourhood of UGVT
in the set Mr(Rn×m) by
[TABLE]
where UU and UV are the neighbourhoods of coln,r(U) and colm,r(V) respectively (see Section 2.2).
Noting that UU=φU−1(R(n−r)×r)=coln,r(SU) and UV=φV−1(R(m−r)×r)=colm,r(SV), where SU and SV are the affine cross sections of U and V respectively (defined by (4)), the neighbourhood of UGVT can be written
[TABLE]
We can associate to UZ the parametrisation θZ−1 given by the chart (see Figure 2)
[TABLE]
defined by
[TABLE]
for (X,Y,H)∈R(n−r)×r×R(m−r)×r×GLr, and
[TABLE]
for A∈UZ.
In particular, we have θZ−1(0,0,G)=Z. We point out that
UZ=UZ′ and θZ=θZ′ for every Z′=UG′VT with G′=G.
Theorem 4.1**.**
The collection Bn,m,r:={(UZ,θZ):Z∈Mr(Rn×m)}
is
an analytic atlas for Mr(Rn×m) and hence (Mr(Rn×m),Bn,m,r)
is an analytic r(n+m−r)-dimensional manifold modelled on R(n−r)×r×R(m−r)×r×Rr×r.
Proof.
{UZ}Z∈Mr(Rn×m) is clearly a covering
of Mr(Rn×m). Moreover,
since θZ is bijective from UZ to R(n−r)×r×R(m−r)×r×GLr, we claim that if UZ∩UZ~=∅ for Z=UGVT and Z~=U~G~V~T∈Mr(Rn×m), then the following statements hold:
- i)
θZ(UZ∩UZ~) and θZ~(UZ∩UZ~) are open sets
in R(n−r)×r×R(m−r)×r×GLr and
2. ii)
the map θZ~∘θZ−1 is analytic from θZ(UZ∩UZ~)⊂R(n−r)×r×R(m−r)×r×GLr to θZ~(UZ∩UZ~)⊂R(n−r)×r×R(m−r)×r×GLr.
In this proof, we equip Rn×m with the topology τRn×m induced by matrix norms.
We first observe that the set
UZ={A∈Mr(Rn×m):det(UTAV)=0}=OZ∩Mr(Rn×m), where OZ={A∈Rn×m:det(UTAV)=0}, as the inverse image of the open set R∖{0} through the continuous map A↦det(UTAV) from Rn×m to R,
is an open set in Rn×m. In the same way, we have that UZ~=OZ~∩Mr(Rn×m), with UZ~ an open set in Rn×m.
Since UZ∩UZ~=OZ∩OZ~∩Mr(Rn×m), and since the image of θZ−1 is in Mr(Rn×m), we have
[TABLE]
the inverse image through θZ−1 of the open set OZ∩OZ~ in Rn×m. Since θZ−1 is a continuous map from R(n−r)×r×R(m−r)×r×GLr to Rn×m, we deduce that θZ(UZ∩UZ~) is an open set in R(n−r)×r×R(m−r)×r×GLr.
Similarly, θZ~(UZ∩UZ~) is an open set in R(n−r)×r×R(m−r)×r×GLr.
Now, let (X,Y,H)∈R(n−r)×r×R(m−r)×r×GLr such that
θZ−1(X,Y,H)∈UZ∩UZ~. From the expressions of θZ−1 and θZ~, the map θZ~∘θZ−1 is defined by
[TABLE]
with θZ−1(X,Y,H)=(U+U⊥X)H(V+V⊥Y)T, which is clearly an analytic map.
∎
Theorem 4.2**.**
The set Mr(Rn×m) is an analytic principal bundle with typical fibre GLr and base Gr(Rn)×Gr(Rm) with surjective morphism ϱr between Mr(Rn×m) and Gr(Rn)×Gr(Rm) given by ϱr.
Proof.
To prove that it is an analytic principal bundle, we consider the surjective map
[TABLE]
the atlas An,r:={(UU,φU):U∈Mr(Rn×r)} of Gr(Rn) and
the atlas Am,r:={(UV,φV):V∈Mr(Rm×r)} of Gr(Rm). Recall that
[TABLE]
with k=n if Z=U or k=m if Z=V, and hence
[TABLE]
Observe that for each fixed G∈GLr, we have that
ϱr−1(UU,UV)=UZ,
where Z=UGVT. Since UZ=UZ′ holds for Z′=UG′VT,
where G′∈GLr, the map
[TABLE]
defined by
[TABLE]
is independent of the choice of Z=UGVT, where G∈GLr. Now, the representation of χZ in local coordinates
is the map
[TABLE]
given by ((φU×φV×idRr×r)∘χZ∘θZ−1)(X,Y,H)=(X,Y,H), which
is an analytic diffeomorphism. Moreover, let pZ:UU×UV×GLr⟶UU×UV be the projection over the two first components. Then
[TABLE]
and the theorem follows.
∎
4.2 Mr(Rn×m) as a submanifold and its tangent space
Here, we prove that Mr(Rn×m) equipped with the topology given by the atlas
Bn,m,r is an embedded submanifold in Rn×m.
For that, we have to prove that the standard inclusion map i:Mr(Rn×m)→Rn×m is an embedding. Noting that the inclusion map restricted to the neighbourhood UZ of Z=UGVT is identified with
[TABLE]
the tangent map TZi at Z=θZ−1(0,0,G), defined by TZi=D(i∘θZ−1)(0,0,G), is
[TABLE]
Proposition 4.3**.**
The tangent map TZi:R(n−r)×r×R(m−r)×r×Rr×r→Rn×m at Z=UGVT∈Mr(Rn×m) is a linear isomorphism with inverse (TZi)−1 given by
[TABLE]
for Z˙∈Rn×m.
Furthermore, the standard inclusion map i is an embedding from Mr(Rn×m)
to Rn×m.
Proof.
Let us suppose that
TZi(X˙,Y˙,H˙)=0.
Multiplying this equality by (U⊥)+ and U+ on the left leads to
[TABLE]
respectively.
By multiplying the first equation by (V+)T on the right, we obtain X˙=0. By multiplying
the second equation on the right by (V+)T and (V⊥+)T, we respectively obtain H˙=0 and Y˙=0.
Then, TZi is injective and then i is an immersion. For Z˙∈Rn×m, we note that X˙=U⊥+Z˙(V+)TG−1∈Rn×r, Y˙=V⊥+Z˙T(U+)TG−T∈Rm×r, and G˙=U+Z˙(V+)T∈Rr×r is such that TZi(X˙,Y˙,G˙)=Z˙, then TZi is also surjective.
Let us now equip Mr(Rn×m) with the topology τBn,m,r given by the atlas and Rn×m with the topology τRn×m induced by matrix norms. We have to prove that
[TABLE]
is a topological isomorphism. The topology in (Mr(Rn×m),τBn,m,r)
is such that a local chart θZ is a homeomorphism from UZ⊂Mr(Rn×m) to θZ(UZ)=R(n−r)×r×R(m−r)×r×GLr (see Section 1.1).
Then, to prove that the map i is an embedding, we need to show that the bijection
[TABLE]
is a topological homeomorphism. For that, observe that its differential
[TABLE]
at (X,Y,H)∈R(n−r)×r×R(m−r)×r×GLr is given by
[TABLE]
Assume that
[TABLE]
Multiplying on the left by U+ and on the right by (V+)T, we obtain
H˙=0.
Multiplying on the left by U⊥+ and on the right by (V+)T we deduce
that X˙H=0, that is, X˙=0.
Finally, multiplying on the left by U+ and on the right by (V⊥+)T
we obtain HY˙T=0, and hence Y˙=0. Thus,
D(i∘θZ−1)(X,Y,H) is a linear isomorphism
from R(n−r)×r×R(m−r)×r×Rr×r to
D(i∘θZ−1)(X,Y,H)[R(n−r)×r×R(m−r)×r×Rr×r]
for each
(X,Y,H)∈R(n−r)×r×R(m−r)×r×GLr.
The inverse function theorem says us that
(i∘θZ−1) is a diffeomorphism from R(n−r)×r×R(m−r)×r×GLr to UZ=(i∘θZ−1)(R(n−r)×r×R(m−r)×r×GLr) and in particular, it is a topological homeomorphism.
In consequence, the map i is an embedding.
∎
The tangent space to Mr(Rn×m) at Z=UGVT, which is the image through TZi of the tangent space in local coordinates TZMr(Rn×m)=R(n−r)×r×R(m−r)×r×Rr×r, is
[TABLE]
and can be decomposed into a vertical tangent space
[TABLE]
and an horizontal tangent space
[TABLE]
4.3 Lie group structure of neighbourhoods UZ
We here prove that Mr(Rn×m) has locally the structure of a Lie group by proving that the neighbourhoods UZ can be identified with Lie groups.
Let Z=UGVT∈Mr(Rn×m). We first note that
UZ can be identified with
SU×SV×GLr, with SU and SV defined by (9). Noting that SU and SV can be identified with Lie groups GU and GV defined in (11), we then have that UZ can be identified with a product of three Lie groups, which is a Lie group with the
group operation ⊙Z given by
[TABLE]
It allows us to define a group operation ⋆Z over UZ
defined for W=θZ−1(X,Y,G) and W′=θZ−1(X′,Y′,G′) by
[TABLE]
and to state the following result.
Theorem 4.4**.**
Let Z=UGVT∈Mr(Rn×m).
Then
the set UZ together with the group operation ⋆Z defined by (17)
is a Lie group with identity element UVT,
and the map ηZ:UZ→GU×GV×GLr
given by
[TABLE]
is a Lie group isomorphism.