Principal bundle structure of matrix manifolds

Marie Billaud-Friess; Antonio Falco; Anthony Nouy

arXiv:1705.04093·math.DG·March 25, 2022

Principal bundle structure of matrix manifolds

Marie Billaud-Friess, Antonio Falco, Anthony Nouy

PDF

TL;DR

This paper presents a new geometric framework for matrix manifolds of fixed rank, modeling them as principal bundles with explicit atlases, which enhances understanding of their structure and topology.

Contribution

It introduces a novel geometric description of matrix manifolds as principal bundles, avoiding equivalence classes and providing explicit atlases and topological properties.

Findings

01

Matrix manifolds are modeled as principal bundles with explicit atlases.

02

The topology makes matrix rank a continuous map.

03

The set of fixed-rank matrices forms an embedded submanifold.

Abstract

In this paper, we introduce a new geometric description of the manifolds of matrices of fixed rank. The starting point is a geometric description of the Grassmann manifold $G_{r} (R^{k})$ of linear subspaces of dimension $r < k$ in $R^{k}$ which avoids the use of equivalence classes. The set $G_{r} (R^{k})$ is equipped with an atlas which provides it with the structure of an analytic manifold modelled on $R^{(k - r) \times r}$ . Then we define an atlas for the set $M_{r} (R^{k \times r})$ of full rank matrices and prove that the resulting manifold is an analytic principal bundle with base $G_{r} (R^{k})$ and typical fibre $GL_{r}$ , the general linear group of invertible matrices in $R^{k \times k}$ . Finally, we define an atlas for the set $M_{r} (R^{n \times m})$ of non-full rank matrices and…

Equations206

M_{r} (R^{n \times m}) = {Z \in R^{n \times m} : rank (Z) = r} .

M_{r} (R^{n \times m}) = {Z \in R^{n \times m} : rank (Z) = r} .

Z = U G V^{T},

Z = U G V^{T},

φ_{α^{'}} \circ φ_{α}^{- 1} : φ_{α} (U_{α} \cap U_{α^{'}}) ⟶ φ_{α^{'}} (U_{α} \cap U_{α^{'}})

φ_{α^{'}} \circ φ_{α}^{- 1} : φ_{α} (U_{α} \cap U_{α^{'}}) ⟶ φ_{α^{'}} (U_{α} \cap U_{α^{'}})

G (R^{k}) = 0 \leq r \leq k ⋃ G_{r} (R^{k}),

G (R^{k}) = 0 \leq r \leq k ⋃ G_{r} (R^{k}),

τ_{A} := {φ_{α}^{- 1} (O) : α \in A and O an open set in X_{α}},

τ_{A} := {φ_{α}^{- 1} (O) : α \in A and O an open set in X_{α}},

φ_{α} : (U_{α}, τ_{A} ∣_{U_{α}}) ⟶ (X_{α}, τ_{R^{N_{α}}} ∣_{X_{α}}),

φ_{α} : (U_{α}, τ_{A} ∣_{U_{α}}) ⟶ (X_{α}, τ_{R^{N_{α}}} ∣_{X_{α}}),

R^{n \times m} = 0 \leq r \leq m i n {n, m} ⋃ M_{r} (R^{n \times m}),

R^{n \times m} = 0 \leq r \leq m i n {n, m} ⋃ M_{r} (R^{n \times m}),

τ_{R^{n \times m}}^{*} = 0 \leq r \leq m i n {n, m} ⋃ τ_{R^{n \times m}} ∣_{M_{r} (R^{n \times m})},

τ_{R^{n \times m}}^{*} = 0 \leq r \leq m i n {n, m} ⋃ τ_{R^{n \times m}} ∣_{M_{r} (R^{n \times m})},

φ_{Z} : U_{Z} \to R^{(k - r) \times r}, φ_{Z}^{- 1} (X) = col_{k, r} (Z + Z_{⊥} X),

φ_{Z} : U_{Z} \to R^{(k - r) \times r}, φ_{Z}^{- 1} (X) = col_{k, r} (Z + Z_{⊥} X),

ξ_{Z} : V_{Z} \to R^{(k - r) \times r} \times GL_{r}, ξ_{Z}^{- 1} (X, G) = (Z + Z_{⊥} X) G,

ξ_{Z} : V_{Z} \to R^{(k - r) \times r} \times GL_{r}, ξ_{Z}^{- 1} (X, G) = (Z + Z_{⊥} X) G,

θ_{Z} : U_{Z} \to R^{(n - r) \times r} \times R^{(m - r) \times r} \times GL_{r}, θ_{Z}^{- 1} (X, Y, H) = (U + U_{⊥} X) H (V + V_{⊥} Y),

θ_{Z} : U_{Z} \to R^{(n - r) \times r} \times R^{(m - r) \times r} \times GL_{r}, θ_{Z}^{- 1} (X, Y, H) = (U + U_{⊥} X) H (V + V_{⊥} Y),

G_{r} (R^{k}) = {V \subset R^{k} : V is a linear subspace with dim (V) = r},

G_{r} (R^{k}) = {V \subset R^{k} : V is a linear subspace with dim (V) = r},

col_{k, r} : M_{r} (R^{k \times r}) ⟶ G_{r} (R^{k}), Z \mapsto col_{k, r} (Z),

col_{k, r} : M_{r} (R^{k \times r}) ⟶ G_{r} (R^{k}), Z \mapsto col_{k, r} (Z),

Z GL_{r} := {Z G : G \in GL_{r}} .

Z GL_{r} := {Z G : G \in GL_{r}} .

S_{Z} := {W \in M_{r} (R^{k \times r}) : Z^{T} W = Z^{T} Z},

S_{Z} := {W \in M_{r} (R^{k \times r}) : Z^{T} W = Z^{T} Z},

S_{Z} = {Z + Z_{⊥} X : X \in R^{(k - r) \times r}},

S_{Z} = {Z + Z_{⊥} X : X \in R^{(k - r) \times r}},

η_{Z} : R^{(k - r) \times r} ⟶ S_{Z}, X \mapsto Z + Z_{⊥} X

η_{Z} : R^{(k - r) \times r} ⟶ S_{Z}, X \mapsto Z + Z_{⊥} X

W GL_{r} \cap S_{Z} = {W G_{W}^{- 1}}

W GL_{r} \cap S_{Z} = {W G_{W}^{- 1}}

U_{Z} := col_{k, r} (S_{Z}) = {col_{k, r} (W) : W \in S_{Z}}

U_{Z} := col_{k, r} (S_{Z}) = {col_{k, r} (W) : W \in S_{Z}}

φ_{Z} := (col_{k, r} \circ η_{Z})^{- 1} : U_{Z} \to R^{(k - r) \times r}

φ_{Z} := (col_{k, r} \circ η_{Z})^{- 1} : U_{Z} \to R^{(k - r) \times r}

φ_{Z}^{- 1} (X) = col_{k, r} (Z + Z_{⊥} X)

φ_{Z}^{- 1} (X) = col_{k, r} (Z + Z_{⊥} X)

Z^{+} := (Z^{T} Z)^{- 1} Z^{T} \in M_{r} (R^{r \times k}) .

Z^{+} := (Z^{T} Z)^{- 1} Z^{T} \in M_{r} (R^{r \times k}) .

g_{Z} := {Z_{⊥} X Z^{+} : X \in R^{(k - r) \times r}} \subset R^{k \times k} .

g_{Z} := {Z_{⊥} X Z^{+} : X \in R^{(k - r) \times r}} \subset R^{k \times k} .

(Z_{⊥} X Z^{+}) (Z_{⊥} \tilde{X} Z^{+}) = 0

(Z_{⊥} X Z^{+}) (Z_{⊥} \tilde{X} Z^{+}) = 0

exp (Z_{⊥} X Z^{+}) = i d_{k} + Z_{⊥} X Z^{+},

exp (Z_{⊥} X Z^{+}) = i d_{k} + Z_{⊥} X Z^{+},

exp (Z_{⊥} X Z^{+}) Z = Z + Z_{⊥} X,

exp (Z_{⊥} X Z^{+}) Z = Z + Z_{⊥} X,

exp (Z_{⊥} X Z^{+}) Z_{⊥} = Z_{⊥}

exp (Z_{⊥} X Z^{+}) Z_{⊥} = Z_{⊥}

S_{Z} = {exp (Z_{⊥} X Z^{+}) Z : X \in R^{(k - r) \times r}},

S_{Z} = {exp (Z_{⊥} X Z^{+}) Z : X \in R^{(k - r) \times r}},

[exp (Z_{⊥} X Z^{+}) Z ∣ Z_{⊥}] \in GL_{k}

[exp (Z_{⊥} X Z^{+}) Z ∣ Z_{⊥}] \in GL_{k}

[exp (Z_{⊥} X Z^{+}) Z ∣ Z_{⊥}] = [exp (Z_{⊥} X Z^{+}) Z ∣ exp (Z_{⊥} X Z^{+}) Z_{⊥}] = exp (Z_{⊥} X Z^{+}) [Z ∣ Z_{⊥}] .

[exp (Z_{⊥} X Z^{+}) Z ∣ Z_{⊥}] = [exp (Z_{⊥} X Z^{+}) Z ∣ exp (Z_{⊥} X Z^{+}) Z_{⊥}] = exp (Z_{⊥} X Z^{+}) [Z ∣ Z_{⊥}] .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Principal bundle structure of matrix manifolds

M. Billaud-Friess , A. Falcó11footnotemark: 1 , A. Nouy11footnotemark: 1 Department of Computer Science and Mathematics, GeM, Ecole Centrale de Nantes, 1 rue de la Noë, BP 92101, 44321 Nantes Cedex 3, France. Email: [marie.billaud-friess,anthony.nouy]@ec-nantes.fr.Departamento de Matemáticas, Física y Ciencias Tecnológicas, Universidad CEU Cardenal Herrera, San Bartolomé 55, 46115 Alfara del Patriarca (Valencia), Spain. E-mail: [email protected].

Abstract

In this paper, we introduce a new geometric description of the manifolds of matrices of fixed rank. The starting point is a geometric description of the Grassmann manifold $\mathbb{G}_{r}(\mathbb{R}^{k})$ of linear subspaces of dimension $r<k$ in $\mathbb{R}^{k}$ which avoids the use of equivalence classes. The set $\mathbb{G}_{r}(\mathbb{R}^{k})$ is equipped with an atlas which provides it with the structure of an analytic manifold modelled on $\mathbb{R}^{(k-r)\times r}$ . Then we define an atlas for the set $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ of full rank matrices and prove that the resulting manifold is an analytic principal bundle with base $\mathbb{G}_{r}(\mathbb{R}^{k})$ and typical fibre $\mathrm{GL}_{r}$ , the general linear group of invertible matrices in $\mathbb{R}^{k\times k}$ . Finally, we define an atlas for the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ of non-full rank matrices and prove that the resulting manifold is an analytic principal bundle with base $\mathbb{G}_{r}(\mathbb{R}^{n})\times\mathbb{G}_{r}(\mathbb{R}^{m})$ and typical fibre $\mathrm{GL}_{r}$ . The atlas of $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is indexed on the manifold itself, which allows a natural definition of a neighbourhood for a given matrix, this neighbourhood being proved to possess the structure of a Lie group. Moreover, the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ equipped with the topology induced by the atlas is proven to be an embedded submanifold of the matrix space $\mathbb{R}^{n\times m}$ equipped with the subspace topology. The proposed geometric description then results in a description of the matrix space $\mathbb{R}^{n\times m}$ , seen as the union of manifolds $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ , as an analytic manifold equipped with a topology for which the matrix rank is a continuous map.

Keywords: Matrix manifolds, Low-rank matrices, Grassmann manifold, Principal bundles.

1 Introduction

Low-rank matrices appear in many applications involving high-dimensional data. Low-rank models are commonly used in statistics, machine learning or data analysis (see [18] for a recent survey). Also, low-rank approximation of matrices is the cornerstone of many modern numerical methods for high-dimensional problems in computational science, such as model order reduction methods for dynamical systems, or parameter-dependent or stochastic equations [4, 5, 14, 6].

These applications yield problems of approximation or optimization in the sets of matrices with fixed rank

[TABLE]

A usual geometric approach is to endow the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ with the structure of a Riemannian manifold [16, 3], which is seen as an embedded submanifold of $\mathbb{R}^{n\times m}$ equipped with the topology $\tau_{\mathbb{R}^{n\times m}}$ given by matrix norms. Standard algorithms then work in the ambient matrix space $\mathbb{R}^{n\times m}$ and do not rely on an explicit geometric description of the manifold using local charts (see, e.g., [17, 12, 13, 8]). However, the matrix rank considered as a map is not continuous for the topology $\tau_{\mathbb{R}^{n\times m}}$ , which can yield undesirable numerical issues.

The purpose of this paper is to propose a new geometric description of the sets of matrices with fixed rank which is amenable for numerical use, and which relies on the natural parametrization of matrices in $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ given by

[TABLE]

where $U\in\mathbb{R}^{n\times r}$ and $V\in\mathbb{R}^{m\times r}$ are matrices with full rank $r<\min\{n,m\}$ , and $G\in\mathbb{R}^{r\times r}$ is a non singular matrix. The set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is here endowed with the structure of analytic principal bundle, with an explicit description of local charts. This results in a description of the matrix space $\mathbb{R}^{n\times m}$ as an analytic manifold with a topology induced by local charts which is different from $\tau_{\mathbb{R}^{n\times m}}$ and for which the rank is a continuous map. Note that the representation (1) of a matrix $Z$ is not unique because $Z=(UP)(P^{-1}GP^{T})(VP^{-1})^{T}$ holds for every invertible matrix $P$ in $\mathbb{R}^{r\times r}$ . An argument used to dodge this undesirable property is the possibility to uniquely define a tangent space (see for example Section 2.1 in [8]), which is a prerequisite for standard algorithms on differentiable manifolds. The geometric description proposed in this paper avoids this undesirable property. Indeed, the system of local charts for the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is indexed on the set itself. This allows a natural definition of a neighbourhood for a matrix where all matrices admit a unique representation.

The present work opens the route for new numerical methods for optimization or dynamical low-rank approximation, with algorithms working in local coordinates and avoiding the use of a Riemannian structure, such as in [10], where a framework is introduced for generalising iterative methods from Euclidean space to manifolds which ensures that local convergence rates are preserved. The introduction of a principal bundle representation of matrix manifolds is also motivated by the importance of this geometric structure in the concept of gauge potential in physics [11].

We would point out that the proposed geometric description has a natural extension to the case of fixed-rank operators on infinite dimensional spaces and is consistent with the geometric description of manifolds of tensors with fixed rank proposed by Falcó, Hackbush and Nouy [7], in a tensor Banach space framework.

Before introducing the main results and outline of the paper, we recall some elements of geometry.

1.1 Elements of geometry

In this paper, we follow the approach of Serge Lang [9] for the definition of a manifold $\mathbb{M}$ . In this framework, a set $\mathbb{M}$ is equipped with an atlas which gives $\mathbb{M}$ the structure of a topological space, with a topology induced by local charts, and the structure of differentiable manifold compatible with this topology. More precisely, the starting point is the definition of a collection of non-empty subsets $U_{\alpha}\subset\mathbb{M}$ , with $\alpha$ in a set $A$ , such that $\{U_{\alpha}\}_{\alpha\in A}$ is a covering of $\mathbb{M}$ . The next step is the explicit construction for any $\alpha\in A$ of a local chart $\varphi_{\alpha}$ which is a bijection from $U_{\alpha}$ to an open set $X_{\alpha}$ of the finite dimensional space $\mathbb{R}^{N_{\alpha}}$ such that for any pair $\alpha,\alpha^{\prime}\in\mathbb{M}$ such that $U_{\alpha}\cap U_{\alpha^{\prime}}\neq\emptyset$ , the following properties hold:

(i)

$\varphi_{\alpha}(U_{\alpha}\cap U_{\alpha^{\prime}})$ and $\varphi_{\alpha^{\prime}}(U_{\alpha}\cap U_{\alpha^{\prime}})$ are open sets in $X_{\alpha}$ and $X_{\alpha^{\prime}}$ respectively, and 2. (ii)

the map

[TABLE]

is a $\mathcal{C}^{p}$ differentiable diffeomorphism, with $p\in\mathbb{N}\cup\{\infty\}$ or $p=\omega$ when the map is analytic.

Under the above assumptions, the set $\mathcal{A}:=\{(U_{\alpha},\varphi_{\alpha}):\alpha\in A\}$ is an atlas which endows $\mathbb{M}$ with a structure of $\mathcal{C}^{p}$ manifold. Then we say that $(\mathbb{M},\mathcal{A})$ is a $\mathcal{C}^{p}$ manifold, or an analytic manifold when $p=\omega$ . A consequence of the condition $(ii)$ is that when $U_{\alpha}\cap U_{\alpha^{\prime}}\neq\emptyset$ holds for $\alpha,\alpha^{\prime}\in A$ , then $N_{\alpha}=N_{\alpha^{\prime}}.$ In the particular case where $N_{\alpha}=N$ for all $\alpha\in A$ , we say that $(\mathbb{M},\mathcal{A})$ is a $\mathcal{C}^{p}$ manifold modelled on $\mathbb{R}^{N}.$ Otherwise, we say that it is a manifold not modelled on a particular finite-dimensional space. A paradigmatic example is the Grassmann manifold $\mathbb{G}(\mathbb{R}^{k})$ of all linear subspaces of $\mathbb{R}^{k}$ , such that

[TABLE]

where $\mathbb{G}_{0}(\mathbb{R}^{k})=\{0\}$ and $\mathbb{G}_{k}(\mathbb{R}^{k})=\{\mathbb{R}^{k}\}$ are trivial manifolds and $\mathbb{G}_{r}(\mathbb{R}^{k})$ is a manifold modelled on the linear space $\mathbb{R}^{(k-r)\times r}$ for $0<r<k.$ In consequence, $\mathbb{G}(\mathbb{R}^{k})$ is a manifold not modelled on a particular finite-dimensional space.

The atlas also endows $\mathbb{M}$ with a topology given by

[TABLE]

which makes $(\mathbb{M},\tau_{\mathcal{A}})$ a topological space where each local chart

[TABLE]

considered as a map between topological spaces, is a homeomorphism.111Here $(\mathfrak{X},\tau)$ denotes a topological space and if $\mathfrak{X}^{\prime}\subset\mathfrak{X}$ , then $\tau|_{\mathfrak{X}^{\prime}}$ denotes the subspace topology.

1.2 Main results and outline

Our first remark is that the matrix space $\mathbb{R}^{n\times m}$ is an analytic manifold modelled on itself and its geometric structure is fully compatible with the topology $\tau_{\mathbb{R}^{n\times m}}$ induced by a matrix norm. In this paper, we define an atlas on $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ which gives this set the structure of an analytic manifold, with a topology induced by the atlas fully compatible with the subspace topology $\tau_{\mathbb{R}^{n\times m}}|_{\mathcal{M}_{r}(\mathbb{R}^{n\times m})}$ . This implies that $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is an embedded submanifold of the matrix manifold $\mathbb{R}^{n\times m}$ modelled on itself222Note that the set $\mathcal{M}_{0}(\mathbb{R}^{n\times m})=\{0\}$ is a trivial manifold, which is trivially embedded in $\mathbb{R}^{n\times m}.$ . For the topology $\tau_{\mathbb{R}^{n\times m}}$ , the matrix rank considered as a map is not continuous but only lower semi-continuous. However, if $\mathbb{R}^{n\times m}$ is seen as the disjoint union of sets of matrices with fixed rank,

[TABLE]

then $\mathbb{R}^{n\times m}$ has the structure of an analytic manifold not modelled on a particular finite-dimensional space equipped with a topology

[TABLE]

which is not equivalent to $\tau_{\mathbb{R}^{n\times m}}$ , and for which the matrix rank is a continuous map.

Note that in the case when $r=n=m$ , the set $\mathcal{M}_{n}(\mathbb{R}^{n\times n})$ coincides with the general linear group $\mathrm{GL}_{n}$ of invertible matrices in $\mathbb{R}^{n\times n},$ which is an analytic manifold trivially embedded in $\mathbb{R}^{n\times n}$ . In all other cases, which are addressed in this paper, our geometric description of $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ relies on a geometric description of the Grassmann manifold $\mathbb{G}_{r}(\mathbb{R}^{k})$ , with $k=n$ or $m$ .

Therefore, we start in Section 2 by introducing a geometric description of $\mathbb{G}_{r}(\mathbb{R}^{k})$ . A classical approach consists of describing $\mathbb{G}_{r}(\mathbb{R}^{k})$ as the quotient manifold $\mathcal{M}_{r}(\mathbb{R}^{k\times r})/\mathrm{GL}_{r}$ of equivalent classes of full-rank matrices $Z$ in $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ having the same column space $\mathrm{col}_{k,r}(Z)$ . Here, we avoid the use of equivalent classes and provide an explicit description of an atlas $\mathcal{A}_{k,r}=\{(\mathfrak{U}_{Z},\varphi_{Z})\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})}$ for $\mathbb{G}_{r}(\mathbb{R}^{k})$ , with local chart

[TABLE]

where $Z_{\bot}\in\mathbb{R}^{k\times(k-r)}$ is such that $Z_{\bot}^{T}Z=0$ and $\mathrm{col}_{k,r}(A)$ denotes the column space of a matrix $A\in\mathbb{R}^{k\times r}$ , and we prove that the neighbourhood $\mathfrak{U}_{Z}$ have the structure of a Lie group. This parametrization of the Grassmann manifold is introduced in [2, Section 2] but the authors do not elaborate on it.

Then in Section 3, we consider the particular case of full-rank matrices. We introduce an atlas $\mathcal{B}_{k,r}=\{(\mathcal{V}_{Z},\xi_{Z})\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})}$ for the manifold $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ of matrices with full rank $r<k$ , with local chart

[TABLE]

and prove that $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ is an analytic principal bundle with base $\mathbb{G}_{r}(\mathbb{R}^{k})$ and typical fibre $\mathrm{GL}_{r}$ . Moreover, we prove that $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ is an embedded submanifold of $(\mathbb{R}^{k\times r},\tau^{*}_{\mathbb{R}^{k\times r}})$ , and that each of the neighbourhoods $\mathcal{V}_{Z}$ have the structure of a Lie group.

Finally, in Section 4, we provide an analytic atlas $\mathcal{B}_{n,m,r}=\{(\mathcal{U}_{Z},\theta_{Z})\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})}$ for the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ of matrices $Z=UGV^{T}$ with rank $r<\min\{n,m\}$ , with local chart

[TABLE]

and we prove that $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is an analytic principal bundle with base $\mathbb{G}_{r}(\mathbb{R}^{n})\times\mathbb{G}_{r}(\mathbb{R}^{m})$ and typical fibre $\mathrm{GL}_{r}$ . Then we prove that $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is an embedded submanifold of $(\mathbb{R}^{n\times m},\tau^{*}_{\mathbb{R}^{n\times m}})$ , and that each of the neighbourhoods $\mathcal{U}_{Z}$ have the structure of a Lie group.

2 The Grassmann manifold $\mathbb{G}_{r}(\mathbb{R}^{k})$

In this section, we present a geometric description of the Grassmann manifold $\mathbb{G}_{r}(\mathbb{R}^{k})$ of all subspaces of dimension $r$ in $\mathbb{R}^{k}$ , $0<r<k$ ,

[TABLE]

with an explicit description of local charts. We first introduce the surjective map

[TABLE]

where $\mathrm{col}_{k,r}(Z)$ is the column space of the matrix $Z$ , which is the subspace spanned by the column vectors of $Z.$ Given $\mathcal{V}\in\mathbb{G}_{r}(\mathbb{R}^{k}),$ there are infinitely many matrices $Z$ such that $\mathrm{col}_{k,r}(Z)=\mathcal{V}$ . Given a matrix $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , the set of matrices in $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ having the same column space as $Z$ is

[TABLE]

2.1 An atlas for $\mathbb{G}_{r}(\mathbb{R}^{k})$

For a given matrix $Z$ in $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , we let $Z_{\bot}\in\mathcal{M}_{k-r}(\mathbb{R}^{k\times(k-r)})$ be a matrix such that $Z^{T}Z_{\bot}=0$ and we introduce an affine cross section

[TABLE]

which has the following equivalent characterization.

Lemma 2.1.

The affine cross section $\mathcal{S}_{Z}$ is characterized by

[TABLE]

and the map

[TABLE]

is bijective.

Proof.

We first observe that $Z^{T}(Z+Z_{\bot}X_{Z})=Z^{T}Z$ for all $X\in\mathbb{R}^{(k-r)\times r}$ , which implies that $\{Z+Z_{\bot}X:X\in\mathbb{R}^{(k-r)\times r}\}\subset\mathcal{S}_{Z}.$ For the other inclusion, we observe that if $W\in\mathcal{S}_{Z}$ , then $Z^{T}W=Z^{T}Z$ and hence $W-Z\in\mathrm{col}_{k,r}(Z)^{\bot}$ , the orthogonal subspace to $\mathrm{col}_{k,r}(Z)$ in $\mathbb{R}^{k}$ . Since $\mathrm{col}_{k,r}(Z)^{\bot}=\mathrm{col}_{k,k-r}\,(Z_{\bot}),$ there exists $X\in\mathbb{R}^{(k-r)\times r}$ such that $W-Z=Z_{\bot}X.$ Proving that $\eta_{Z}$ is bijective is straightforward. ∎

Proposition 2.2.

For each $W\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ such that $\det(Z^{T}W)\neq 0$ , there exists a unique $G_{W}\in\mathrm{GL}_{r}$ such that

[TABLE]

holds, which means that the set of matrices with the same column space as $W$ intersects $\mathcal{S}_{Z}$ at the single point $WG_{W}^{-1}.$ Furthermore, $G_{W}=id_{r}$ if and only if $W\in\mathcal{S}_{Z}.$

Proof.

By Lemma 2.1, a matrix $A\in W\mathrm{GL}_{r}\cap\mathcal{S}_{Z}$ is such that $A=WG_{W}^{-1}=Z+Z_{\bot}X$ for a certain $G_{W}\in\mathrm{GL}_{r}$ and a certain $X\in\mathbb{R}^{(k-r)\times r}$ . Then $Z^{T}WG_{W}^{-1}=Z^{T}Z$ and $G_{W}$ is uniquely defined by $G_{W}=(Z^{T}Z)^{-1}(Z^{T}W),$ which proves that $W\mathrm{GL}_{r}\cap\mathcal{S}_{Z}$ is the singleton $\{WG_{W}^{-1}\},$ and $G_{W}=id_{r}$ if and only if $W\in\mathcal{S}_{Z}.$ ∎

Corollary 2.3.

For each $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , the map $\mathrm{col}_{k,r}:\mathcal{S}_{Z}\longrightarrow\mathbb{G}_{r}(\mathbb{R}^{k})$ is injective.

Proof.

Let us assume the existence of $W,\tilde{W}\in\mathcal{S}_{Z}$ such that $\mathrm{col}_{k,r}(W)=\mathrm{col}_{k,r}(\tilde{W}).$ Then $W=\tilde{W}$ by Proposition 2.2. ∎

Lemma 2.1 and Corollary 2.3 allow us to construct a system of local charts for $\mathbb{G}_{r}(\mathbb{R}^{k})$ by defining for each $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ a neighbourhood of $\mathrm{col}_{k,r}(Z)$ by

[TABLE]

together with the bijective map

[TABLE]

such that

[TABLE]

for $X\in\mathbb{R}^{(k-r)\times r}$ . We denote by $Z^{+}$ the Moore-Penrose pseudo-inverse of the full rank matrix $Z\in\mathcal{M}_{r}(\mathbb{R}^{r\times k})$ , defined by

[TABLE]

It satisfies $Z^{+}Z=id_{r}$ and $Z^{+}Z_{\bot}=0$ . Moreover, $ZZ^{+}\in\mathbb{R}^{k\times k}$ is the projection onto $\mathrm{col}_{k,r}(Z)$ parallel to $\mathrm{col}_{k,r}(Z)^{\bot}$ . Finally, we have the following result.

Theorem 2.4.

The collection $\mathcal{A}_{k,r}:=\{(\mathfrak{U}_{Z},\varphi_{Z}):Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})\}$ is an analytic atlas for $\mathbb{G}_{r}(\mathbb{R}^{k})$ and hence $(\mathbb{G}_{r}(\mathbb{R}^{k}),\mathcal{A}_{k,r})$ is an analytic $r(k-r)$ -dimensional manifold modelled on $\mathbb{R}^{(k-r)\times r}$ .

Proof.

Clearly $\{\mathfrak{U}_{Z}\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})}$ is a covering of $\mathbb{G}_{r}(\mathbb{R}^{k}).$ Now let $Z$ and $\tilde{Z}$ be such that $\mathfrak{U}_{Z}\cap\mathfrak{U}_{\tilde{Z}}\neq\emptyset$ . Let $\mathcal{V}\in\mathfrak{U}_{Z}$ such that $\mathcal{V}=\varphi_{Z}^{-1}(X)=\mathrm{col}_{k,r}(Z+Z_{\bot}X)$ , with $X\in\mathbb{R}^{k\times(k-r)}$ . We can write $Z+Z_{\bot}X=(\tilde{Z}+\tilde{Z}_{\bot}\tilde{X})G$ with $G=\tilde{Z}^{+}(Z+Z_{\bot}X)$ and $\tilde{X}=\tilde{Z}_{\bot}^{+}(Z+Z_{\bot}X)G^{-1}$ . Therefore, $\mathcal{V}=\mathrm{col}_{k,r}((\tilde{Z}+\tilde{Z}_{\bot}\tilde{X})G)=\mathrm{col}_{k,r}(\tilde{Z}+\tilde{Z}_{\bot}\tilde{X})=\varphi_{\tilde{Z}}^{-1}({\tilde{X}})\in\mathfrak{U}_{\tilde{Z}}$ , which implies that $\mathfrak{U}_{Z}=\mathfrak{U}_{Z}\cap\mathfrak{U}_{\tilde{Z}}$ . Therefore, $\varphi_{Z}(\mathfrak{U}_{Z}\cap\mathfrak{U}_{\tilde{Z}})=\varphi_{Z}(\mathfrak{U}_{Z})=\mathbb{R}^{k\times(n-k)}$ is an open set. In the same way, we show that $\mathfrak{U}_{\tilde{Z}}=\mathfrak{U}_{Z}\cap\mathfrak{U}_{\tilde{Z}}$ and $\varphi_{\tilde{Z}}(\mathfrak{U}_{Z})=\mathbb{R}^{k\times(n-k)}$ is an open set. Finally, the map $\varphi_{\tilde{Z}}\circ\varphi_{Z}^{-1}$ from $\mathbb{R}^{(k-r)\times r}$ to $\mathbb{R}^{(k-r)\times r}$ is given by $\varphi_{\tilde{Z}}\circ\varphi_{Z}^{-1}(X)=\tilde{Z}_{\bot}^{+}(Z+Z_{\bot}X)G^{-1}$ , with $G=\tilde{Z}^{+}(Z+Z_{\bot}X_{Z})$ , which is clearly an analytic map. ∎

2.2 Lie group structure of neighbourhoods $\mathfrak{U}_{Z}$

Here we prove that each neighbourhood $\mathfrak{U}_{Z}$ of $\mathbb{G}_{r}(\mathbb{R}^{k})$ is a Lie group. For that, we first note that a neighbourhood $\mathfrak{U}_{Z}$ of $\mathbb{G}_{r}(\mathbb{R}^{k})$ can be identified with the set $\mathcal{S}_{Z}$ through the application $\mathrm{col}_{k,r}:\mathcal{S}_{Z}\to\mathfrak{U}_{Z}$ . The next step is to identify $\mathcal{S}_{Z}$ with a closed Lie subgroup of $\mathrm{GL}_{k},$ denoted by $\mathcal{G}_{Z},$ with associated Lie algebra $\mathfrak{g}_{Z}$ isomorphic to $\mathbb{R}^{r\times(k-r)}$ , and such that the exponential map333We recall that the matrix exponential $\exp:\mathbb{R}^{k\times k}\rightarrow\mathrm{GL}_{k}$ is defined by $\exp(A)=\sum_{n=0}^{\infty}\frac{A^{n}}{n!}.$ $\exp:\mathfrak{g}_{Z}\longrightarrow\mathcal{G}_{Z}$ is a diffeomorphism. To this end, for a given $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , we introduce the vector space

[TABLE]

The following proposition proves that $\mathfrak{g}_{Z}$ is a commutative subalgebra of $\mathbb{R}^{k\times k}.$

Proposition 2.5.

For all $X,\tilde{X}\in\mathbb{R}^{(k-r)\times r}$ ,

[TABLE]

holds, and $\mathfrak{g}_{Z}$ is a commutative subalgebra of $\mathbb{R}^{k\times k}.$ Moreover,

[TABLE]

and

[TABLE]

hold for all $X\in\mathbb{R}^{(k-r)\times r}.$

Proof.

Since $(Z_{\bot}XZ^{+})(Z_{\bot}\tilde{X}Z^{+})=0$ holds for all $X,\tilde{X}\in\mathbb{R}^{(k-r)\times r},$ the vector space $\mathfrak{g}_{Z}$ is a closed subalgebra of the matrix unitary algebra $\mathbb{R}^{k\times k}.$ As a consequence, $(Z_{\bot}XZ^{+})^{p}=0$ holds for all $X\in\mathbb{R}^{(k-r)\times r}$ and all $p\geq 2$ , which proves (6). We directly deduce (7) using $ZZ^{+}=id_{r}$ , and (8) using $Z^{+}Z_{\bot}=0$ . ∎

From Proposition 2.5 and the definition of $\mathcal{S}_{Z}$ , we obtain the following results.

Corollary 2.6.

The affine cross section $\mathcal{S}_{Z}$ satisfies

[TABLE]

and

[TABLE]

for all $X\in\mathbb{R}^{(k-r)\times r}$ , where the brackets $\left[\cdot|\cdot\right]$ are used for matrix concatenation.

Proof.

From Proposition 2.5 and (4), we obtain (9) and we can write

[TABLE]

Since $\exp(Z_{\bot}XZ^{+}),[Z|Z_{\bot}]\in\mathrm{GL}_{k}$ , (10) follows. ∎

Now we need to introduce the following definition and proposition (see [15, p.80]).

Definition 2.7.

Let $(\mathbb{K},+,\cdot)$ be a ring and let $(\mathbb{K},+)$ be its additive group. A subset $\mathbb{I}\subset\mathbb{K}$ is called a two-sided ideal (or simply an ideal) of $\mathbb{K}$ if it is an additive subgroup of $\mathbb{K}$ such that $\mathbb{I}\cdot\mathbb{K}:=\{r\cdot x:r\in\mathbb{I}\text{ and }x\in\mathbb{K}\}\subset\mathbb{I}$ and $\mathbb{K}\cdot\mathbb{I}:=\{x\cdot r:r\in\mathbb{I}\text{ and }x\in\mathbb{K}\}\subset\mathbb{I}.$

Proposition 2.8.

If $\mathfrak{g}\subset\mathfrak{h}$ is a two-sided ideal of the Lie algebra $\mathfrak{h}$ of a group $\mathcal{H}$ , then the subgroup $\mathcal{G}\subset\mathcal{H}$ generated by $\exp(\mathfrak{g})=\{\exp(G):G\in\mathfrak{g}\}$ is normal and closed, with Lie algebra $\mathfrak{h}.$

From the above proposition, we deduce the following result.

Lemma 2.9.

Let $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ and $Z_{\bot}\in\mathcal{M}_{k-r}(\mathbb{R}^{k\times(k-r)})$ be such that $Z^{T}Z_{\bot}=0.$ Then $\mathfrak{g}_{Z}\subset\mathbb{R}^{k\times k}$ is a two-sided ideal of the Lie algebra $\mathbb{R}^{k\times k}$ and hence

[TABLE]

is a closed Lie group with Lie algebra $\mathfrak{g}_{Z}.$ Furthermore, the map $\exp:\mathfrak{g}_{Z}\longrightarrow\mathcal{G}_{Z}$ is bijective.

Proof.

Consider $Z_{\bot}XZ^{+}\in\mathfrak{g}_{Z}$ and $A\in\mathbb{R}^{k\times k}$ . Noting that $Z^{+}Z=id_{r}$ and $(Z_{\bot})^{+}Z_{\bot}=id_{k-r}$ , we have that

[TABLE]

which proves that $\mathfrak{g}_{Z}\cdot\mathbb{R}^{k\times k}\subset\mathfrak{g}_{Z}.$ Similarly, we have that

[TABLE]

which proves that $\mathbb{R}^{k\times k}\cdot\mathfrak{g}_{Z}\subset\mathfrak{g}_{Z}.$ This proves that $\mathfrak{g}_{Z}$ is a two-sided ideal. The map $\exp$ is clearly surjective. To prove that it is injective, we assume $\exp(Z_{\bot}XZ^{+})=\exp(Z_{\bot}\tilde{X}Z^{+})$ for $X,\tilde{X}\in\mathbb{R}^{(k-r)\times r}$ . Then from (6), we obtain $Z+Z_{\bot}X=Z+Z_{\bot}\tilde{X}$ and hence $X=\tilde{X}$ , i.e. $Z_{\bot}XZ^{+}=Z_{\bot}\tilde{X}Z^{+}$ in $\mathfrak{g}_{Z}.$ ∎

Finally, we can prove the following result.

Theorem 2.10.

The set $\mathcal{S}_{Z}$ together with the group operation $\times_{Z}$ defined by

[TABLE]

for $X,\tilde{X}\in\mathbb{R}^{(k-r)\times r}$ is a Lie group.

Proof.

To prove that it is a Lie group, we simply note that the multiplication and inversion maps

[TABLE]

and

[TABLE]

are analytic. ∎

It follows that $\mathfrak{U}_{Z}$ can be identified with a Lie group through the map $\varphi_{Z}$ .

Theorem 2.11.

Each neighbourhood $\mathfrak{U}_{Z}$ of $\mathbb{G}_{r}(\mathbb{R}^{k})$ together with the group operation $\circ_{Z}$ defined by

[TABLE]

for $\mathcal{V},\mathcal{V^{\prime}}\in\mathfrak{U}_{Z}$ , is a Lie group and the map $\gamma_{Z}:\mathfrak{U}_{Z}\longrightarrow\mathcal{G}_{Z}$ given by

[TABLE]

is a Lie group isomorphism.

3 The non-compact Stiefel principal bundle $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$

In this section, we give a new geometric description of the set $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ of matrices with full rank $r<k$ , which is based on the geometric description of the Grassmann manifold given in Section 2.

3.1 Principal bundle structure of $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$

For $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , we define a neighbourhood of $Z$ as

[TABLE]

From Proposition 2.2, we know that for a given matrix $W\in\mathcal{V}_{Z}$ , there exists a unique pair of matrices $(X,G)\in\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ such that $W=(Z+Z_{\bot}X)G.$ Therefore,

[TABLE]

It allows us to introduce a parametrisation $\xi_{Z}^{-1}$ (see Figure 1) defined through the bijection

[TABLE]

such that

[TABLE]

for $(X,G)\in\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ , and

[TABLE]

for $W\in\mathcal{V}_{Z}$ . In particular,

[TABLE]

Theorem 3.1.

The collection $\mathcal{B}_{k,r}:=\{(\mathcal{V}_{Z},\xi_{Z}):Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})\}$ is an analytic atlas for $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , and hence $(\mathcal{M}_{r}(\mathbb{R}^{k\times r}),\mathcal{B}_{k,r})$ it is an analytic $kr$ -dimensional manifold modelled on $\mathbb{R}^{(k-r)\times r}\times\mathbb{R}^{r\times r}.$

Proof.

$\{\mathcal{V}_{Z}\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})}$ is clearly a covering of $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ . Moreover, since $\xi_{Z}$ is bijective from $\mathcal{V}_{Z}$ to $\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ we claim that if $\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}}\neq\emptyset$ for $Z,\tilde{Z}\in\mathcal{M}_{r}(\mathbb{R}^{k\times r}),$ then the following statements hold:

i)

$\xi_{Z}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})$ and $\xi_{\tilde{Z}}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})$ are open sets in $\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ and 2. ii)

the map $\xi_{\tilde{Z}}\circ\xi_{Z}^{-1}$ is analytic from $\xi_{Z}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})\subset\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ to $\xi_{\tilde{Z}}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})\subset\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ .

In this proof, we equip $\mathbb{R}^{k\times r}$ with the topology $\tau_{\mathbb{R}^{k\times r}}$ induced by matrix norms. For any $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ , $\mathcal{V}_{Z}=\{W\in\mathbb{R}^{k\times r}:\det(Z^{T}W)\neq 0\}$ is the inverse image of the open set $\mathbb{R}\setminus\{0\}$ by the continuous map $W\mapsto\det(Z^{T}W)$ from $\mathbb{R}^{k\times r}$ to $\mathbb{R}$ , and therefore, $\mathcal{V}_{Z}$ is an open set of $\mathbb{R}^{k\times r}$ . Since $\mathcal{V}_{Z}$ and $\mathcal{V}_{\tilde{Z}}$ are open sets in $\mathbb{R}^{k\times r}$ , $\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}}$ is also an open set in $\mathbb{R}^{k\times r}$ and since $\xi_{Z}^{-1}$ is a continuous map from $\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ to $\mathbb{R}^{k\times r}$ , the set $\xi_{Z}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})$ , as the inverse image of an open set by a continuous map, is an open set in $\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ . Similarly, $\xi_{\tilde{Z}}(\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}})$ is an open set. Now let $(X,G)\in\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ such that $\xi_{Z}^{-1}(X,G)\in\mathcal{V}_{Z}\cap\mathcal{V}_{\tilde{Z}}$ . From the expressions of $\xi_{Z}^{-1}$ and $\xi_{\tilde{Z}}$ , the map $\xi_{\tilde{Z}}\circ\xi_{Z}^{-1}$ is defined by

[TABLE]

with $\xi_{Z}^{-1}(X,G)=(Z+Z_{\bot}X)G$ , which is clearly an analytic map. ∎

Before stating the next result, we recall the definition of a morphism between manifolds and of a fibre bundle. We introduce notions of $\mathcal{C}^{p}$ maps and $\mathcal{C}^{p}$ manifolds, with $p\in\mathbb{N}\cup\{\infty\}$ or $p=\omega$ . In the latter case, $\mathcal{C^{\omega}}$ means analytic.

Definition 3.2.

Let $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ be two $\mathcal{C}^{p}$ manifolds. Let $F:\mathbb{M}\rightarrow\mathbb{N}$ be a map. We say that $F$ is a $\mathcal{C}^{p}$ morphism between $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ if given $m\in\mathbb{M}$ , there exists a chart $(U,\varphi)\in\mathcal{A}$ such that $m\in U$ and a chart $(W,\psi)\in\mathcal{B}$ such that $F(m)\in W$ where $F(U)\subset W,$ and the map

[TABLE]

is a map of class $\mathcal{C}^{p}$ . If it is a $\mathcal{C}^{p}$ diffeomorphism, then we say that $F$ is a $\mathcal{C}^{p}$ diffeomorphism between manifolds. We say that $\psi\circ F\circ\varphi^{-1}$ is a representation of $F$ using a system of local coordinates given by the charts $(U,\varphi)$ and $(W,\psi).$

Definition 3.3.

Let $\mathbb{B}$ be a $\mathcal{C}^{p}$ manifold with atlas $\mathcal{A}=\{(U_{b},\varphi_{b}):b\in\mathbb{B}\}$ , and let $\mathbb{F}$ be a manifold. A $\mathcal{C}^{p}$ fibre bundle $\mathbb{E}$ with base $\mathbb{B}$ and typical fibre $\mathbb{F}$ is a $\mathcal{C}^{p}$ manifold which is locally a product manifold, that is, there exists a surjective morphism $\pi:\mathbb{E}\longrightarrow\mathbb{B}$ such that for each $b\in\mathbb{B}$ there is a $\mathcal{C}^{p}$ diffeomorphism between manifolds

[TABLE]

such that $p_{b}\circ\chi_{b}=\pi$ where $p_{b}:U_{b}\times\mathbb{F}\longrightarrow U_{b}$ is the projection. For each $b\in\mathbb{B},$ $\pi^{-1}(b)=\mathbb{E}_{b}$ is called the fibre over $b.$ The $\mathcal{C}^{p}$ diffeomorphisms $\chi_{b}$ are called fibre bundle charts. If $p=0,$ $\mathbb{E},\mathbb{B}$ and $\mathbb{F}$ are only required to be topological spaces and $\{U_{b}:b\in\mathbb{B}\}$ an open covering of $\mathbb{B}.$ In the case where $\mathbb{F}$ is a Lie group, we say that $\mathbb{E}$ is a $\mathcal{C}^{p}$ principal bundle, and if $\mathbb{F}$ is a vector space, we say that it is a $\mathcal{C}^{p}$ vector bundle.

Theorem 3.4.

*The set $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ is an analytic principal bundle with typical fibre $\mathrm{GL}_{r}$ and base $\mathbb{G}_{r}(\mathbb{R}^{k})$ , with a surjective morphism between $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ and $\mathbb{G}_{r}(\mathbb{R}^{k})$ given by the map $\mathrm{col}_{k,r}$ . *

Proof.

To show that it is an analytic principal bundle, we first observe that

[TABLE]

is a surjective morphism. Indeed, let $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ and $(\mathcal{V}_{Z},\xi_{Z})\in\mathcal{B}_{k,r}$ and $(\mathfrak{U}_{Z},\varphi_{Z})\in\mathcal{A}_{k,r}$ . Noting that $\mathrm{col}_{k,r}(YG)=\mathrm{col}_{k,r}(Y)$ for all $Y\in\mathcal{S}_{Z}$ , we obtain that $\mathrm{col}_{k,r}(\mathcal{V}_{Z})=\mathfrak{U}_{Z}$ . Moreover, a representation of $\mathrm{col}_{k,r}$ by using a system of local coordinates given by the charts is

[TABLE]

which is clearly an analytic map from $\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ to $\mathbb{R}^{(k-r)\times r}$ such that $\mathrm{col}_{k,r}^{-1}(\mathfrak{U}_{Z})=\mathcal{V}_{Z}.$ Now, a representation of the morphism

[TABLE]

using the system of local coordinates given by the charts is

[TABLE]

defined by

[TABLE]

which is clearly an analytic diffeomorphism. To conclude, consider the projection

[TABLE]

and observe that $(p_{Z}\circ\chi_{Z})(W)=\mathrm{col}_{k,r}(W)$ holds for all $W\in\mathcal{V}_{Z}.$ ∎

3.2 $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ as a submanifold and its tangent space

Here, we prove that the non-compact Stiefel manifold $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ equipped with the topology given by the atlas $\mathcal{B}_{k,r}$ is an embedded submanifold in $\mathbb{R}^{k\times r}$ . For that, we have to prove that the standard inclusion map

[TABLE]

as a morphism is an embedding. To see this we need to recall some definitions and results.

Definition 3.5.

Let $F:(\mathbb{M},\mathcal{A})\rightarrow(\mathbb{N},\mathcal{B})$ be a morphism between $\mathcal{C}^{p}$ manifolds and let $m\in\mathbb{M}.$ We say that $F$ is an immersion at $m$ if there exists an open neighbourhood $U_{m}$ of $m$ in $\mathbb{M}$ such that the restriction of $F$ to $U_{m}$ induces an isomorphism from $U_{m}$ onto a submanifold of $\mathbb{N}.$ We say that $F$ is an immersion if it is an immersion at each point of $\mathbb{M}.$

The next step is to recall the definition of the differential as a morphism which gives a linear map between the tangent spaces of the manifolds (in local coordinates) involved with the morphism. Let us recall that for any $m\in\mathbb{M}$ , we denote by $\mathbb{T}_{m}\mathbb{M}$ the tangent space of $\mathbb{M}$ at $m$ (in local coordinates).

Definition 3.6.

Let $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ be two $\mathcal{C}^{p}$ manifolds. Let $F:(\mathbb{M},\mathcal{A})\rightarrow(\mathbb{N},\mathcal{B})$ be a morphism of class $\mathcal{C}^{p}$ , i.e., for any $m\in\mathbb{M}$ ,

[TABLE]

is a map of class $\mathcal{C}^{p}$ , where $(U,\varphi)\in\mathcal{A}$ is a chart in $\mathbb{M}$ containing $m$ and $(W,\psi)\in\mathcal{B}$ is a chart in $\mathbb{N}$ containing $F(m)$ . Then we define

[TABLE]

For finite dimensional manifolds we have the following criterion for immersions (see Theorem 3.5.7 in [1]).

Proposition 3.7.

Let $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ be $\mathcal{C}^{p}$ manifolds. Let

[TABLE]

be a $\mathcal{C}^{p}$ morphism and $m\in\mathbb{M}.$ Then $F$ is an immersion at $m$ if and only if $\mathrm{T}_{m}F$ is injective.

A concept related to an immersion between manifolds is given in the following definition.

Definition 3.8.

Let $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ be $\mathcal{C}^{p}$ manifolds and let $f:(\mathbb{M},\mathcal{A})\longrightarrow(\mathbb{N},\mathcal{B})$ be a $\mathcal{C}^{p}$ morphism. If $f$ is an injective immersion, then $f(\mathbb{M})$ is called an immersed submanifold of $\mathbb{N}$ .

Finally, we give the definition of embedding.

Definition 3.9.

Let $(\mathbb{M},\mathcal{A})$ and $(\mathbb{N},\mathcal{B})$ be $\mathcal{C}^{p}$ manifolds and let $f:(\mathbb{M},\mathcal{A})\longrightarrow(\mathbb{N},\mathcal{B})$ be a $\mathcal{C}^{p}$ morphism. If $f$ is an injective immersion, and $f:(\mathbb{M},\tau_{\mathcal{A}})\longrightarrow(f(\mathbb{M}),\tau_{\mathcal{B}}|_{f(\mathbb{M})})$ is a topological homeomorphism, then we say that $f$ is an embedding and $f(\mathbb{M})$ is called an embedded submanifold of $\mathbb{N}$ .

We first note that the representation of the inclusion map $i$ using the system of local coordinates given by the charts $(\mathcal{V}_{Z},\xi_{Z})\in\mathcal{B}_{k,r}$ in $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ and $(\mathbb{R}^{k\times r},id_{\mathbb{R}^{k\times r}})$ in $\mathbb{R}^{k\times r}$ is

[TABLE]

Then the tangent map $T_{Z}i$ at $Z=\xi_{Z}^{-1}(0,id_{r})$ , defined by $T_{Z}i=D(i\circ\xi_{Z}^{-1})(0,id_{r})$ , is

[TABLE]

Proposition 3.10.

The tangent map $\mathrm{T}_{Z}i:\mathbb{R}^{(k-r)\times r}\times\mathbb{R}^{r\times r}\rightarrow\mathbb{R}^{k\times r}$ at $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ is a linear isomorphism, with inverse $(\mathrm{T}_{Z}i)^{-1}$ given by

[TABLE]

for $\dot{Z}\in\mathbb{R}^{k\times r}$ . Furthermore, the standard inclusion map $i$ is an embedding from $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ to $\mathbb{R}^{k\times r}.$

Proof.

Let us assume that $\mathrm{T}_{Z}i(\dot{X},\dot{G})=Z_{\bot}\dot{X}+Z\dot{G}=0.$ Multiplying this equality by $Z^{+}$ and $Z_{\bot}^{+}$ on the left, we obtain $\dot{G}=0$ and $\dot{X}=0$ respectively, which implies that $\mathrm{T}_{Z}i$ is injective. To prove that it is also surjective, we consider a matrix $\dot{Z}\in\mathbb{R}^{k\times r}$ and observe that $\dot{X}=Z_{\bot}^{+}\dot{Z}\in\mathbb{R}^{(k-r)\times r}$ and $\dot{G}=Z^{+}\dot{Z}\in\mathbb{R}^{r\times r}$ is such that $\mathrm{T}_{Z}i(\dot{X},\dot{G})=\dot{Z}$ . Since $\mathrm{T}_{Z}i$ is injective, the inclusion map $i$ is an immersion.

To prove that it is an embedding we equip $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ with the topology $\tau_{\mathcal{B}_{k,r}}$ given by the atlas and we equip $\mathbb{R}^{k\times r}$ with the topology $\tau_{\mathbb{R}^{k\times r}}$ induced by matrix norms. We need to check that

[TABLE]

is a topological homeomorphism. Since the topology in $(\mathcal{M}_{r}(\mathbb{R}^{k\times r}),\tau_{\mathcal{B}_{k,r}})$ has the property that each local chart $\xi_{Z}$ is indeed a homeomorphism from $\mathcal{V}_{Z}$ in $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ to $\xi_{Z}(\mathcal{V}_{Z})=\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}$ (see Section 1.1), we only need to show that the bijection $(i\circ\xi_{Z}^{-1}):\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}\rightarrow\mathcal{V}_{Z}\subset\mathbb{R}^{k\times r}$ given by

[TABLE]

is a topological homeomorphism for all $Z\in\mathcal{M}_{r}(\mathbb{R}^{k\times r}).$ Observe that $D(i\circ\xi_{Z}^{-1})(X,G)\in\mathcal{L}(\mathbb{R}^{(k-r)\times r}\times\mathbb{R}^{r\times r},\mathbb{R}^{k\times r})$ is given by

[TABLE]

Assume that $Z_{\bot}\dot{X}G+(Z+Z_{\bot}X)\dot{G}=0.$ Multiplying this equality by $Z^{+}$ on the left we obtain $\dot{G}=0,$ and hence $Z_{\bot}\dot{X}G=0.$ Multiplying by $Z_{\bot}^{+}$ on the left we obtain $\dot{X}G=0.$ Thus $\dot{X}=0$ and as a consequence $D(i\circ\xi_{Z}^{-1})(X,G)$ is a linear isomorphism for each $(X,G)\in\mathbb{R}^{(k-r)\times r}\times\mathrm{GL}_{r}.$ The inverse function theorem says us that $(i\circ\xi_{Z}^{-1})$ is a diffeomorphism, in particular a homeomorphism, and hence $i$ is an embedding. ∎

The tangent space to $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ at $Z$ is the image through $T_{Z}i$ of the tangent space at $Z$ in local coordinates $\mathbb{T}_{Z}\mathcal{M}_{r}(\mathbb{R}^{k\times r})=\mathbb{R}^{(k-r)\times r}\times\mathbb{R}^{r\times r}$ , i.e.

[TABLE]

and can be decomposed into a vertical tangent space

[TABLE]

and an horizontal tangent space

[TABLE]

3.3 Lie group structure of neighbourhoods $\mathcal{V}_{Z}$

We here prove that each neighbourhood $\mathcal{V}_{Z}$ of $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ has the structure of a Lie group. For that, we first note that $\mathcal{V}_{Z}$ can be identified with $\mathcal{S}_{Z}\times\mathrm{GL}_{r}$ , with $\mathcal{S}_{Z}$ given by (9). Noting that $\mathcal{S}_{Z}$ can be identified with the Lie group $\mathcal{G}_{Z}$ defined in (11), we then have that $\mathcal{V}_{Z}$ can be identified with a product of two Lie groups $\mathcal{G}_{Z}\times\mathrm{GL}_{r}$ , which is a Lie group with the group operation $\odot_{Z}$ given by

[TABLE]

for $X,X^{\prime}\in\mathbb{R}^{(k-r)\times r}$ and $G,G^{\prime}\in\mathrm{GL}_{r}$ . It allows us to define a group operation $\star_{Z}$ over $\mathcal{V}_{Z}$ defined for $W=\xi_{Z}^{-1}(X,G)$ and $W^{\prime}=\xi_{Z}^{-1}(X^{\prime},G^{\prime})$ by

[TABLE]

and to state the following result.

Theorem 3.11.

The set $\mathcal{V}_{Z}$ together with the group operation $\star_{Z}$ defined by (15) is a Lie group and the map $\eta_{Z}:\mathcal{V}_{Z}\longrightarrow\mathcal{G}_{Z}\times\mathrm{GL}_{r}$ given by

[TABLE]

is a Lie group isomorphism.

4 The principal bundle $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ for $0<r<\min(m,n)$

In this section, we give a geometric description of the set of matrices $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ with rank $r<\min(m,n)$ .

4.1 $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ as a principal bundle

For $Z\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ , there exists $U\in\mathcal{M}_{r}(\mathbb{R}^{n\times r}),$ $V\in\mathcal{M}_{r}(\mathbb{R}^{m\times r}),$ and $G\in\mathrm{GL}_{r}$ such that

[TABLE]

where the column space of $Z$ is $\mathrm{col}_{n,r}(U)$ and the row space of $Z$ is $\mathrm{col}_{m,r}(V).$

Let us first introduce the surjective map

[TABLE]

The set

[TABLE]

can be identified with $\mathrm{GL}_{r}$ . Let us consider $U_{\bot}\in\mathcal{M}_{n-r}(\mathbb{R}^{n\times(n-r)})$ such that $U^{T}\,U_{\bot}=0$ and $V_{\bot}\in\mathcal{M}_{m-r}(\mathbb{R}^{m\times(m-r)})$ such that $V^{T}\,V_{\bot}=0.$ Then we define a neighbourhood of $UGV^{T}$ in the set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ by

[TABLE]

where $\mathfrak{U}_{U}$ and $\mathfrak{U}_{V}$ are the neighbourhoods of $\mathrm{col}_{n,r}(U)$ and $\mathrm{col}_{m,r}(V)$ respectively (see Section 2.2). Noting that $\mathfrak{U}_{U}=\varphi^{-1}_{U}(\mathbb{R}^{(n-r)\times r})=\mathrm{col}_{n,r}(\mathcal{S}_{U})$ and $\mathfrak{U}_{V}=\varphi_{V}^{-1}(\mathbb{R}^{(m-r)\times r})=\mathrm{col}_{m,r}(\mathcal{S}_{V})$ , where $\mathcal{S}_{U}$ and $\mathcal{S}_{V}$ are the affine cross sections of $U$ and $V$ respectively (defined by (4)), the neighbourhood of $UGV^{T}$ can be written

[TABLE]

We can associate to $\mathcal{U}_{Z}$ the parametrisation $\theta_{Z}^{-1}$ given by the chart (see Figure 2)

[TABLE]

defined by

[TABLE]

for $(X,Y,H)\in\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ , and

[TABLE]

for $A\in\mathcal{U}_{Z}$ . In particular, we have $\theta_{Z}^{-1}(0,0,G)=Z.$ We point out that $\mathcal{U}_{Z}=\mathcal{U}_{Z^{\prime}}$ and $\theta_{Z}=\theta_{Z^{\prime}}$ for every $Z^{\prime}=UG^{\prime}V^{T}$ with $G^{\prime}\neq G.$

Theorem 4.1.

The collection $\mathcal{B}_{n,m,r}:=\{(\mathcal{U}_{Z},\theta_{Z}):Z\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})\}$ is an analytic atlas for $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ and hence $(\mathcal{M}_{r}(\mathbb{R}^{n\times m}),\mathcal{B}_{n,m,r})$ is an analytic $r(n+m-r)$ -dimensional manifold modelled on $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathbb{R}^{r\times r}.$

Proof.

$\{\mathcal{U}_{Z}\}_{Z\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})}$ is clearly a covering of $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ . Moreover, since $\theta_{Z}$ is bijective from $\mathcal{U}_{Z}$ to $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ , we claim that if $\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}}\neq\emptyset$ for $Z=UGV^{T}$ and $\tilde{Z}=\tilde{U}\tilde{G}\tilde{V}^{T}\in\mathcal{M}_{r}(\mathbb{R}^{n\times m}),$ then the following statements hold:

i)

$\theta_{Z}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})$ and $\theta_{\tilde{Z}}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})$ are open sets in $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ and 2. ii)

the map $\theta_{\tilde{Z}}\circ\theta_{Z}^{-1}$ is analytic from $\theta_{Z}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})\subset\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ to $\theta_{\tilde{Z}}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})\subset\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ .

In this proof, we equip $\mathbb{R}^{n\times m}$ with the topology $\tau_{\mathbb{R}^{n\times m}}$ induced by matrix norms. We first observe that the set $\mathcal{U}_{Z}=\{A\in\mathcal{M}_{r}(\mathbb{R}^{n\times m}):\det(U^{T}AV)\neq 0\}=\mathcal{O}_{Z}\cap\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ , where $\mathcal{O}_{Z}=\{A\in\mathbb{R}^{n\times m}:\det(U^{T}AV)\neq 0\}$ , as the inverse image of the open set $\mathbb{R}\setminus\{0\}$ through the continuous map $A\mapsto\det(U^{T}AV)$ from $\mathbb{R}^{n\times m}$ to $\mathbb{R}$ , is an open set in $\mathbb{R}^{n\times m}$ . In the same way, we have that $\mathcal{U}_{\tilde{Z}}=\mathcal{O}_{\tilde{Z}}\cap\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ , with $\mathcal{U}_{\tilde{Z}}$ an open set in $\mathbb{R}^{n\times m}$ . Since $\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}}=\mathcal{O}_{Z}\cap\mathcal{O}_{\tilde{Z}}\cap\mathcal{M}_{r}(\mathbb{R}^{n\times m}),$ and since the image of $\theta_{Z}^{-1}$ is in $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ , we have

[TABLE]

the inverse image through $\theta_{Z}^{-1}$ of the open set $\mathcal{O}_{Z}\cap\mathcal{O}_{\tilde{Z}}$ in $\mathbb{R}^{n\times m}$ . Since $\theta_{Z}^{-1}$ is a continuous map from $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ to $\mathbb{R}^{n\times m}$ , we deduce that $\theta_{Z}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})$ is an open set in $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ . Similarly, $\theta_{\tilde{Z}}(\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}})$ is an open set in $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ . Now, let $(X,Y,H)\in\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ such that $\theta_{Z}^{-1}(X,Y,H)\in\mathcal{U}_{Z}\cap\mathcal{U}_{\tilde{Z}}$ . From the expressions of $\theta_{Z}^{-1}$ and $\theta_{\tilde{Z}}$ , the map $\theta_{\tilde{Z}}\circ\theta_{Z}^{-1}$ is defined by

[TABLE]

with $\theta_{Z}^{-1}(X,Y,H)=(U+U_{\bot}X)H(V+V_{\bot}Y)^{T}$ , which is clearly an analytic map. ∎

Theorem 4.2.

The set $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is an analytic principal bundle with typical fibre $\mathrm{GL}_{r}$ and base $\mathbb{G}_{r}(\mathbb{R}^{n})\times\mathbb{G}_{r}(\mathbb{R}^{m})$ with surjective morphism $\varrho_{r}$ between $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ and $\mathbb{G}_{r}(\mathbb{R}^{n})\times\mathbb{G}_{r}(\mathbb{R}^{m})$ given by $\varrho_{r}.$

Proof.

To prove that it is an analytic principal bundle, we consider the surjective map

[TABLE]

the atlas $\mathcal{A}_{n,r}:=\{(\mathfrak{U}_{U},\varphi_{U}):U\in\mathcal{M}_{r}(\mathbb{R}^{n\times r})\}$ of $\mathbb{G}_{r}(\mathbb{R}^{n})$ and the atlas $\mathcal{A}_{m,r}:=\{(\mathfrak{U}_{V},\varphi_{V}):V\in\mathcal{M}_{r}(\mathbb{R}^{m\times r})\}$ of $\mathbb{G}_{r}(\mathbb{R}^{m}).$ Recall that

[TABLE]

with $k=n$ if $Z=U$ or $k=m$ if $Z=V$ , and hence

[TABLE]

Observe that for each fixed $G\in\mathrm{GL}_{r}$ , we have that $\varrho_{r}^{-1}(\mathfrak{U}_{U},\mathfrak{U}_{V})=\mathcal{U}_{Z},$ where $Z=UGV^{T}$ . Since $\mathcal{U}_{Z}=\mathcal{U}_{Z^{\prime}}$ holds for $Z^{\prime}=UG^{\prime}V^{T},$ where $G^{\prime}\in\mathrm{GL}_{r},$ the map

[TABLE]

defined by

[TABLE]

is independent of the choice of $Z=UGV^{T},$ where $G\in\mathrm{GL}_{r}.$ Now, the representation of $\chi_{Z}$ in local coordinates is the map

[TABLE]

given by $((\varphi_{U}\times\varphi_{V}\times id_{\mathbb{R}^{r\times r}})\circ\chi_{Z}\circ\theta_{Z}^{-1})(X,Y,H)=(X,Y,H)$ , which is an analytic diffeomorphism. Moreover, let $p_{Z}:\mathfrak{U}_{U}\times\mathfrak{U}_{V}\times\mathrm{GL}_{r}\longrightarrow\mathfrak{U}_{U}\times\mathfrak{U}_{V}$ be the projection over the two first components. Then

[TABLE]

and the theorem follows. ∎

4.2 $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ as a submanifold and its tangent space

Here, we prove that $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ equipped with the topology given by the atlas $\mathcal{B}_{n,m,r}$ is an embedded submanifold in $\mathbb{R}^{n\times m}.$ For that, we have to prove that the standard inclusion map ${i}:\mathcal{M}_{r}(\mathbb{R}^{n\times m})\rightarrow\mathbb{R}^{n\times m}$ is an embedding. Noting that the inclusion map restricted to the neighbourhood $\mathcal{U}_{Z}$ of $Z=UGV^{T}$ is identified with

[TABLE]

the tangent map $T_{Z}i$ at $Z=\theta_{Z}^{-1}(0,0,G)$ , defined by $T_{Z}i=D({i}\circ\theta_{Z}^{-1})(0,0,G)$ , is

[TABLE]

Proposition 4.3.

The tangent map $\mathrm{T}_{Z}i:\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathbb{R}^{r\times r}\rightarrow\mathbb{R}^{n\times m}$ at $Z=UGV^{T}\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ is a linear isomorphism with inverse $(T_{Z}i)^{-1}$ given by

[TABLE]

for $\dot{Z}\in\mathbb{R}^{n\times m}$ . Furthermore, the standard inclusion map $i$ is an embedding from $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ to $\mathbb{R}^{n\times m}.$

Proof.

Let us suppose that $\mathrm{T}_{Z}i(\dot{X},\dot{Y},\dot{H})=0.$ Multiplying this equality by $(U_{\bot})^{+}$ and $U^{+}$ on the left leads to

[TABLE]

respectively. By multiplying the first equation by $(V^{+})^{T}$ on the right, we obtain $\dot{X}=0$ . By multiplying the second equation on the right by $(V^{+})^{T}$ and $(V_{\bot}^{+})^{T}$ , we respectively obtain $\dot{H}=0$ and $\dot{Y}=0$ . Then, $\mathrm{T}_{Z}i$ is injective and then $i$ is an immersion. For $\dot{Z}\in\mathbb{R}^{n\times m}$ , we note that $\dot{X}=U_{\perp}^{+}\dot{Z}(V^{+})^{T}G^{-1}\in\mathbb{R}^{n\times r}$ , $\dot{Y}=V_{\perp}^{+}\dot{Z}^{T}(U^{+})^{T}G^{-T}\in\mathbb{R}^{m\times r}$ , and $\dot{G}=U^{+}\dot{Z}(V^{+})^{T}\in\mathbb{R}^{r\times r}$ is such that $T_{Z}i(\dot{X},\dot{Y},\dot{G})=\dot{Z}$ , then $\mathrm{T}_{Z}i$ is also surjective. Let us now equip $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ with the topology $\tau_{\mathcal{B}_{n,m,r}}$ given by the atlas and $\mathbb{R}^{n\times m}$ with the topology $\tau_{\mathbb{R}^{n\times m}}$ induced by matrix norms. We have to prove that

[TABLE]

is a topological isomorphism. The topology in $(\mathcal{M}_{r}(\mathbb{R}^{n\times m}),\tau_{\mathcal{B}_{n,m,r}})$ is such that a local chart $\theta_{Z}$ is a homeomorphism from $\mathcal{U}_{Z}\subset\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ to $\theta_{Z}(\mathcal{U}_{Z})=\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ (see Section 1.1). Then, to prove that the map $i$ is an embedding, we need to show that the bijection

[TABLE]

is a topological homeomorphism. For that, observe that its differential

[TABLE]

at $(X,Y,H)\in\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ is given by

[TABLE]

Assume that

[TABLE]

Multiplying on the left by $U^{+}$ and on the right by $(V^{+})^{T}$ , we obtain $\dot{H}=0.$ Multiplying on the left by $U_{\bot}^{+}$ and on the right by $(V^{+})^{T}$ we deduce that $\dot{X}H=0,$ that is, $\dot{X}=0.$ Finally, multiplying on the left by $U^{+}$ and on the right by $(V_{\bot}^{+})^{T}$ we obtain $H\dot{Y}^{T}=0,$ and hence $\dot{Y}=0.$ Thus, $D({i}\circ\theta_{Z}^{-1})(X,Y,H)$ is a linear isomorphism from $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathbb{R}^{r\times r}$ to $D({i}\circ\theta_{Z}^{-1})(X,Y,H)[\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathbb{R}^{r\times r}]$ for each $(X,Y,H)\in\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}.$ The inverse function theorem says us that $({i}\circ\theta_{Z}^{-1})$ is a diffeomorphism from $\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r}$ to $\mathcal{U}_{Z}=({i}\circ\theta_{Z}^{-1})(\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathrm{GL}_{r})$ and in particular, it is a topological homeomorphism. In consequence, the map $i$ is an embedding. ∎

The tangent space to $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ at $Z=UGV^{T}$ , which is the image through $T_{Z}i$ of the tangent space in local coordinates $\mathbb{T}_{Z}\mathcal{M}_{r}(\mathbb{R}^{n\times m})=\mathbb{R}^{(n-r)\times r}\times\mathbb{R}^{(m-r)\times r}\times\mathbb{R}^{r\times r}$ , is

[TABLE]

and can be decomposed into a vertical tangent space

[TABLE]

and an horizontal tangent space

[TABLE]

4.3 Lie group structure of neighbourhoods $\mathcal{U}_{Z}$

We here prove that $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ has locally the structure of a Lie group by proving that the neighbourhoods $\mathcal{U}_{Z}$ can be identified with Lie groups.

Let $Z=UGV^{T}\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ . We first note that $\mathcal{U}_{Z}$ can be identified with $\mathcal{S}_{U}\times\mathcal{S}_{V}\times\mathrm{GL}_{r}$ , with $\mathcal{S}_{U}$ and $\mathcal{S}_{V}$ defined by (9). Noting that $\mathcal{S}_{U}$ and $\mathcal{S}_{V}$ can be identified with Lie groups $\mathcal{G}_{U}$ and $\mathcal{G}_{V}$ defined in (11), we then have that $\mathcal{U}_{Z}$ can be identified with a product of three Lie groups, which is a Lie group with the group operation $\odot_{Z}$ given by

[TABLE]

It allows us to define a group operation $\star_{Z}$ over $\mathcal{U}_{Z}$ defined for $W=\theta_{Z}^{-1}(X,Y,G)$ and $W^{\prime}=\theta_{Z}^{-1}(X^{\prime},Y^{\prime},G^{\prime})$ by

[TABLE]

and to state the following result.

Theorem 4.4.

Let $Z=UGV^{T}\in\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ . Then the set $\mathcal{U}_{Z}$ together with the group operation $\star_{Z}$ defined by (17) is a Lie group with identity element $UV^{T}$ , and the map $\eta_{Z}:\mathcal{U}_{Z}\to\mathcal{G}_{U}\times\mathcal{G}_{V}\times\mathrm{GL}_{r}$ given by

[TABLE]

is a Lie group isomorphism.

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Abraham, J. E. Marsden, and T. Ratiu. Manifolds, tensor analysis, and applications , vol. 75. Springer-Verlag, New York, second edn., 1988.
2[2] Absil, P.-A., Mahony R., and Sepulchre R. Riemannian Geometry of Grassmann Manifolds with a View on Algorithmic Computation. Acta Applicandae Mathematicae , 80(2): 199–220, 2004.
3[3] Absil, P.-A., Mahony R., and Sepulchre R. Optimization Algorithms on Matrix Manifolds . Princeton University Press, 2008.
4[4] Athanasios C Antoulas, Danny C Sorensen, and Serkan Gugercin. A survey of model reduction methods for large-scale systems. Contemporary mathematics , 280:193–220, 2001.
5[5] Peter Benner, Serkan Gugercin, and Karen Willcox. A survey of projection-based model reduction methods for parametric dynamical systems. SIAM review , 57(4):483–531, 2015.
6[6] P. Benner, A. Cohen, M. Ohlberger, and K. Willcox, editors. Model Reduction and Approximation: Theory and Algorithms . SIAM, Philadelphia, PA, 2017.
7[7] A. Falcó, A., W. Hackbusch, and A. Nouy. On the Dirac-Frenkel variational principle on tensor Banach spaces. ar Xiv preprint ar Xiv:1610.09865, 2016.
8[8] O. Koch and C. Lubich. Dynamical low-rank approximation. SIAM Journal on Matrix Analysis and Applications , 29(2):434–454, 2007.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Principal bundle structure of matrix manifolds

Abstract

1 Introduction

1.1 Elements of geometry

1.2 Main results and outline

2 The Grassmann manifold Gr(Rk)\mathbb{G}_{r}(\mathbb{R}^{k})Gr​(Rk)

2.1 An atlas for Gr(Rk)\mathbb{G}_{r}(\mathbb{R}^{k})Gr​(Rk)

Lemma 2.1**.**

Proof.

Proposition 2.2**.**

Proof.

Corollary 2.3**.**

Proof.

Theorem 2.4**.**

Proof.

2.2 Lie group structure of neighbourhoods UZ\mathfrak{U}_{Z}UZ​

Proposition 2.5**.**

Proof.

Corollary 2.6**.**

Proof.

Definition 2.7**.**

Proposition 2.8**.**

Lemma 2.9**.**

Proof.

Theorem 2.10**.**

Proof.

Theorem 2.11**.**

3 The non-compact Stiefel principal bundle Mr(Rk×r)\mathcal{M}_{r}(\mathbb{R}^{k\times r})Mr​(Rk×r)

3.1 Principal bundle structure of Mr(Rk×r)\mathcal{M}_{r}(\mathbb{R}^{k\times r})Mr​(Rk×r)

Theorem 3.1**.**

Proof.

Definition 3.2**.**

Definition 3.3**.**

Theorem 3.4**.**

Proof.

3.2 Mr(Rk×r)\mathcal{M}_{r}(\mathbb{R}^{k\times r})Mr​(Rk×r) as a submanifold and its tangent space

Definition 3.5**.**

Definition 3.6**.**

Proposition 3.7**.**

Definition 3.8**.**

Definition 3.9**.**

Proposition 3.10**.**

Proof.

3.3 Lie group structure of neighbourhoods VZ\mathcal{V}_{Z}VZ​

Theorem 3.11**.**

4 The principal bundle Mr(Rn×m)\mathcal{M}_{r}(\mathbb{R}^{n\times m})Mr​(Rn×m) for 0<r<min⁡(m,n)0<r<\min(m,n)0<r<min(m,n)

4.1 Mr(Rn×m)\mathcal{M}_{r}(\mathbb{R}^{n\times m})Mr​(Rn×m) as a principal bundle

Theorem 4.1**.**

Proof.

Theorem 4.2**.**

Proof.

4.2 Mr(Rn×m)\mathcal{M}_{r}(\mathbb{R}^{n\times m})Mr​(Rn×m) as a submanifold and its tangent space

Proposition 4.3**.**

Proof.

4.3 Lie group structure of neighbourhoods UZ\mathcal{U}_{Z}UZ​

Theorem 4.4**.**

2 The Grassmann manifold $\mathbb{G}_{r}(\mathbb{R}^{k})$

2.1 An atlas for $\mathbb{G}_{r}(\mathbb{R}^{k})$

Lemma 2.1.

Proposition 2.2.

Corollary 2.3.

Theorem 2.4.

2.2 Lie group structure of neighbourhoods $\mathfrak{U}_{Z}$

Proposition 2.5.

Corollary 2.6.

Definition 2.7.

Proposition 2.8.

Lemma 2.9.

Theorem 2.10.

Theorem 2.11.

3 The non-compact Stiefel principal bundle $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$

3.1 Principal bundle structure of $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$

Theorem 3.1.

Definition 3.2.

Definition 3.3.

Theorem 3.4.

3.2 $\mathcal{M}_{r}(\mathbb{R}^{k\times r})$ as a submanifold and its tangent space

Definition 3.5.

Definition 3.6.

Proposition 3.7.

Definition 3.8.

Definition 3.9.

Proposition 3.10.

3.3 Lie group structure of neighbourhoods $\mathcal{V}_{Z}$

Theorem 3.11.

4 The principal bundle $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ for $0<r<\min(m,n)$

4.1 $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ as a principal bundle

Theorem 4.1.

Theorem 4.2.

4.2 $\mathcal{M}_{r}(\mathbb{R}^{n\times m})$ as a submanifold and its tangent space

Proposition 4.3.

4.3 Lie group structure of neighbourhoods $\mathcal{U}_{Z}$

Theorem 4.4.