A Primer on Persistent Homology of Finite Metric Spaces

Facundo Memoli; Kritika Singhal

arXiv:1905.13400·math.AT·June 3, 2019

A Primer on Persistent Homology of Finite Metric Spaces

Facundo Memoli, Kritika Singhal

PDF

1 Repo

TL;DR

This paper provides a concise introduction to persistent homology, a key concept in topological data analysis, explaining its construction, invariance, and stability for analyzing datasets across scales.

Contribution

It offers a self-contained overview of persistent homology, emphasizing its foundational ideas and stability properties in data analysis.

Findings

01

Persistent homology captures scale-dependent topological features.

02

It is stable under data perturbations.

03

Provides a foundational understanding for TDA applications.

Abstract

TDA (topological data analysis) is a relatively new area of research related to importing classical ideas from topology into the realm of data analysis. Under the umbrella term TDA, there falls, in particular, the notion of persistent homology, which can be described in a nutshell, as the study of scale dependent homological invariants of datasets. In these notes, we provide a terse self contained description of the main ideas behind the construction of persistent homology as an invariant feature of datasets, and its stability to perturbations.

Figures5

Click any figure to enlarge with its caption.

Equations181

P := {(X, P_{X}) ∣ X \in M, P_{X} \in P (X)} .

P := {(X, P_{X}) ∣ X \in M, P_{X} \in P (X)} .

d_{X} (x, x^{'}) \geq d_{Y} ((ϕ (x), ϕ (x^{'})) .

d_{X} (x, x^{'}) \geq d_{Y} ((ϕ (x), ϕ (x^{'})) .

C ((X, d_{X})) \in P \mbox an d C (ϕ) \in Mor_{P} (C ((X, d_{X})), C ((Y, d_{Y}))) .

C ((X, d_{X})) \in P \mbox an d C (ϕ) \in Mor_{P} (C ((X, d_{X})), C ((Y, d_{Y}))) .

C (ψ \circ ϕ) = C (ψ) \circ C (ϕ) .

C (ψ \circ ϕ) = C (ψ) \circ C (ϕ) .

u_{X} (x, x^{'}) := x = x_{0}, x_{1}, \dots, x_{n} = x^{'} \in S_{x, x^{'}} min i \in [1 : n] max d_{X} (x_{i}, x_{i - 1}) .

u_{X} (x, x^{'}) := x = x_{0}, x_{1}, \dots, x_{n} = x^{'} \in S_{x, x^{'}} min i \in [1 : n] max d_{X} (x_{i}, x_{i - 1}) .

L = {{3, 5, 7}, {3, 5}, {3, 7}, {5, 7}, {2}, {3}, {5}, {7}}

L = {{3, 5, 7}, {3, 5}, {3, 7}, {5, 7}, {2}, {3}, {5}, {7}}

{3, 5, 7}, {3, 5}, {3, 7}, {5, 7}, {3}, {5}, {7} .

{3, 5, 7}, {3, 5}, {3, 7}, {5, 7}, {3}, {5}, {7} .

L_{0}^{S} = {[2], [3], [5], [7]}, L_{1}^{S} = {[3, 5], [3, 7], [5, 7]} \cup L_{0}^{S} \mbox an d L_{2} = {[3, 5, 7]} \cup L_{1}^{S} .

L_{0}^{S} = {[2], [3], [5], [7]}, L_{1}^{S} = {[3, 5], [3, 7], [5, 7]} \cup L_{0}^{S} \mbox an d L_{2} = {[3, 5, 7]} \cup L_{1}^{S} .

{[2]}

{[2]}

\frac{V}{U} := V / \sim := {v + U ∣ v \in V} .

\frac{V}{U} := V / \sim := {v + U ∣ v \in V} .

C_{n} := {i \sum c_{i} σ_{i} ∣ c_{i} \in F, σ_{i} \in K_{n}} .

C_{n} := {i \sum c_{i} σ_{i} ∣ c_{i} \in F, σ_{i} \in K_{n}} .

\partial_{n} (σ) := i = 0 \sum n [x_{0}, x_{1}, \dots, x_{i - 1}, x_{i}, x_{i + 1}, \dots, x_{n}] (- 1)^{i} .

\partial_{n} (σ) := i = 0 \sum n [x_{0}, x_{1}, \dots, x_{i - 1}, x_{i}, x_{i + 1}, \dots, x_{n}] (- 1)^{i} .

C_{*} (K, F) := \dots \partial_{n + 1} C_{n} \partial_{n} C_{n - 1} \partial_{n - 1} \dots \partial_{3} C_{2} \partial_{2} C_{1} \partial_{1} C_{0} \partial_{0} 0.

C_{*} (K, F) := \dots \partial_{n + 1} C_{n} \partial_{n} C_{n - 1} \partial_{n - 1} \dots \partial_{3} C_{2} \partial_{2} C_{1} \partial_{1} C_{0} \partial_{0} 0.

H_{n} (K, F) := \frac{Z _{n} ( K , F )}{B _{n} ( K , F )} .

H_{n} (K, F) := \frac{Z _{n} ( K , F )}{B _{n} ( K , F )} .

\dots

\dots

\overline{ϕ} (\partial_{n}^{K} (σ)) = i = 0 \sum n [ϕ (x_{0}), ϕ (x_{1}), \dots, ϕ (x_{i}) \dots, ϕ (x_{n})] (- 1)^{i} = \partial_{n}^{L} (ϕ (σ)) = \partial_{n}^{L} (\overline{ϕ} (σ)) .

\overline{ϕ} (\partial_{n}^{K} (σ)) = i = 0 \sum n [ϕ (x_{0}), ϕ (x_{1}), \dots, ϕ (x_{i}) \dots, ϕ (x_{n})] (- 1)^{i} = \partial_{n}^{L} (ϕ (σ)) = \partial_{n}^{L} (\overline{ϕ} (σ)) .

K_{δ}^{VR} (X) := {σ \subseteq X ∣ diam (σ) \leq δ} .

K_{δ}^{VR} (X) := {σ \subseteq X ∣ diam (σ) \leq δ} .

\overset{ˇ}{C}_{δ} (X) := {σ \subseteq X ∣ x \in X min p \in σ max d_{X} (x, p) \leq δ} .

\overset{ˇ}{C}_{δ} (X) := {σ \subseteq X ∣ x \in X min p \in σ max d_{X} (x, p) \leq δ} .

V_{δ}

V_{δ}

H_{k} (K_{δ_{1}} (X), F) \to H_{k} (K_{δ_{2}} (X), F) \to \dots \to H_{k} (K_{δ_{n - 1}} (X), F) \to H_{k} (K_{δ_{n}} (X), F)

H_{k} (K_{δ_{1}} (X), F) \to H_{k} (K_{δ_{2}} (X), F) \to \dots \to H_{k} (K_{δ_{n - 1}} (X), F) \to H_{k} (K_{δ_{n}} (X), F)

V_{δ}

V_{δ}

S (\cdot, A) : P V (F) \to P V_{n} (F)

S (\cdot, A) : P V (F) \to P V_{n} (F)

{V_{δ} v_{δ, δ^{'}} V_{δ^{'}}} = V \mapsto V^{A} = {V_{1}^{A} v_{1, 2}^{A} V_{2}^{A} v_{2, 3}^{A} \dots v_{n - 1, n}^{A} V_{n}^{A}};

{V_{δ} v_{δ, δ^{'}} V_{δ^{'}}} = V \mapsto V^{A} = {V_{1}^{A} v_{1, 2}^{A} V_{2}^{A} v_{2, 3}^{A} \dots v_{n - 1, n}^{A} V_{n}^{A}};

\mathrm{dim}(\mathbb{V}):=\big{(}\dim(V_{1}),\dim(V_{2}),\ldots,\dim(V_{n})\big{)}.

\mathrm{dim}(\mathbb{V}):=\big{(}\dim(V_{1}),\dim(V_{2}),\ldots,\dim(V_{n})\big{)}.

F

F

α_{3} \circ v_{1, 3} (1_{F}) = b \cdot 1_{F} \neq = 0_{F} = w_{1, 3} \circ α_{1} (1_{F}) .

α_{3} \circ v_{1, 3} (1_{F}) = b \cdot 1_{F} \neq = 0_{F} = w_{1, 3} \circ α_{1} (1_{F}) .

I (b, d) = 0 \to \dots \to 0 \to F \to F \to \dots \to F \to 0 \dots \to 0.

I (b, d) = 0 \to \dots \to 0 \to F \to F \to \dots \to F \to 0 \dots \to 0.

spec (X) := {d_{X} (x, x^{'}) ∣ x, x^{'} \in X} .

spec (X) := {d_{X} (x, x^{'}) ∣ x, x^{'} \in X} .

\mathbb{V}_{k}^{X}:=\mathbf{S}\big{(}H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathrm{spec}(X)\big{)}.

\mathbb{V}_{k}^{X}:=\mathbf{S}\big{(}H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathrm{spec}(X)\big{)}.

T (\cdot, A) : D_{[1 : n]} \to D

T (\cdot, A) : D_{[1 : n]} \to D

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rrrlw/TDAstats
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Primer on Persistent Homology of Finite Metric Spaces

Facundo Mémoli

Department of Mathematics and Department of Computer Science and Engineering, The Ohio State University.††thanks: [email protected]

Kritika Singhal

Department of Mathematics, The Ohio State University.††thanks: [email protected]

1 Introduction

TDA (topological data analysis) is a relatively new area of research related to importing classical ideas from topology into the realm of data analysis. Under the umbrella term TDA, there falls, in particular, the notion of persistent homology PH, which can be described in a nutshell, as the study of scale dependent homological invariants of datasets.

The so called “persistent homology pipeline” is depicted in Figure 1: datasets are modeled as finite metric spaces. A given finite metric space induces a filtered simplicial complex (via the Vietoris-Rips construction), which in turn, via the homology functor induces a persistence vector space. Finally, these persistence vector spaces are decomposed into certain building blocks which give rise to a persistence diagram. The figure suggests that if two different datasets (modeled as finite metric spaces) $(X,d_{X})$ and $(Y,d_{Y})$ are given, the dissimilarity between them controls how dissimilar their persistence diagrams will be. In other words, the assignment of persistence diagram to a dataset is continuous (actually Lipschitz) in a suitable sense.

In these notes, we provide a terse self contained description of the main ideas behind the construction of persistent homology as an invariant feature of datasets, and also discuss details about its stability to perturbations. These notes also include a brief discussion about applications to biological data and an overview of software packages that implement the PH pipeline.

Other useful resources for a more in depth study of the different ideas contained in these notes are [EH10, Car09, Ghr14].

Organization.

In Section 2 we provide a mathematical formulation of clustering (in both its flat and hierarchical forms) of finite metric spaces as a precursor for the notion of persistent homology.

In Section 3 we cover the basics of simplicial homology – a necessary ingredient for later discussing the theoretical elements pertaining to persistent homology.

In Section 4 we describe the persistent homology pipeline in detail, and in particular we review the construction of Vietoris-Rips persistence barcodes. In Section 4.1 we provide an analysis of the Vietoris-Rips barcodes corresponding to zero-dimensional persistent homology.

In Section 5 we review the main theoretical elements underpinning the stability of Vietoris-Rips persistent homology of finite metric spaces.

In Section 6 we overview a number of applications of persistent homology to biological data and beyond.

Finally, Section 7 provides a list of software packages implement different parts of the persistent homology pipeline.

Acknowledgements.

These notes are meant to supplement the lectures given by the first author during the TGDA@OSU TRIPODS Summer School held at MBI during May 2018. Videos of the lectures are available at [mbi18]. We acknowledge NSF support through project CCF #1740761.

1 Introduction
2 Clustering
2.1 Hierarchical Clustering
3 Simplicial Homology
4 Persistent Homology
4.1 Interpretation of Clustering via 0-Dimensional Persistence Diagram
5 Stability of Invariants
5.1 Gromov-Hausdorff Distance
5.2 Interleaving Distance
5.3 Bottleneck Distance
6 Applications of Persistent Homology
6.1 A Topological Paradigm for Hippocampal Spatial Map Formation using Persistent Homology
6.2 Topological Analysis of Population Activity in Visual Cortex
6.3 Further Applications to Biology
6.4 Applications to Other Domains
7 Software Packages for Persistent Homology

2 Clustering

One of the methods for extracting information from a data set is clustering the data set according to some rule. In this paper, datasets are represented as finite metric spaces. A finite metric space is a pair $(X,d_{X})$ , where $X$ is a finite set and $d_{X}:X\times X\rightarrow\mathbb{R}_{+}$ is a distance function. We denote by $\mathcal{M}$ the collection of all finite metric spaces.

We start by providing a definition of a clustering method with some examples. For any $n\in\mathbb{N}$ , we denote the set $\{1,2,\ldots,n\}$ by $[1:n]$ . Given $(X,d_{X})\in\mathcal{M}$ , we denote by $P(X)$ , the collection of all partitions of $X$ . Precisely, every $P\in P(X)$ is a family of sets $P=\{B_{1},B_{2},\ldots,B_{k}\}$ , $k\leq|X|$ , such that $B_{i}\subseteq X$ for all $i\in[1:k]$ , for all $i,j\in[1:k]$ with $i\neq j$ , $B_{i}\cap B_{j}=\emptyset$ and $\cup_{i=1}^{k}B_{i}=X$ . We refer to each $B_{i}$ , $i\in[1:k]$ as a block of $P$ . We denote by $\mathcal{P}$ , the collection of all pairs $(X,P_{X})$ , where $X\in\mathcal{M}$ and $P_{X}\in P(X)$ . Formally,

[TABLE]

Definition 2.1 (Clustering Method).

A clustering method $\mathfrak{C}$ is a map $\mathfrak{C}:\mathcal{M}\rightarrow\mathcal{P}$ such that for every $(X,d_{X})\in\mathcal{M}$ , $\mathfrak{C}((X,d_{X}))=(X,P_{X})$ , where $P_{X}\in P(X)$ .

Example 2.2

An example of a clustering method is the discrete clustering that partitions every metric space into singletons. Precisely, we have $\mathfrak{C}_{\mathrm{disc}}:\mathcal{M}\rightarrow\mathcal{P}$ with $\mathfrak{C}_{\mathrm{disc}}((X,d_{X}))=(X,S_{X})$ , where $S_{X}\in P(X)$ is the partition of $X$ into singletons.

Example 2.3

Another example of a clustering method is the full clustering that partitions every metric space into a single block. Precisely, we have $\mathfrak{C}_{\mathrm{full}}:\mathcal{M}\rightarrow\mathcal{P}$ with $\mathfrak{C}_{\mathrm{full}}((X,d_{X}))=(X,\{X\})$ .

There are various other examples of clustering methods such as partitioning into clusters whose diameter is bounded above by a constant, or partitioning into clusters whose diameter is bounded below by a constant, and so on [JS72]. Since we are working with finite metric spaces, the metric structure is the only information we have for determining a partition. Thus, it seems natural that for $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ and a structure preserving map $f:X\rightarrow Y$ , a partition of $Y$ induced by a clustering method $\mathfrak{C}$ can be determined, at least partially, using the map $f$ and a partition of $X$ induced by the same clustering method $\mathfrak{C}$ . Precisely, we want a clustering method $\mathfrak{C}$ to be a functor, see [CM13].

In order to view a clustering method $\mathfrak{C}$ as a functor, we need to view $\mathcal{M}$ and $\mathcal{P}$ as categories. We refer the readers to [Jac12, Spi14] for an account on category theory. We define the categorical structure on $\mathcal{M}$ and $\mathcal{P}$ as follows:

Definition 2.4 (Category of Finite Metric Spaces).

Let $\mathcal{M}$ , by abuse of notation, denote the category of finite metric spaces. The objects of $\mathcal{M}$ are finite metric spaces $(X,d_{X})$ , and the morphisms are defined as follows: for $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , we say that a set map $\phi:X\rightarrow Y$ belongs to $\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))$ if for all $x,x^{\prime}\in X$ ,

[TABLE]

In other words, $\phi$ is $1$ -Lipschitz.

We observe that for all $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , the set $\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))\neq\emptyset$ , since the map $\phi:X\rightarrow Y$ that sends every point in $X$ to a single point in $Y$ belongs to $\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))$ . We now define the category of partitions of finite sets.

Definition 2.5 (Category of Partitions of Finite Sets ).

Let $\mathcal{P}$ , by abuse of notation, denote the category of partitions of finite sets. The objects of $\mathcal{P}$ are $(X,P_{X})$ , where $X$ is a finite set and $P_{X}\in P(X)$ . Here, recall that $P(X)$ is the family of all partitions of $X$ .

Given $P_{Y}=\{B_{1},\ldots,B_{k}\}\in P(Y)$ , and a set map $\phi:X\rightarrow Y$ , the pullback of $P_{Y}$ along $\phi$ is defined as $\phi^{\ast}P_{Y}=\{\phi^{-1}(B_{i})~{}|~{}i\in[1:k]\}$ . Clearly, $\phi^{\ast}P_{Y}\in P(X)$ . The morphisms in $\mathcal{P}$ are then defined as follows: for $(X,P_{X}),(Y,P_{Y})\in\mathcal{P}$ , we say that a set map $\phi:X\rightarrow Y$ belongs to $\mathrm{Mor}_{\mathcal{P}}((X,P_{X}),(Y,P_{Y}))$ if $P_{X}$ is finer than $\phi^{\ast}P_{Y}$ . This means that for every set $A\in P_{X}$ , there exists a set $B\in\phi^{\ast}P_{Y}$ such that $A\subseteq B$ .

We observe that for all $(X,P_{X}),(Y,P_{Y})\in\mathcal{P}$ , the set $\mathrm{Mor}_{\mathcal{P}}((X,P_{X}),(Y,P_{Y}))\neq\emptyset$ , since the map $\phi:X\rightarrow Y$ that sends every point of $X$ to a single point of $Y$ satisfies $\phi^{\ast}P_{Y}=\{X\}$ . Thus, any $P_{X}\in P(X)$ is finer than $\phi^{\ast}P_{Y}$ , and we obtain $\phi\in\mathrm{Mor}_{\mathcal{P}}((X,P_{X}),(Y,P_{Y}))$ .

We now define a clustering method $\mathfrak{C}:\mathcal{M}\rightarrow\mathcal{P}$ to be a functor. This means that for all $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ and $\phi\in\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))$ ,

[TABLE]

Furthermore, $\mathfrak{C}$ satisfies $\mathfrak{C}(\mathrm{id}_{(X,d_{X})})=\mathrm{id}_{\mathfrak{C}((X,d_{X}))}$ and for all $(Z,d_{Z})\in\mathcal{M}$ with $\psi\in\mathrm{Mor}_{\mathcal{M}}((Y,d_{Y}),(Z,d_{Z}))$ ,

[TABLE]

We recall the clustering method $\mathfrak{C}_{\mathrm{disc}}$ and show that $\mathfrak{C}_{\mathrm{disc}}$ is a functor. For all $(X,d_{X})\in\mathcal{M}$ , $\mathfrak{C}_{\mathrm{disc}}((X,d_{X}))=(X,S_{X})$ , where $S_{X}\in P(X)$ is the partition into singletons. For any $(Y,d_{Y})\in\mathcal{M}$ , let $\phi:X\rightarrow Y$ be a set map such that $\phi\in\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))$ . Then, clearly, $\phi\in\mathrm{Mor}_{\mathcal{P}}((X,S_{X}),(Y,S_{Y}))$ since $S_{X}$ is the partition of $X$ into singletons, and thus is finer than any other partition of $X$ , in particular, $S_{X}$ is finer than $\phi^{\ast}S_{Y}$ . It is trivial to check that $\mathfrak{C}_{\mathrm{disc}}$ satisfies other properties of being a functor.

Similarly, it can be checked that the clustering method $\mathfrak{C}_{\mathrm{full}}$ is a functor. We now provide another example of a clustering method that is also a functor, and is defined for every real number $\delta\geq 0$ . It is called the Vietoris-Rips clustering functor.

Example 2.6 (Vietoris-Rips clustering functor)

The Vietoris-Rips clustering functor at a fixed scale parameter $\delta>0$ , is denoted by $\mathfrak{C}^{\mathrm{VR}_{\delta}}$ , and is defined as follows: given $(X,d_{X})\in\mathcal{M}$ and $\delta>0$ , define $P_{X}(\delta)\in P(X)$ as $P_{X}(\delta)=X/\sim_{\delta}$ , where $x\sim_{\delta}x^{\prime}$ if and only if there exists a sequence $x_{0},x_{1},\ldots,x_{n}$ in $X$ with $x_{0}=x$ and $x_{n}=x^{\prime}$ , such that for all $i\in[1:n]$ , $d_{X}(x_{i-1},x_{i})\leq\delta$ . Then, $\mathfrak{C}^{\mathrm{VR}_{\delta}}((X,d_{X})):=(X,P_{X}(\delta))$ . The clustering $P_{X}(\delta)$ is referred to as the single linkage clustering of $X$ at scale $\delta$ .

Consider a metric space $(X,d_{X})$ where $X=\{a,b\}$ and $d_{X}(a,b)=r>0$ . Then, for all $\delta<r$ , $\mathfrak{C}^{\mathrm{VR}_{\delta}}((X,d_{X}))=\{\{a\},\{b\}\}$ , and for $\delta\geq r$ , $\mathfrak{C}^{\mathrm{VR}_{\delta}}((X,d_{X}))=\{a,b\}$ .

The functoriality of $\mathfrak{C}^{\mathrm{VR}_{\delta}}$ can be seen as follows: given $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , let $\phi\in\mathrm{Mor}_{\mathcal{M}}((X,d_{X}),(Y,d_{Y}))$ . Then, by definition, for all $x,x^{\prime}\in X$ , $d_{X}(x,x^{\prime})\geq d_{Y}(\phi(x),\phi(x^{\prime}))$ . Now, let $\delta>0$ be fixed. If $x,x^{\prime}\in X$ are such that $x\sim_{\delta}x^{\prime}$ , then there exists a sequence $x=x_{0},x_{1},\ldots,x_{n}=x^{\prime}$ in $X$ such that for all $0\leq i\leq n-1$ , $d_{X}(x_{i},x_{i+1})\leq\delta$ . By definition of $\phi$ , this implies that for all $0\leq i\leq n-1$ , $d_{Y}(\phi(x_{i}),\phi(x_{i+1}))\leq\delta$ , and therefore $\phi(x)\sim_{\delta}\phi(x^{\prime})$ . Thus, we obtain that $P_{X}(\delta)$ is finer than $\phi^{\ast}P_{Y}(\delta)$ . We can similarly check that $\mathfrak{C}^{\mathrm{VR}_{\delta}}$ satisfies other properties of being a functor.

Given $\delta\geq 0$ , let $\Delta_{2}(\delta)$ denote the metric space consisting of $2$ points at distance $\delta$ . The next theorem states the uniqueness of the Vietoris-Rips clustering functor with respect to a particular property.

Theorem 2.7 ([CM13, Theorem 6.4]).

Let $\mathfrak{C}:\mathcal{M}\rightarrow\mathcal{P}$ be a clustering functor for which there exists $\delta_{\mathfrak{C}}>0$ with the property that:

$\mathfrak{C}(\Delta_{2}(\delta))$ * is in two pieces for all $\delta\in[0,\delta_{\mathfrak{C}})$ , and* 2. 2.

$\mathfrak{C}(\Delta_{2}(\delta))$ * is in one piece for all $\delta\geq\delta_{\mathfrak{C}}$ .*

Then, $\mathfrak{C}$ is the Vietoris-Rips clustering functor with parameter $\delta_{\mathfrak{C}}$ .

As we discussed, we have that $\mathfrak{C}_{\mathrm{disc}},\mathfrak{C}_{\mathrm{full}}$ and the Vietoris-Rips clustering functor are examples of functorial clustering methods. It is worth pointing out that the well known average linkage and complete linkage clustering methods fail to be functorial, see [CM10].

We observe that the Vietoris-Rips clustering functor $\mathfrak{C}^{\mathrm{VR}_{\delta}}$ varies with $\delta$ . Thus, a natural question one may ask is how the clustering at scale $\delta$ is related to the clustering at scale $\delta^{\prime}\neq\delta$ . This leads to the concept of hierarchical clustering.

2.1 Hierarchical Clustering

We start by looking at an example. Consider a metric space $(Z,d_{Z})$ where $Z=\{a,b,c\}$ and $d_{Z}(a,b)=0.4,d_{Z}(b,c)=0.6,d_{Z}(a,c)=0.7$ . Then, we have that for all $0\leq\delta<0.4$ , $\mathfrak{C}^{\mathrm{VR}_{\delta}}((Z,d_{Z}))=\{\{a\},\{b\},\{c\}\}$ , for $0.4\leq\delta<0.6$ , $\mathfrak{C}^{\mathrm{VR}_{\delta}}((Z,d_{Z}))=\{\{a,b\},\{c\}\}$ and for $\delta\geq 0.6$ , $\mathfrak{C}^{\mathrm{VR}_{\delta}}((Z,d_{Z}))=\{\{a,b,c\}\}$ . We observe that for $\delta=0$ , the clusters are singletons, and for $\delta$ large enough, all points fall into one cluster. In addition, for $\delta^{\prime}>\delta$ , the clusters at $\delta^{\prime}$ are obtained by merging clusters at $\delta$ . Such a clustering can be pictorially represented using a dendrogram.

Definition 2.8 (Dendrogram).

Let $X$ be a finite set. A dendrogram over $X$ is a function $\theta_{X}:[0,\infty)\rightarrow P(X)$ , such that the following hold:

For all $s\leq t$ , $\theta_{X}(s)$ is finer than $\theta_{X}(t)$ . 2. 2.

$\theta_{X}(0)$ * is the partition into singletons.* 3. 3.

There exists $t_{f}\in(0,\infty)$ such that $\theta_{X}(t_{f})=\{X\}$ . 4. 4.

For all $t>0$ , there exists $\epsilon>0$ such that $\theta_{X}(t+\epsilon)=\theta_{X}(t)$ .

The parameter $t$ is referred to as the scale of partition.

A dendrogram depicting the Vietoris-Rips clustering (called the single linkage dendrogram) of the $3$ -point metric space $(Z,d_{Z})$ described above is as follows:

Here, we have that $\theta_{Z}(0.3)=\{\{a\},\{b\},\{c\}\}$ , $\theta_{Z}(0.5)=\{\{a,b\},\{c\}\}$ and $\theta_{Z}(1)=\{\{a,b,c\}\}$ . Precisely, we have that for every $t\geq 0$ , the $\theta_{Z}(t)=\mathfrak{C}^{\mathrm{VR}_{t}}((Z,d_{Z}))$ .

Let $G(X)$ denote the collection of all dendrograms over a finite set $X$ and let $\mathcal{G}=\{(X,\theta_{X})~{}|~{}|X|<\infty,~{}\theta_{X}\in G(X)\}$ . Then, $\mathcal{G}$ can be viewed as a category. The objects of $\mathcal{G}$ are as specified in the definition, and for all $(X,\theta_{X}),(Y,\theta_{Y})\in\mathcal{G}$ , a set map $\phi:X\rightarrow Y$ belongs to $\mathrm{Mor}_{\mathcal{G}}((X,\theta_{X}),(Y,\theta_{Y}))$ if for all $t\geq 0$ , $\theta_{X}(t)$ is finer than $\phi^{\ast}\theta_{Y}(t)$ . We again have that for all $(X,\theta_{X}),(Y,\theta_{Y})\in\mathcal{G}$ , $\mathrm{Mor}_{\mathcal{G}}((X,\theta_{X}),(Y,\theta_{Y}))\neq\emptyset$ , since the map $\phi:X\rightarrow Y$ that takes all points of $X$ to a single point of $Y$ belongs to $\mathrm{Mor}_{\mathcal{G}}((X,\theta_{X}),(Y,\theta_{Y}))$ . We are now ready to define hierarchical clustering formally.

Definition 2.9 (Hierarchical Clustering).

A hierarchical clustering method is any functor $\mathcal{H}:\mathcal{M}\rightarrow\mathcal{G}$ , i.e. for any $(X,d_{X})\in\mathcal{M}$ , $\mathcal{H}((X,d_{X}))=(X,\theta_{X})$ , where $\theta_{X}\in G(X)$ .

The Vietoris-Rips clustering functor, as described in Example 2.6, is a an example of a hierarchical clustering, since for any $X\in\mathcal{M}$ , the function $P_{X}:[0,\infty)\rightarrow P(X)$ , as defined in Example 2.6, is a dendrogram over $X$ . The Vietoris-Rips clustering functor applied on $(X,d_{X})$ induces a metric on $X$ in the following manner: given $x,x^{\prime}\in X$ , we can find the smallest $t>0$ , such that $x$ and $x^{\prime}$ belong to same block of the partition $\mathfrak{C}^{\mathrm{VR}_{t}}((X,d_{X}))$ . This provides us with a measure of dissimilarity between points of $X$ . This dissimilarity induces a metric on $X$ , referred to as an ultra-metric.

Definition 2.10 (Ultra-metric).

Given a set $X$ , a function $u:X\times X\rightarrow\mathbb{R}_{\geq 0}$ is called an ultra-metric if the following hold:

For all $x,x^{\prime}\in X$ , $u(x,x^{\prime})=u(x^{\prime},x)\geq 0$ and $u(x,x^{\prime})=0$ if and only if $x=x^{\prime}$ . 2. 2.

For all $x,x^{\prime},x^{\prime\prime}\in X$ , $u(x,x^{\prime\prime})\leq\max\{u(x,x^{\prime}),u(x^{\prime},x^{\prime\prime})\}$ . The second condition is referred to as the strong triangle inequality.

We now define the ultra-metric induced by the Vietoris-Rips clustering functor.

Definition 2.11 (Ultra-metric induced by $\mathfrak{C}^{\mathrm{VR}}$ ).

Let $(X,d_{X})\in\mathcal{M}$ . For every $x,x^{\prime}\in X$ , let $S_{x,x^{\prime}}$ denote the collection of all sequences $x=x_{0},x_{1},\ldots,x_{n}=x^{\prime}$ in $X$ satisfying $x_{i}\neq x_{j}$ for all $i,j\in[1:n]$ with $i\neq j$ . Then, the ultra-metric induced by $\mathfrak{C}^{\mathrm{VR}}$ , denoted by $u_{X}$ , is defined as

[TABLE]

It is straightforward to check that $u_{X}$ satisfies the properties of symmetry, positivity and strong triangle inequality. We observe that for any $x,x^{\prime}\in X$ , $u_{X}(x,x^{\prime})$ is the smallest $t>0$ at which the block containing $x$ merges with the block containing $x^{\prime}$ in the single linkage dendrogram of $X$ . Thus, we obtain that the Vietoris-Rips clustering functor applied to $(X,d_{X})$ induces an ultra-metric on $X$ . It has been shown in [CM10, Theorem 18] that the Vietoris-Rips clustering functor is the unique hierarchical clustering method with this property.

We remark that the ultrametric $u_{X}$ defined above is the maximal subdominant ultrametric on X, which means that for every ultrametric $\hat{u}$ on $X$ satisfying $\hat{u}\leq d_{X}$ , we have $\hat{u}\leq u_{X}$ .

In subsequent sections, we describe the machinery of persistent homology — a generalization of hierarchical clustering — which can be used to obtain information about a metric space. The rest of the paper is focused on developing the theory of persistent homology.

3 Simplicial Homology

In this section, we define the pre-requisites needed to develop the theory of persistent homology. We will be defining and working only with abstract simplicial complexes in this paper. For the rest of the paper, any simplicial complex is an abstract simplicial complex. We refer the reader to [Mun96] for the definitions in this section.

Definition 3.1 (Simplicial Complex).

A simplicial complex is a collection $\mathrm{K}$ of finite non-empty sets such that if $A$ is an element of $\mathrm{K}$ , then so is every non-empty subset of $A$ .

For example, the collection

[TABLE]

forms a simplicial complex, but the collection $\{\{2,3\},\{1\},\{2\}\}$ does not.

Definition 3.2 (Subcomplex).

Given a simplicial complex $\mathrm{K}$ , a subcollection $\mathrm{J}$ of $\mathrm{K}$ is a subcomplex of $\mathrm{K}$ if $\mathrm{J}$ is a simplicial complex in itself.

The collection $\{\{3,5\},\{3\},\{5\}\}$ is a subcomplex of the simplicial complex $\mathrm{L}$ defined above.

Definition 3.3 (Simplex of a complex).

Every element $A$ of a simplicial complex $\mathrm{K}$ is a simplex of $\mathrm{K}$ .

Some of the simplices of $\mathrm{L}$ are $\{3,5,7\}$ , $\{2\}$ , and $\{5\}$ . For every simplicial complex $\mathrm{K}$ , if $\sigma=\{x_{0},x_{1},\ldots,x_{k}\}$ , $k\in\mathbb{N}$ , is a simplex of $\mathrm{K}$ , we assume that $\sigma$ is oriented by the ordering $x_{0}<x_{1}<\ldots<x_{k}$ . We write $[x_{0},x_{1},\ldots,x_{k}]$ to denote the equivalence class of the even permutations of this ordering, and $-[x_{0},x_{1},\ldots,x_{k}]$ to denote the equivalence class of the odd permutations of this ordering.

Definition 3.4 (Face of a simplex).

The faces of a simplex $A$ of a simplicial complex $\mathrm{K}$ are the non-empty subsets of $A$ .

The faces of the simplex $[3,5,7]$ of the simplicial complex $\mathrm{L}$ defined above are

[TABLE]

Definition 3.5 (Dimension of a complex).

The dimension of a simplicial complex $\mathrm{K}$ is the largest dimension of a simplex of $\mathrm{K}$ , where the dimension of a simplex $X$ of $\mathrm{K}$ is $|X|-1$ . If there is no such largest dimension, then dimension of $\mathrm{K}$ is infinite.

The dimension of the simplicial complex $\mathrm{L}$ is $2$ .

Definition 3.6 (Vertices of a complex).

The vertex set of a simplicial complex $\mathrm{K}$ , denoted by $V(\mathrm{K})$ , is the union of the one-point elements of $\mathrm{K}$ .

The vertex set of the simplicial complex $\mathrm{L}$ is $\{2,3,5,7\}$ .

Definition 3.7 ( $n$ -skeleton of a complex).

Given $n\in\mathbb{Z}_{+}$ , an $n$ -skeleton of a simplicial complex $\mathrm{K}$ , denoted by $\mathrm{K}^{S}_{n}$ , is the collection of all simplices of $\mathrm{K}$ of dimension at most $n$ .

We observe that the [math]-skeleton of a simplicial complex $\mathrm{K}$ consists of all singletons of $\mathrm{K}$ . For the simplicial complex $\mathrm{L}$ , we have

[TABLE]

Definition 3.8 (Connected component).

Two simplices $S$ and $T$ of a simplicial complex $\mathrm{K}$ belong to the same connected component of $\mathrm{K}$ if there exists a non-empty sequence of simplices of $\mathrm{K}$ , $S=S_{0},S_{1},\ldots,S_{n}=T$ such that for all $0\leq i\leq n-1$ , $S_{i}\cap S_{i+1}\neq\emptyset$ .

The connected components of the simplicial complex $\mathrm{L}$ are

[TABLE]

Definition 3.9 (Simplicial map).

Given simplicial complexes $\mathrm{K}$ and $\mathrm{K}^{\prime}$ , a map $\phi:V(\mathrm{K})\rightarrow V(\mathrm{K}^{\prime})$ is called a simplicial map, if for every simplex $R$ of $\mathrm{K}$ , $\phi(R)$ is a simplex of $\mathrm{K}^{\prime}$ .

The collection of all simplicial complexes along with simplicial maps between them forms a category. For any pair of simplicial complexes $\mathrm{K}$ and $\mathrm{L}$ , a map that sends every vertex of $\mathrm{K}$ to the same vertex of $\mathrm{L}$ is a simplicial map. Thus, the set of simplicial maps between $\mathrm{K}$ and $\mathrm{L}$ is non-empty. We denote the category of simplicial complexes by $\mathcal{S}$ .

We need the following two definitions in order to define the homology groups.

Definition 3.10 (Quotient vector space).

Let $\mathbb{V}$ and $\mathbb{U}$ be vector spaces over a field $\mathbb{F}$ such that $\mathbb{U}\subseteq\mathbb{V}$ . We define an equivalence relation $\sim$ on $\mathbb{V}$ as follows: for $v,v^{\prime}\in\mathbb{V}$ , $v\sim v^{\prime}$ if $v-v^{\prime}\in\mathbb{U}$ . For every $v\in\mathbb{V}$ , the equivalence class of $v$ is denoted by $v+\mathbb{U}$ , and is defined as $v+\mathbb{U}=\{v+u~{}|~{}u\in\mathbb{U}\}$ . The quotient vector space $\frac{\mathbb{V}}{\mathbb{U}}$ is then defined as

[TABLE]

Definition 3.11 (Isomorphic vector spaces).

Two vector spaces $\mathbb{V}$ and $\mathbb{W}$ over a field $\mathbb{F}$ are called isomorphic if there exists a bijective linear transformation $\phi:\mathbb{V}\rightarrow\mathbb{W}$ .

Definition 3.12 (Chain Complex).

Let $\mathrm{K}$ be a simplicial complex and $\mathbb{F}$ be a field. Let $\mathrm{K}_{n}$ denote the collection of all simplices of $\mathrm{K}$ of dimension $n$ . For every $n\in\mathbb{Z}_{+}$ , we define

[TABLE]

Precisely, $C_{n}$ is the free vector space over $\mathbb{F}$ with basis $\mathrm{K}_{n}$ . The boundary map $\partial_{n}:C_{n}\rightarrow C_{n-1}$ is defined as follows: for $\sigma=[x_{0},x_{1},\ldots,x_{n}]\in\mathrm{K}_{n}$ , and $0\leq i\leq n$ , we denote the element $[x_{0},x_{1},\ldots,x_{n}]\setminus x_{i}\in\mathrm{K}_{n-1}$ by $[x_{0},x_{1},\ldots,x_{i-1},\widehat{x_{i}},x_{i+1},\ldots,x_{n}]$ . Then, we set

[TABLE]

Since $C_{n}$ is a free vector space over $\mathrm{K}_{n}$ , it suffices to define the boundary maps on elements of $\mathrm{K}_{n}$ . The chain complex associated to $\mathrm{K}$ , denoted by $\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})$ , is defined to be the sequence of vector spaces $\{C_{n}\}_{n\in\mathbb{Z}_{+}}$ , along with the boundary maps $\partial_{n}:C_{n}\rightarrow C_{n-1}$ . Precisely, we have

[TABLE]

Lemma 3.13.

Let $\mathrm{K}$ be a simplicial complex, and $\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})$ be the chain complex as defined above. Then, for all $n\in\mathbb{Z}_{+}$ , we have $\partial_{n}\circ\partial_{n+1}=0$ .

Proof.

The lemma holds for any field $\mathbb{F}$ , and the proof follows from the definition of boundary map. ∎

Since $\partial_{n}\circ\partial_{n+1}=0$ for all $n\in\mathbb{Z}_{+}$ , we obtain that for every $n\in\mathbb{Z}_{+}$ , the image of $\partial_{n+1}$ is contained in the kernel of $\partial_{n}$ .

Definition 3.14 ( $n$ -Cycle and $n$ -Boundary).

Given a simplicial complex $\mathrm{K}$ and its associated chain complex $\mathcal{C}_{*}(\mathrm{K},\mathbb{F})=(C_{n},\partial_{n})_{n\in\mathbb{Z}_{+}}$ , the kernel of the map $\partial_{n}$ is the set of $n$ -cycles and is denoted by $\mathcal{Z}_{n}(\mathrm{K},\mathbb{F})$ . The image of the map $\partial_{n+1}$ is the set of $n$ -boundaries, and is denoted by $\mathcal{B}_{n}(\mathrm{K},\mathbb{F})$ .

By Lemma 3.13, we have that for all $n\in\mathbb{Z}_{+}$ , $\mathcal{B}_{n}(\mathrm{K},\mathbb{F})\subseteq\mathcal{Z}_{n}(\mathrm{K},\mathbb{F})$ .

Definition 3.15 (Simplicial Homology).

Given $n\in\mathbb{Z}_{+}$ , the $n$ -th homology group of a simplicial complex $\mathrm{K}$ , is denoted by $H_{n}(\mathrm{K},\mathbb{F})$ , and is defined as

[TABLE]

That is, $H_{n}(\mathrm{K},\mathbb{F})$ is a quotient vector space and the elements of $H_{n}(\mathrm{K},\mathbb{F})$ are equivalence classes of $n$ -cycles of $\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})$ .

Definition 3.16 (Betti numbers).

Given $n\in\mathbb{Z}_{+}$ , the $n$ -th Betti number of a simplicial complex $\mathrm{K}$ is denoted by $\beta_{n}(\mathrm{K})$ , and is defined as $\beta_{n}(\mathrm{K}):=\dim(H_{n}(\mathrm{K},\mathbb{F}))$ .

Lemma 3.17.

For every simplicial complex $\mathrm{K}$ , $\beta_{0}(\mathrm{K})$ is equal to the number of connected components of $\mathrm{K}$ .

Proof.

For any simplicial complex $\mathrm{K}$ , we have $H_{0}(\mathrm{K},\mathbb{F})=\frac{\mathcal{Z}_{0}(\mathrm{K},\mathbb{F})}{\mathcal{B}_{0}(\mathrm{K},\mathbb{F})}$ , where $\mathcal{Z}_{0}(\mathrm{K},\mathbb{F})=\mathrm{ker}(\partial_{0})$ and $\mathcal{B}_{0}(\mathrm{K},\mathbb{F})=\mathrm{im}(\partial_{1})$ . The $1$ -simplex $\mathrm{K}_{1}$ consists of all elements of $\mathrm{K}$ of cardinality $2$ , while the [math]-simplex $\mathrm{K}_{0}$ consists of singletons of $\mathrm{K}$ . We use the symbol $\cong$ to denote an isomorphism of the concerned spaces, and $\langle S\rangle_{F}$ to denote the free vector space over $\mathbb{F}$ with basis elements of $S$ . We have $\langle\mathrm{K}_{0}\rangle_{\mathbb{F}}\cong\mathbb{F}^{|\mathrm{K}_{0}|}$ , and $\langle\mathrm{K}_{1}\rangle_{\mathbb{F}}\cong\mathbb{F}^{|\mathrm{K}_{1}|}$ . The map $\partial_{0}:\langle\mathrm{K}_{0}\rangle_{\mathbb{F}}\rightarrow 0$ satisfies $\mathrm{ker}(\partial_{0})=\langle\mathrm{K}_{0}\rangle_{\mathbb{F}}\cong\mathbb{F}^{|\mathrm{K}_{0}|}$ . The image under $\partial_{1}$ of an element $[x_{i},x_{j}]\in\mathrm{K}_{1}$ is $[x_{j}]-[x_{i}]\in\langle\mathrm{K}_{0}\rangle_{\mathbb{F}}$ . The elements $[x_{i},x_{j}],[x_{j},x_{k}]\in\mathrm{K}_{1}$ belong to the same connected component, and span a subspace of dimension $2$ in $\langle\mathrm{K}_{0}\rangle_{\mathbb{F}}$ with basis $\{[x_{j}]-[x_{i}],[x_{k}]-[x_{j}]\}$ . In general, if a connected component $S$ in $\mathrm{K}$ contains $n$ vertices, then the image under $\partial_{1}$ of the elements of $\mathrm{K}_{1}$ belonging to $S$ is a vector space of dimension $n-1$ . Thus, if $\mathrm{K}$ has $l$ connected components $S_{1},S_{2},\ldots,S_{l}$ , then we have that $\mathrm{im}(\partial_{1})\cong\mathbb{F}^{\sum_{i}(|S_{i}|-1)}$ . Thus, we obtain $H_{0}(\mathrm{K},\mathbb{F})\cong\frac{\mathbb{F}^{|\mathrm{K}_{0}|}}{\mathbb{F}^{\sum_{i}(|S_{i}|-1)}}$ . Now, we know that $\mathrm{K}_{0}$ consists of all singletons and therefore $|\mathrm{K}_{0}|$ is equal to the number of vertices of $\mathrm{K}$ . Every vertex of $\mathrm{K}$ belongs to a unique connected component, therefore we have that $\sum_{i}|S_{i}|=|\mathrm{K}_{0}|$ . This implies that $H_{0}(\mathrm{K},\mathbb{F})\cong\mathbb{F}^{l}$ , and we obtain that $\beta_{0}(\mathrm{K})$ is equal to the number of connected components of $\mathrm{K}$ . ∎

Given simplicial complexes $\mathrm{K}$ and $\mathrm{L}$ , and a simplicial map $\phi:\mathrm{K}\rightarrow\mathrm{L}$ , a natural question to ask is whether $\phi$ induces a map between chain complexes $\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})$ and $\mathcal{C}_{\ast}(\mathrm{L},\mathbb{F})$ , as well as between homology vector spaces $H_{n}(\mathrm{K},\mathbb{F})$ and $H_{n}(\mathrm{L},\mathbb{F})$ , for $n\in\mathbb{Z}_{+}$ . The following proposition answers this question.

Proposition 3.18.

Given simplicial complexes $\mathrm{K}$ and $\mathrm{L}$ , a simplicial map $\phi:\mathrm{K}\rightarrow\mathrm{L}$ induces a map $\overline{\phi}:\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})\rightarrow\mathcal{C}_{\ast}(\mathrm{L},\mathbb{F})$ , as well as maps $H_{n}(\phi,\mathbb{F}):H_{n}(\mathrm{K},\mathbb{F})\rightarrow H_{n}(\mathrm{L},\mathbb{F})$ , for every $n\in\mathbb{Z}_{+}$ .

Proof.

Let $\mathrm{K}$ and $\mathrm{L}$ be simplicial complexes and $\phi:\mathrm{K}\rightarrow\mathrm{L}$ be a simplicial map. Let $\mathcal{C}_{\ast}(\mathrm{K},\mathbb{F})=\{C_{n}(\mathrm{K})\xrightarrow{\partial_{n}^{\mathrm{K}}}C_{n-1}(\mathrm{K})\}_{n\in\mathbb{Z}_{+}}$ and $\mathcal{C}_{\ast}(\mathrm{L},\mathbb{F})=\{C_{n}(\mathrm{L})\xrightarrow{\partial_{n}^{\mathrm{L}}}C_{n-1}(\mathrm{L})\}_{n\in\mathbb{Z}_{+}}$ . For every $n\in\mathbb{Z}_{+}$ , $C_{n}(\mathrm{K})$ and $C_{n}(\mathrm{L})$ are free vector spaces over the collection of $n$ -simplices of $\mathrm{K}$ and $\mathrm{L}$ respectively. Therefore, the map $\overline{\phi}:C_{n}(\mathrm{K})\rightarrow C_{n}(\mathrm{L})$ defined by linearly extending $\phi$ is a well-defined map. Precisely, we have that for $n\in\mathbb{Z}_{+}$ , $i\in I$ , $I$ an indexing set, $c_{i}\in\mathbb{F}$ , $\sigma_{i}\in\mathrm{K}_{n}$ and $\sum_{i}c_{i}\sigma_{i}\in C_{n}(\mathrm{K})$ , $\overline{\phi}\left(\sum_{i}c_{i}\sigma_{i}\right)=\sum_{i}c_{i}\phi(\sigma_{i})$ . Since $\phi$ is a simplicial map, we obtain $\sum_{i}c_{i}\phi(\sigma_{i})\in C_{n}(\mathrm{L})$ . Thus, we obtain the following diagram:

[TABLE]

We now show that the squares in the above diagram commute. Let $\sigma=[x_{0},x_{1},\ldots,x_{n}]\in C_{n}(\mathrm{K})$ . Then $\partial_{n}^{\mathrm{K}}(\sigma)=\sum_{i=0}^{n}[x_{0},x_{1},\ldots,\widehat{x_{i}},\ldots,x_{n}](-1)^{i}$ . We have

[TABLE]

Thus, we have shown that $\overline{\phi}\circ\partial_{n}^{\mathrm{K}}=\partial_{n}^{\mathrm{L}}\circ\overline{\phi}$ for every $n\in\mathbb{Z}_{+}$ . This implies that for every $n\in\mathbb{Z}_{+}$ , the map $\overline{\phi}$ sends the kernel of $\partial_{n}^{\mathrm{K}}$ to the kernel of $\partial_{n}^{\mathrm{L}}$ , and the image of $\partial_{n}^{\mathrm{K}}$ to the image of $\partial_{n}^{\mathrm{L}}$ . Thus, for every $n\in\mathbb{Z}_{+}$ , $\overline{\phi}$ sends $\mathcal{Z}_{n}(\mathrm{K},\mathbb{F})$ to $\mathcal{Z}_{n}(\mathrm{L},\mathbb{F})$ and $\mathcal{B}_{n}(\mathrm{K},\mathbb{F})$ to $\mathcal{B}_{n}(\mathrm{L},\mathbb{F})$ . This provides us with the map $H_{n}(\phi,\mathbb{F}):H_{n}(\mathrm{K},\mathbb{F})\rightarrow H_{n}(\mathrm{L},\mathbb{F})$ defined as $H_{n}(\phi,\mathbb{F})\left(\frac{\mathcal{Z}_{n}(\mathrm{K},\mathbb{F})}{\mathcal{B}_{n}(\mathrm{K},\mathbb{F})}\right)=\frac{\overline{\phi}(\mathcal{Z}_{n}(\mathrm{K},\mathbb{F}))}{\overline{\phi}(\mathcal{B}_{n}(\mathrm{K},\mathbb{F}))}\subseteq\frac{\mathcal{Z}_{n}(\mathrm{L},\mathbb{F})}{\mathcal{B}_{n}(\mathrm{L},\mathbb{F})}=H_{n}(\mathrm{L},\mathbb{F})$ . It is straightforward to check that for any simplicial complex $\mathrm{K}$ , $H_{n}(\mathrm{id}_{\mathrm{K}},\mathbb{F})=\mathrm{id}_{H_{n}(\mathrm{K},\mathbb{F})}$ , and for simplicial maps $\phi:\mathrm{K}\rightarrow\mathrm{L}$ , $\psi:\mathrm{L}\rightarrow\mathcal{N}$ , $H_{n}(\psi\circ\phi,\mathbb{F})=H_{n}(\psi,\mathbb{F})\circ H_{n}(\phi,\mathbb{F})$ . ∎

A direct corollary of the above theorem is the following.

Corollary 3.19.

Let $\mathcal{V}_{\mathbb{F}}$ denote the category of finite dimensional vector spaces over the field $\mathbb{F}$ with linear transformations. Then, for every $n\in\mathbb{Z}_{+}$ , $H_{n}(\ast,\mathbb{F}):\mathcal{S}\rightarrow\mathcal{V}_{\mathbb{F}}$ is a functor.

We now introduce the concept of contiguous simplicial maps which will be used crucially later.

Definition 3.20 (Contiguous Simplicial Maps).

Given simplicial complexes $\mathrm{K}$ and $\mathrm{L}$ , simplicial maps $f,g:\mathrm{K}\rightarrow\mathrm{L}$ are said to be contiguous if for every simplex $\sigma\in\mathrm{K}$ , $f(\sigma)\cup g(\sigma)$ is a simplex in $\mathrm{L}$ .

The following lemma states that contiguous maps agree at the level of homology groups.

Lemma 3.21 ([Mun96]).

For all $k\in\mathbb{N}$ , and simplicial complexes $\mathrm{K},\mathrm{L}$ , if maps $f,g:\mathrm{K}\rightarrow\mathrm{L}$ are contiguous, then $H_{k}(f)=H_{k}(g):H_{k}(\mathrm{K},\mathbb{F})\rightarrow H_{k}(\mathrm{L},\mathbb{F})$ .

In the next section, we describe how to use the machinery developed in this section for studying data sets.

4 Persistent Homology

Persistent homology is a tool that is widely used for studying data sets. The persistent homology pipeline consists of four steps which are outlined below. We remark that some of the terminologies used below have not been defined yet. We will define these later in the section. The pipeline is introduced before so as to provide motivation for this section.

We start with a finite metric space $(X,d_{X})$ . We recall that every finite dataset can be viewed as a metric space by defining a measure of dissimilarity between its data points. 2. 2.

We assign a filtered simplicial complex to the metric space $(X,d_{X})$ . There are many methods for constructing filtered simplicial complexes from finite metric spaces. We will describe some of these methods in this section. 3. 3.

For every $n\in\mathbb{Z}_{+}$ , we apply the homology functor $H_{n}(\ast,\mathbb{F})$ to the filtered simplicial complex obtained in the last step. This produces persistence vector spaces. 4. 4.

For every persistence vector space obtained in the last step, we determine the persistence diagram associated with it.

We will see that the persistence diagrams obtained in the end encode features of the input data set. We now provide missing details from the above pipeline.

Definition 4.1 (Filtered simplicial complex).

A filtered simplicial complex is a sequence of simplicial complexes $\{\mathrm{K}_{\delta}\}_{\delta\in\mathbb{R}_{+}}$ such that for all $\delta\leq\delta^{\prime}$ , $\mathrm{K}_{\delta}\subseteq\mathrm{K}_{\delta^{\prime}}$ .

We now see some examples of filtered simplicial complexes that can be constructed from a finite metric space $(X,d_{X})$ .

Definition 4.2 (Vietoris-Rips Complex).

Given $(X,d_{X})\in\mathcal{M}$ and $\delta\geq 0$ , define

[TABLE]

It is straightforward to see that a Vietoris-Rips complex is a legitimate simplicial complex. In addition, we have that for $\delta\leq\delta^{\prime}$ , $\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\subseteq\mathrm{K}_{\delta^{\prime}}^{\mathrm{VR}}(X)$ . This is because, for every $\sigma\in\mathrm{K}_{\delta}^{\mathrm{VR}}(X)$ , $\mathrm{diam}(\sigma)\leq\delta\leq\delta^{\prime}$ , and therefore, $\sigma\in\mathrm{K}_{\delta^{\prime}}^{\mathrm{VR}}(X)$ . Thus, for every finite metric space $(X,d_{X})$ , $\mathrm{K}_{\bullet}^{\mathrm{VR}}(X)=\{\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\xrightarrow{i^{X}_{\delta,\delta^{\prime}}}\mathrm{K}_{\delta^{\prime}}^{\mathrm{VR}}(X)\}_{\delta\leq\delta^{\prime}}$ is a filtered simplicial complex. Here, $i^{X}_{\delta,\delta^{\prime}}$ is the inclusion map. The next proposition follows from the definitions.

Proposition 4.3.

Let $\delta\geq 0$ be fixed. Then, $\mathrm{K}_{\delta}^{\mathrm{VR}}:\mathcal{M}\rightarrow\mathcal{S}$ is a functor.

Another example of a filtered simplicial complex is the Čech complex.

Definition 4.4 (Čech Complex).

Given $(X,d_{X})\in\mathcal{M}$ and $\delta>0$ , define

[TABLE]

It is an easy exercise to check that for every $(X,d_{X})\in\mathcal{M}$ , $\{\check{\mathrm{C}}_{\delta}(X)\}_{\delta\in\mathbb{R}_{+}}$ is a filtered simplicial complex.

We have now explained the second step of the persistent homology pipeline. The third step is applying the homology map on a filtered simplicial complex to obtain a persistence vector space.

Definition 4.5 (Persistence Vector Space[Car14]).

A persistence vector space $\mathbb{V}$ over a field $\mathbb{F}$ is a collection of vector spaces $\{V_{\delta}\}_{\delta\in\mathbb{R}_{+}}$ over $\mathbb{F}$ and $\mathbb{F}$ -linear maps $\{V_{\delta}\xrightarrow{v_{\delta,\delta^{\prime}}}V_{\delta^{\prime}}\}$ with the following properties:

For all $\delta\geq 0$ , the map $v_{\delta,\delta}:V_{\delta}\rightarrow V_{\delta}$ is the identity map on $V_{\delta}$ . 2. 2.

For all $\delta^{\prime\prime}\geq\delta^{\prime}\geq\delta$ , the following diagram commutes:

[TABLE]

Precisely, we have $v_{\delta,\delta^{\prime\prime}}=v_{\delta^{\prime},\delta^{\prime\prime}}\circ v_{\delta,\delta^{\prime}}$ .

We use $P\mathcal{V}(\mathbb{F})$ to denote the collection of all persistence vector spaces over the field $\mathbb{F}$ .

Given a finite metric space $(X,d_{X})$ , a filtered simplicial complex $\{\mathrm{K}_{\delta}(X)\}_{\delta\in\mathbb{R}_{+}}$ and a sequence $0\leq\delta_{1}\leq\ldots\leq\delta_{n}$ , for every $k\in\mathbb{Z}_{+}$ the sequence

[TABLE]

forms a persistence vector space (where the maps are induced by the simplicial inclusions), since $H_{k}(\ast,\mathbb{F}):\mathcal{S}\rightarrow\mathcal{V}_{\mathbb{F}}$ is a functor.

Definition 4.6 (Morphisms of Persistence Vector Spaces[Car14]).

Given $\mathbb{V},\mathbb{W}\in P\mathcal{V}(\mathbb{F})$ , a morphism $\alpha:\mathbb{V}\rightarrow\mathbb{W}$ is a collection of linear maps $\alpha_{\delta}:V_{\delta}\rightarrow W_{\delta}$ , $\delta\in\mathbb{R}_{+}$ , such that the following diagram commutes for every $\delta\leq\delta^{\prime}$ :

[TABLE]

We say that $\alpha:\mathbb{V}\rightarrow\mathbb{W}$ is an isomorphism if each $\alpha_{\delta}:V_{\delta}\rightarrow W_{\delta}$ is an isomorphism of vector spaces. In this case, we write $\mathbb{V}\cong\mathbb{W}$ .

It will be useful to consider persistence vector spaces of finite length (indexed by natural numbers). A persistence vector space of length $n\in\mathbb{N}$ is any sequence $\{V_{i}\xrightarrow{v_{i,i+1}}V_{i+1}\}_{i\in[1:n-1]}$ of vector spaces over $\mathbb{F}$ and $\mathbb{F}$ -linear maps. In analogy with Definition 4.5, here we assume that the map $v_{i,j}=v_{j-1,j}\circ v_{j-2,j-1}\circ\cdots v_{i,i+1}$ for all $1\leq i<j\leq n$ and $v_{i,i}=\mathrm{id}_{V_{i}}$ for all $i\in[1:n].$ For $n\in\mathbb{Z}_{+}$ , let $P\mathcal{V}_{n}(\mathbb{F})$ denote the collection of all persistence vector spaces over the field $\mathbb{F}$ of length $n$ .

Definition 4.7 (Sampling map).

Let $\mathbb{V}$ be any persistence vector space. Given a finite set $A\subset\mathbb{R}_{+}$ with $|A|=n$ , we write $A=\{\alpha_{1}<\alpha_{2}<\cdots\alpha_{n}\}$ and consider the $A$ -sampling map

[TABLE]

defined by

[TABLE]

where for each $i\in[1:n]$ , $V_{i}^{A}:=V_{\alpha_{i}}$ , and $v_{i,i+1}^{A}:=v_{\alpha_{i},\alpha_{i+1}}.$

We now concentrate on persistence vector spaces of length $n$ and describe a full invariant for those. An invariant of persistent modules is any map $\iota:P\mathcal{V}_{n}(\mathbb{F})\rightarrow\mathcal{I}$ into some set $\mathcal{I}$ such that $\mathbb{V}\cong\mathbb{W}$ implies $\iota(\mathbb{V})=\iota(\mathbb{W})$ . An invariant $\iota$ is a full invariant if $\iota(\mathbb{V})=\iota(\mathbb{W})$ implies that $\mathbb{V}\cong\mathbb{W}$ .

The full invariants of persistence vector spaces are called Persistence Diagrams and will help us associate an algebraic signature to finite metric spaces.

Persistence diagrams of persistence vector spaces of length $n$ .

We now assume that for every $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ and for every $i\in[1:n]$ , $\dim(V_{i})<\infty$ . Thus, $P\mathcal{V}_{n}(\mathbb{F})$ is the collection of all pointwise finite dimensional (pfd) persistence vector spaces of length $n$ , $n\in\mathbb{Z}_{+}$ . The reason behind this assumption is that such persistence vector spaces have a simple representation in terms of interval persistence vector spaces.

In the same way that finite dimensional vector spaces can be classified up to isomorphism by their dimension, finite length persistence vector spaces admit a classification based on certain finite multisets of points in the plane. In particular, it is not true that persistence vector spaces can be classified by the sequence of dimensions.

Example 4.8

Assume that $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ . Consider the vector

[TABLE]

We claim that there exists a natural number $n$ and $\mathbb{V},\mathbb{W}\in P\mathcal{V}_{n}(\mathbb{F})$ such that $\mathbb{V}\not\cong\mathbb{W}$ but $\dim(\mathbb{V})=\dim(\mathbb{W})$ . This can be seen from the following example: let $\mathbb{V}=\mathbb{F}\xrightarrow{v_{1,2}}\mathbb{F}^{2}\xrightarrow{v_{2,3}}\mathbb{F}$ , where $v_{1,2}(1_{\mathbb{F}})=(1_{\mathbb{F}},0_{\mathbb{F}})$ , and $v_{2,3}((1_{\mathbb{F}},0_{\mathbb{F}}))=v_{2,3}((0_{\mathbb{F}},1_{\mathbb{F}}))=1_{\mathbb{F}}$ . Let $\mathbb{W}=\mathbb{F}\xrightarrow{w_{1,2}}\mathbb{F}^{2}\xrightarrow{w_{2,3}}\mathbb{F}$ , where $w_{1,2}(1_{\mathbb{F}})=(1_{\mathbb{F}},0_{\mathbb{F}}),w_{2,3}((1_{\mathbb{F}},0_{\mathbb{F}}))=0_{\mathbb{F}}$ , and $w_{2,3}((0_{\mathbb{F}},1_{\mathbb{F}}))=1_{\mathbb{F}}$ . We have $\dim(\mathbb{V})=\dim(\mathbb{W})$ . Suppose $\mathbb{V}$ and $\mathbb{W}$ are isomorphic. Then, for $i=1,2,3$ , there exist isomorphisms $\alpha_{i}:V_{i}\rightarrow W_{i}$ such that all squares in the following diagram commute:

[TABLE]

Let $\alpha_{1}(1_{\mathbb{F}})=a\cdot 1_{\mathbb{F}}$ , and $\alpha_{3}(1_{\mathbb{F}})=b\cdot 1_{\mathbb{F}}$ , where $0\neq a,b\in\mathbb{F}$ . Here, $a,b\neq 0$ because both $\alpha_{1}$ and $\alpha_{3}$ are isomorphisms. Now, we have $v_{1,3}(1_{\mathbb{F}})=v_{2,3}\circ v_{1,2}(1_{\mathbb{F}})=1_{\mathbb{F}}$ , and similarly $w_{1,3}(1_{\mathbb{F}})=w_{2,3}\circ w_{1,2}(1_{\mathbb{F}})=0_{\mathbb{F}}$ . Then, we obtain

[TABLE]

This contradicts the commutativity of all squares. Thus, we conclude that $\mathbb{V}$ and $\mathbb{W}$ are not isomorphic.

The above example shows that for $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ , $\dim(\mathbb{V})$ is not a full invariant of $\mathbb{V}$ . The construction of a full invariant of a persistence vector space requires a more subtle approach which depends on the notion of persistence diagrams of persistence vector spaces (see Corollary 4.13 below).

Definition 4.9 (Interval persistence vector space).

Given $n\in\mathbb{N}$ and $b,d\in[1:n],~{}b\leq d$ , an interval persistence vector space is defined as follows: $V_{i}=0$ for all $i<b$ and $i>d$ , and $V_{i}=\mathbb{F}$ for all $b\leq i\leq d$ . The map between the [math]-vector spaces, as well as maps $0\rightarrow\mathbb{F}$ and $\mathbb{F}\rightarrow 0$ are specified to be the [math]-maps. The maps $\mathbb{F}\rightarrow\mathbb{F}$ are specified to be identity maps. Such a persistence vector space is denoted by $\mathbb{I}(b,d)$ . Thus, we have

[TABLE]

An example of an interval persistence vector space is $\mathbb{I}(2,3)=0\rightarrow\mathbb{F}\rightarrow\mathbb{F}\rightarrow 0$ . We now have the following theorem.

Theorem 4.10 ([CB12]).

For every $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ , there exist intervals $[b_{i},d_{i}]_{i\in I}$ , $I$ an indexing set such that for every $i\in I$ , $b_{i},d_{i}\in[1:n]$ , and $\mathbb{V}\cong\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})$ .

Furthermore, we have the following theorem.

Theorem 4.11 (Krull-Remak-Schmidt-Azumaya [Azu50]).

Let $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ and $\mathbb{V}=\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})=\bigoplus_{j\in J}\mathbb{I}(b_{j},d_{j})$ be two decompositions of $\mathbb{V}$ into interval persistence vector spaces. Then, $|I|=|J|$ and there exists a permutation $\pi\in S_{N}$ , $N=|I|$ such that for all $i\in I$ , there exists $j\in J$ satisfying $i=\pi(j)$ .

A consequence of the above theorems is that for every $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ , if $\mathbb{V}=\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})$ , then the multiset $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ is a full invariant of $\mathbb{V}$ . This multiset is called the persistence diagram of $\mathbb{V}$ , and this brings us to the fourth step of the persistent homology pipeline.

Definition 4.12 (Persistence Diagram).

Let $n\in\mathbb{N}$ and $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ be a persistence vector space. Let $\mathbb{V}=\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})$ . Then, the persistence diagram of $\mathbb{V}$ , denoted by $\mathrm{dgm}(\mathbb{V})$ is defined as the multiset of intervals $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ .

Corollary 4.13.

For any $\mathbb{V},\mathbb{W}\in P\mathcal{V}_{n}(\mathbb{F})$ it holds that $\mathbb{V}\cong\mathbb{W}$ if and only if $\mathrm{dgm}(\mathbb{V})=\mathrm{dgm}(\mathbb{W}).$

Let $\mathcal{D}$ denote the collection of all multisets $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ , where $b_{i}\leq d_{i}$ are non-negative real numbers, and for $n\in\mathbb{N}$ , let $\mathcal{D}_{[1:n]}$ denote the collection of all multisets $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ , where $b_{i}\leq d_{i}$ and $\{b_{i},d_{i}~{}|~{}i\in I\}\subseteq[1:n]$ .

We note that for any $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ , $\mathrm{dgm}(\mathbb{V})$ is a collection of points in $\mathbb{R}^{2}$ . For $\mathrm{dgm}(\mathbb{V})=\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ , the $b_{i}$ ’s are referred to as the birth times and are represented on the x-axis, while $d_{i}$ ’s are referred to as the death times and are represented on the y-axis. Since $b_{i}\leq d_{i}$ for all $i\in I$ , the points of $\mathrm{dgm}(\mathbb{V})$ lie on or above the $x=y$ line in $\mathbb{R}^{2}$ . For example, consider $\mathbb{V}=\mathbb{I}(1,5)\oplus\mathbb{I}(3,4)$ . Then, the persistence diagram of $\mathbb{V}$ is depicted in the following figure:

$1$$2$$3$$4$$5$$6$$7$$1$$2$$3$$4$$5$$6$$b(birth)$$d(death)$

Another way of depicting $\mathrm{dgm}(\mathbb{V})$ is through barcodes. The following diagram depicts the barcode of $\mathbb{V}=\mathbb{I}(1,5)\oplus\mathbb{I}(3,4)$ .

$1$$2$$3$$4$$5$$6$$7$

Vietoris-Rips persistence diagrams of finite metric spaces.

Now, given $(X,d_{X})\in\mathcal{M}$ , we define the spectrum of $(X,d_{X})$ as

[TABLE]

Let $n=|\mathrm{spec}(X)|$ and write $\mathrm{spec}(X)=\{0=\delta_{1}<\delta_{2}<\ldots<\delta_{n}=\mathrm{diam}(X)\}$ . Given an integer $k\geq 0$ we consider the persistence vector space $\mathbb{V}$ of length $n$ defined as

[TABLE]

Here, $\mathbf{S}$ is the sampling map as given in definition 4.7. Now we need a process that is in some sense dual to sampling. Given a finite set $\{\alpha_{1}<\cdots<\alpha_{n}\}=A\subset\mathbb{R}_{+}$ with $|A|=n$ , we define a map

[TABLE]

to be the function satisfying

(Additivity) For all $D_{1},D_{2}\in\mathcal{D}_{[1:n]}$ , $\mathbf{T}(D_{1}\sqcup D_{2},A)=\mathbf{T}(D_{1},A)\sqcup\mathbf{T}(D_{2},A)$ . 2. 2.

(Definition on atomic elements) We define

(a)

$\mathbf{T}(\{\!\{(1,n)\}\!\},A):=\{\!\{(\alpha_{1},\infty)\}\!\}$ . 2. (b)

For all $j\in[1:n]$ , $\mathbf{T}(\{\!\{(j,j)\}\!\},A):=\{\!\{(\alpha_{j},\alpha_{j+1})\}\!\}$ . 3. (c)

For all $i,j\in[1:n]$ with $i<j$ and $(i,j)\neq(1,n)$ , $\mathbf{T}(\{\!\{(i,j)\}\!\},A):=\{\!\{(\alpha_{i},\alpha_{j})\}\!\}$ .

Note that these properties uniquely determine the map $\mathbf{T}(\cdot,A)$ . We illustrate how the map $\mathbf{T}(\cdot,A)$ works via the following example.

Example 4.14

Let $n\geq 3$ , and $A=\{\alpha_{1}<\ldots<\alpha_{n}\}\subset\mathbb{R}_{+}$ . Let $D=\{\!\{(1,1),(1,2),(1,n),(2,3)\}\!\}$ . Then, we have that

[TABLE]

We now have the following definition.

Definition 4.15 ( $k$ -th Vietoris-Rips Persistence Diagram).

Given $(X,d_{X})\in\mathcal{M}$ and $k\in\mathbb{N}$ , the $k$ -th Vietoris-Rips persistence diagram of $(X,d_{X})$ is defined as the

[TABLE]

Example 4.16

We now provide an example to illustrate the definitions. Let $X=\{a,b\}$ with $d_{X}(a,b)=1$ . The filtered Vietoris-Rips simplicial complex of $X$ is as follows:

[TABLE]

Let $\mathrm{K}^{1}=\{[a],[b]\}$ and $\mathrm{K}^{2}=\{[a,b],[a],[b]\}$ . The set of [math]-simplices of $\mathrm{K}^{1}$ is $\mathrm{K}^{1}_{n}=\{[a],[b]\}$ and for $n>0$ , the set of $n$ -simplices of $\mathrm{K}^{1}$ is $\mathrm{K}^{1}_{n}=\emptyset$ . The set of [math]-simplices of $\mathrm{K}^{2}$ is $\mathrm{K}^{2}_{0}=\{[a],[b]\}$ , the set of $1$ -simplices of $\mathrm{K}^{2}$ is $\mathrm{K}^{2}_{1}=\{[a,b]\}$ , and, for all $n\geq 2$ , the set of $n$ -simplices of $\mathrm{K}^{2}$ is $\mathrm{K}^{2}_{n}=\emptyset$ . Thus, we have that $H_{0}(\mathrm{K}^{1},\mathbb{F})=\mathbb{F}^{2}$ and $H_{k}(\mathrm{K}^{1},\mathbb{F})=0$ for all $k\geq 1$ . Similarly, $H_{0}(\mathrm{K}^{2},\mathbb{F})=\mathbb{F}$ and $H_{k}(\mathrm{K}^{2},\mathbb{F})=0$ for all $k\geq 1$ . This implies that

[TABLE]

with the transition $\mathbb{F}^{2}\rightarrow\mathbb{F}$ occurring at $\delta=1$ , and the notation $V\rightarrow\ldots$ meaning that all vector spaces hidden in the dots are $V$ . Furthermore, we have that $H_{k}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})=0$ for all $k\geq 1$ . Thus, we have that $\mathrm{dgm}_{k}^{\mathrm{VR}}(X)=\emptyset$ for all $k\geq 1$ . We now calculate $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ , using the maps $\mathbf{S}$ and $\mathbf{T}$ defined above.

We have that $\mathrm{spec}(X)=\{0,1\}$ , and therefore $\mathbf{S}(H_{0}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathrm{spec}(X))$ is a persistence vector space of length $2$ given by

[TABLE]

with the map $v_{1,2}$ being defined as $v_{1,2}((1_{\mathbb{F}},0_{\mathbb{F}}))=v_{1,2}((0_{\mathbb{F}},1_{\mathbb{F}}))=1_{\mathbb{F}}$ .

Clearly, we have

[TABLE]

Thus, we obtain $\mathrm{dgm}(\mathbb{V}_{0}^{X})=\{\!\{(1,2),(1,1)\}\!\}$ . We now apply the map $\mathbf{T}(\cdot,\mathrm{spec}(X))$ to $\mathrm{dgm}(\mathbb{V}_{0}^{X})$ in order to obtain $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ . By definition, we have

[TABLE]

Thus, we obtain $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)=\{\!\{(0,\infty),(0,1)\}\!\}$ .

Example 4.17

We now consider another example of a metric space with $4$ points, depicted in Figure 3. This metric space is defined as follows:

[TABLE]

Thus, $X$ consists of the corners of a square of side length $1$ , with $\ell_{1}$ -distance. The filtered Vietoris-Rips simplicial complex of $X$ is as follows:

[TABLE]

Let $\mathrm{K}^{1}:=\mathrm{K}^{\mathrm{VR}}_{0}(X)$ , $\mathrm{K}^{2}:=\mathrm{K}^{\mathrm{VR}}_{1}(X)$ , and $\mathrm{K}^{3}=\mathrm{K}^{\mathrm{VR}}_{2}(X)$ . For $i=\{1,2,3\}$ and $j\in\mathbb{Z}_{+}$ , let $\mathrm{K}^{i}_{j}$ denote set of $j$ -simplices of $\mathrm{K}^{i}$ . Then, we have that $\mathrm{K}^{1}_{0}=\mathrm{K}^{1}$ and for $j\geq 1$ , $\mathrm{K}^{1}_{j}=\emptyset$ . This implies that

[TABLE]

and $H_{n}(\mathrm{K}^{1},\mathbb{F})=0$ for all $n\geq 1$ .

For $\mathrm{K}^{2}$ , we have $\mathrm{K}^{2}_{0}=\{[1],[2],[3],[4]\}$ , $\mathrm{K}^{2}_{1}=\{[1,2],[2,3],[3,4],[1,4]\}$ and for $j\geq 2$ , $\mathrm{K}^{2}_{j}=\emptyset$ . The chain complex of $\mathrm{K}^{2}$ looks as $\langle\mathrm{K}^{2}_{1}\rangle_{\mathbb{F}}\xrightarrow{\partial_{1}}\langle\mathrm{K}^{2}_{0}\rangle_{\mathbb{F}}\xrightarrow{\partial_{0}}0$ . We have that $\mathcal{B}_{0}(\mathrm{K}^{2},\mathbb{F})=\mathrm{image}(\partial_{1})=\mathrm{span}\left([2]-[1],[3]-[2],[4]-[3],[4]-[1]\right)$ . Clearly, $\dim(\mathcal{B}_{0}(\mathrm{K}^{2},\mathbb{F}))=3$ , and thus we obtain that

[TABLE]

We also obtain that $\dim(\mathrm{ker}(\partial_{1}))=1$ , and thus

[TABLE]

Clearly, $H_{n}(\mathrm{K}^{2},\mathbb{F})=0$ for all $n\geq 2$ .

For $\mathrm{K}^{3}$ , we have $\mathrm{K}^{3}_{0}=\{[1],[2],[3],[4]\}$ , $\mathrm{K}^{3}_{1}=\{[1,2],[2,3],[3,4],[1,4],[1,3],[2,4]\}$ , $\mathrm{K}^{3}_{2}=\{[1,2,3],[1,2,4],[1,3,4],[2,3,4]\}$ , $\mathrm{K}^{3}_{3}=\{[1,2,3,4]\}$ , and $\mathrm{K}^{3}_{n}=\emptyset$ for all $n\geq 4$ . The chain complex of $\mathrm{K}^{3}$ looks as

[TABLE]

We have $\mathcal{B}_{0}(\mathrm{K}^{3},\mathbb{F})=\mathrm{image}(\partial_{1})=\mathrm{span}\big{(}[2]-[1],[3]-[2],[4]-[3],[4]-[1],[3]-[1],[4]-[2]\big{)}$ . We observe that $\dim(\mathcal{B}_{0}(\mathrm{K}^{3},\mathbb{F}))=3$ , and therefore

[TABLE]

We have $\mathcal{Z}_{1}(\mathrm{K}^{3},\mathbb{F})=\mathrm{ker}(\partial_{1})=\mathrm{span}\big{(}[1,2]+[2,3]-[1,3],[2,3]+[3,4]-[2,4],[1,3]+[3,4]-[1,4],[1,2]+[2,4]-[1,4],[1,2]+[2,3]+[3,4]-[1,4]\big{)}.$ It is an easy exercise to check that $\dim(\mathcal{Z}_{1}(\mathrm{K}^{3},\mathbb{F}))=3$ . We have $\mathcal{B}_{1}(\mathrm{K}^{3},\mathbb{F})=\mathrm{image}(\partial_{2})=\mathrm{span}\big{(}[2,3]-[1,3]+[1,2],[2,4]-[1,4]+[1,2],[3,4]-[1,4]+[1,3],[3,4]-[2,4]+[2,3]\big{)}$ . It is easy to see that $\dim(\mathcal{B}_{1}(\mathrm{K}^{3},\mathbb{F}))=3$ , and therefore, we obtain

[TABLE]

We have $\mathcal{Z}_{2}(\mathrm{K}^{3},\mathbb{F})=\mathrm{ker}(\partial_{2})=\mathrm{span}\big{(}[1,2,3]+[1,3,4]-[1,2,4]-[2,3,4]\big{)}$ , and therefore $\dim(\mathcal{Z}_{2}(\mathrm{K}^{3},\mathbb{F}))=1$ . We have $\mathcal{B}_{2}(\mathrm{K}^{3},\mathbb{F})=\mathrm{image}(\partial_{3})=\mathrm{span}\big{(}[2,3,4]-[1,3,4]+[1,2,4]-[1,2,3]\big{)}=\mathcal{Z}_{2}(\mathrm{K}^{3},\mathbb{F}).$ This implies that

[TABLE]

Clearly, for $n\geq 3$ , $H_{n}(\mathrm{K}^{3},\mathbb{F})=0$ .

Therefore, the homology groups of $\mathrm{K}^{\mathrm{VR}}_{\bullet}(X)$ are as follows:

[TABLE]

with the transition $\mathbb{F}^{4}\rightarrow\mathbb{F}^{1}$ occurring at $\delta=1$ ,

[TABLE]

with the transition $0\rightarrow\mathbb{F}$ occurring at $\delta=1$ and the transition $\mathbb{F}\rightarrow 0$ occurring at $\delta=2$ ; and $H_{n}(\mathrm{K}^{\mathrm{VR}}_{\bullet}(X),\mathbb{F})=0$ for all $n\geq 2$ . We now calculate $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ and $\mathrm{dgm}_{1}^{\mathrm{VR}}(X)$ using maps $\mathbf{S}$ and $\mathbf{T}$ .

We have that $\mathrm{spec}(X)=\{0,1,2\}$ , and therefore for $k=0,1$ , $\mathbf{S}(H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X),\mathrm{spec}(X))$ are persistence vector spaces of length $3$ given by

[TABLE]

Here, we have that for all permutations $\sigma\in S_{4}$ , $v_{1,2}(\sigma(1_{\mathbb{F}},0_{\mathbb{F}},0_{\mathbb{F}},0_{\mathbb{F}}))=1_{\mathbb{F}}$ , and $v_{2,3}(1_{\mathbb{F}})=1_{\mathbb{F}}$ . The maps of $\mathbb{V}_{1}^{X}$ are the trivial maps. Clearly, we have

[TABLE]

Thus, we obtain that $\mathrm{dgm}(\mathbb{V}_{0}^{X})=\{\!\{(1,3),(1,1),(1,1),(1,1)\}\!\}$ and $\mathrm{dgm}(\mathbb{V}_{1}^{X})=\{\!\{(2,2)\}\!\}$ . We now apply the map $\mathbf{T}(\cdot,\mathrm{spec}(X))$ to both $\mathrm{dgm}(\mathbb{V}_{0}^{X})$ and $\mathrm{dgm}(\mathbb{V}_{1}^{X})$ to obtain

[TABLE]

4.1 Interpretation of Clustering via 0-Dimensional Persistence Diagram

Let $(X,d_{X})\in\mathcal{M}$ . We now make some observations about $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ . We first observe that the number of intervals in $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ is equal to $|X|$ . This is because by definition, $\mathrm{K}_{0}^{\mathrm{VR}}(X)$ consists of only singletons, and we know from Lemma 3.17 that $H_{0}(\mathrm{K}_{0}^{\mathrm{VR}}(X),\mathbb{F})=\mathbb{F}^{r}$ , where $r$ is the number of connected components of $\mathrm{K}_{0}^{\mathrm{VR}}(X)$ . Thus,

[TABLE]

This implies that there are $|X|$ intervals in the decomposition of $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})$ into interval persistence vector spaces. We simultaneously obtain that if $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})\cong\bigoplus_{i=1}^{|X|}\mathbb{I}(b_{i},d_{i})$ , then $b_{i}=0$ for all $i\in[1:|X|]$ . In the next proposition, we explicitly determine the intervals $\{\!\{(b_{i},d_{i})\}\!\}_{i=1}^{|X|}$ in $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ and provide a method of associating an interval with every point of $X$ .

We now recall the single linkage dendrogram of $(X,d_{X})$ , denoted by $\theta_{X}$ . We have that for every $t\geq 0$ , $\theta_{X}(t)$ is a partition of $X$ , and for $t^{\prime}\geq t$ , $\theta_{X}(t)$ is finer than $\theta_{X}(t^{\prime})$ . Let $\mathfrak{V}_{\mathbb{F}}:\mathcal{P}\rightarrow\mathcal{V}_{\mathbb{F}}$ denote the functor defined as $\mathfrak{V}_{\mathbb{F}}(\{B_{1},\ldots,B_{k}\})=\mathbb{F}^{k}$ . Here, $\{B_{1},\ldots,B_{k}\}$ denotes a partition of some finite metric space. It is straightforward to check that $\mathfrak{V}_{\mathbb{F}}$ is a functor. We observe that $\{\mathfrak{V}_{\mathbb{F}}\circ\theta_{X}(t)\rightarrow\mathfrak{V}_{\mathbb{F}}\circ\theta_{X}(t^{\prime})\}_{0\leq t\leq t^{\prime}}$ forms a pointwise finite dimensional persistence vector space, and thus admits an interval decomposition. Then, we have the following proposition.

Proposition 4.18.

For all $(X,d_{X})\in\mathcal{M}$ ,

[TABLE]

The proof of the above proposition is an easy exercise, and uses the observation that in the persistence vector space $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})$ , every time consecutive vector spaces in the sequence $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})$ are different, there is a merging of bars in the dendrogram $\theta_{X}$ . Thus, we observe that $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})$ is equivalent to single linkage clustering of $X$ , and therefore persistent homology generalizes clustering. In the next proposition, we explicitly determine the interval decomposition of $H_{0}(\mathrm{K}_{\bullet}^{\mathrm{VR}}(X),\mathbb{F})$ , and provide a map that associates to every element of $X$ , an interval of this decomposition.

Proposition 4.19.

Let $(X,d_{X})$ be a finite metric space, with $|X|=n$ . Let $X=\{x_{1},x_{2},\ldots,x_{n}\}$ , and let $u_{X}$ be the maximal sub-dominant ultrametric (Definition 2.11) on $X$ . Then,

[TABLE]

Proof.

We refer the reader to chapter $7$ of [EH10] for ideas used in this proof. In chapter $7.1$ of [EH10], the authors provide an algorithm for determining the intervals in $\mathrm{dgm}_{k}^{\mathrm{VR}}(X)$ for all $k\in\mathbb{Z}_{+}$ . A proof of correctness of this algorithm also appears in [EH10, Chapter 7.1]. Here, we briefly describe their algorithm for $k=0$ .

We define an arbitrary ordering $x_{1}<x_{2}<\ldots<x_{n}$ of elements of $X$ . Let $C_{1}\subset 2^{X}$ denote the collection of subsets of $X$ of cardinality at most $2$ . By definition, $C_{1}$ is a simplicial complex. We fix the following notation here: we have $\{x_{i},x_{j}\}\in C_{1}$ only for $i<j$ . We define a function $f:C_{1}\rightarrow\mathbb{R}_{+}$ as $f(\{x_{i},x_{j}\})=u_{X}(x_{i},x_{j})$ and $f(\{x_{i}\})=0$ , for all $1\leq i\leq j\leq n$ . We now define an ordering $<_{S}$ on elements of $C_{1}$ as follows: we first fix $\{x_{1}\}<_{S}\{x_{2}\}<_{S}\ldots<_{S}\{x_{n}\}$ . In order to determine the ordering of subsets of cardinality $2$ , we compare their values on the function $f$ . We set $\{x_{i},x_{j}\}<_{S}\{x_{k},x_{l}\}$ if $f(\{x_{i},x_{j}\})\leq f(\{x_{k},x_{l}\})$ . If subsets $\{x_{i},x_{j}\},\{x_{k},x_{l}\}$ are such that $f(\{x_{i},x_{j}\})=f(\{x_{k},x_{l}\})$ , then $\{x_{i},x_{j}\}<_{S}\{x_{k},x_{l}\}$ if and only if $i<k$ or $i=k,j<l$ . Thus, we use lexicographic ordering on elements of $C_{1}$ with same value on the function $f$ . We have that every element of $C_{1}$ of cardinality $2$ is a $1$ -dimensional face of itself, and every element of cardinality $1$ is a [math]-dimensional face of any set of cardinality $2$ containing it. The ordering $<_{S}$ satisfies that if $f(\{x_{i},x_{j}\})<f(\{x_{k},x_{l}\})$ , then $\{x_{i},x_{j}\}<_{S}\{x_{k},x_{l}\}$ , and the faces $\{x_{i}\},\{x_{j}\}$ of $\{x_{i},x_{j}\}$ satisfy $\{x_{i}\},\{x_{j}\}<_{S}\{x_{i},x_{j}\}$ . Thus, we have a compatible ordering of the faces of $C_{1}$ .

We now write the boundary matrix $B$ using this ordering of faces. The boundary matrix is a binary square matrix of size $n+\frac{n(n-1)}{2}$ . The size of $B$ is equal to $|C_{1}|$ , and the rows and columns of $B$ correspond to elements of $C_{1}$ ordered according to relation $<_{S}$ . By abuse of notation, we name the rows and columns of $B$ on their corresponding elements in $C_{1}$ . The columns of $B$ corresponding to the singleton sets of $C_{1}$ are set to be zero, while for all $1\leq i\leq j\leq n$ , the column $\{x_{i},x_{j}\}$ has $1$ in the rows $\{x_{i}\}$ and $\{x_{j}\}$ , and zero everywhere else. For every column $\{x_{i},x_{j}\}$ of $B$ , we denote by $\mathrm{low}(\{x_{i},x_{j}\})$ , the row of $B$ in which the lowest $1$ of the column $\{x_{i},x_{j}\}$ appears.

We perform some column additions in the matrix $B$ such that in the new matrix, no two columns have their lowest $1$ in the same row. This is done as follows: for simplicity, we number the columns from $1$ to $|C_{1}|$ , with the leftmost column being numbered $1$ and the rightmost column being numbered $|C_{1}|$ . We scan the boundary matrix from left to right and suppose that $t$ is the first column for which there is a column $s$ , $s<t$ , satisfying $\mathrm{low}(s)=\mathrm{low}(t)$ . In this case, we add column $s$ to column $t$ . Now, $\mathrm{low}(s)\neq\mathrm{low}(t)$ , but there might be some column $r$ , $r<t$ such that $\mathrm{low}(r)=\mathrm{low}(t)$ . We then add column $r$ to column $t$ . We keep performing such column additions till there is no column to the left of column $t$ with $\mathrm{low}$ value equal to $\mathrm{low}(t)$ . We then proceed to column $t+1$ and repeat. In the end, we obtain a matrix in which no two columns have their lowest $1$ in the same row. This matrix is called the reduced matrix and is denoted by $R$ . Now, for every non-zero column $\{x_{i},x_{j}\}$ in $R$ having lowest $1$ in the row $\{x_{j}\}$ , the interval $(f(\{x_{i}\}),f(\{x_{i},x_{l}\}))$ belongs to $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ . In addition, the interval $[0,\infty)$ belongs to $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ . This concludes the algorithm used to determine intervals in $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ .

We now provide an example in order to illustrate the above algorithm. Let $X=\{x_{1},x_{2},x_{3}\}$ , with $d_{X}(x_{1},x_{2})=1,d_{X}(x_{2},x_{3})=2$ and $d_{X}(x_{3},x_{1})=3$ . We have

[TABLE]

The function $f:C_{1}\rightarrow\mathbb{R}_{+}$ is defined as follows: $f(\{x_{1}\})=f(\{x_{2}\})=f(\{x_{3}\})=0,f(\{x_{1},x_{2}\})=u_{X}(x_{1},x_{2})=1,f(\{x_{2},x_{3}\})=u_{X}(x_{2},x_{3})=2,f(\{x_{1},x_{3}\})=u_{X}(x_{1},x_{3})=2$ . Thus, we have $\{x_{1}\}<_{S}\{x_{2}\}<_{S}\{x_{3}\}<_{S}\{x_{1},x_{2}\}<_{S}\{x_{1},x_{3}\}<_{S}\{x_{2},x_{3}\}$ . The boundary matrix is the following:

[TABLE]

The reduced matrix obtained after performing the required column operations is the following:

[TABLE]

The algorithm now implies that $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)=\{\!\{(0,1),(0,2),(0,\infty)\}\!\}$ . We now use the following claim to determine $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ for any $(X,d_{X})\in\mathcal{M}$ .

Claim 4.20.

For every $\{x_{k}\}\in C_{1}$ , the unique column in the reduced matrix $R$ with lowest $1$ in the row $\{x_{k}\}$ is the leftmost column in the boundary matrix $B$ with lowest $1$ in the row $\{x_{k}\}$ .

Proof of Claim.

Consider the row $\{x_{k}\}$ in the matrix $B$ , and the column in which $1$ appears for the first time in this row. If this $1$ is the lowest element of its column, then the column is $\{x_{i},x_{k}\}$ for some $i<k$ , and the interval $(f(\{x_{k}\},f(\{x_{i},x_{k}\})$ is added to $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)$ . We observe that $(f(\{x_{k}\},f(\{x_{i},x_{k}\})=(0,\min_{i<k}u_{X}(x_{i},x_{k}))$ . Now, suppose that the first $1$ of row $\{x_{k}\}$ is not the lowest element of its column. Thus, such a column is $\{x_{k},x_{m}\}$ for $k<m$ . In the column additions performed to obtain the reduced matrix, this particular $1$ becomes the lowest $1$ of some other column in two ways. First, if some column $\{x_{i},x_{m}\},i<k$ , on the left of column $\{x_{k},x_{m}\}$ is added to the column $\{x_{k},x_{m}\}$ , and second if column $\{x_{k},x_{m}\}$ is added to some column $\{x_{i},x_{m}\},i<k$ on its right. We first consider the case where there is a column $\{x_{i},x_{m}\},i<m$ to the left of column $\{x_{k},x_{m}\}$ . In this case, we have $u_{X}(x_{i},x_{m})\leq u_{X}(x_{k},x_{m})$ . Now, $u_{X}(x_{i},x_{k})\leq\max\{u_{X}(x_{i},x_{m}),u_{X}(x_{k},x_{m})\}=u_{X}(x_{k},x_{m})$ . Since $i<k$ , we have that $\{x_{i},x_{k}\}<_{S}\{x_{k},x_{m}\}$ . This contradicts the assumption that the first $1$ in the row $\{x_{k}\}$ appears in the column $\{x_{k},x_{m}\}$ . Therefore, there is no column $\{x_{i},x_{m}\}$ with $i<m$ to the left of $\{x_{k},x_{m}\}$ .

We now consider the second case i.e. there is a column $\{x_{i},x_{m}\},i<k$ to the right of $\{x_{k},x_{m}\}$ . Here, we have $u_{X}(x_{k},x_{m})\leq u_{X}(x_{i},x_{m})$ . Since

[TABLE]

we have that $\{x_{i},x_{k}\}<_{S}\{x_{i},x_{m}\}$ . Thus, we have a column $\{x_{i},x_{k}\}$ whose lowest $1$ is in the row $x_{k}$ , and this column appears before the column $\{x_{i},x_{m}\}$ . Thus, if column $\{x_{k},x_{m}\}$ is added to column $\{x_{i},x_{m}\}$ , the lowest $1$ of $\{x_{i},x_{m}\}$ is in row $\{x_{k}\}$ , but this does not affect the column $\{x_{i},x_{k}\}$ . This implies that the column $\{x_{k},x_{m}\}$ has no affect on the leftmost column of the boundary matrix with lowest $1$ in the row $\{x_{k}\}$ . Thus, we obtain that the interval corresponding to the row $\{x_{k}\}$ comes from the first column whose lowest $1$ is in row $\{x_{k}\}$ . This proves the claim. ∎

Note that this proof also suggests a method of associating an interval to every element of $X$ . In particular, the interval associated with $x_{k}\in X$ is the interval associated with row $\{x_{k}\}$ and the first column of the boundary matrix with lowest $1$ in row $\{x_{k}\}$ . By definition, the value of such a column under function $f$ is $\min_{i<k}u_{X}(x_{i},x_{k})$ . Thus, we have that the element $x_{k}\in X,k>1$ is associated with the interval $(0,\min_{i<k}u_{X}(x_{i},x_{k}))$ . We associate the interval $[0,\infty)$ with $x_{1}$ . ∎

Now, suppose that given finite metric spaces $(X,d_{X})$ and $(Y,d_{Y})$ , we construct the Vietoris-Rips simplicial complexes $\mathrm{K}_{\bullet}^{\mathrm{VR}}(X)$ and $\mathrm{K}_{\bullet}^{\mathrm{VR}}(Y)$ . Then, for a fixed $k\in\mathbb{Z}_{+}$ , we compute the persistence vector spaces $H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(X)$ and $H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(Y)$ , as well as their respective persistence diagrams. Suppose that we have a method of comparing two metric spaces as well as two persistence diagrams. Then, a natural question to ask is, if $(X,d_{X})$ and $(Y,d_{Y})$ are “almost identical”, then how do the persistence diagrams associated to $H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(X)$ and $H_{k}\circ\mathrm{K}_{\bullet}^{\mathrm{VR}}(Y)$ compare. The next section focuses towards formalizing this question and then answering it.

5 Stability of Invariants

In this section, we formalize the following question: if $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ are almost identical, then how do their respective $k$ -persistence diagrams, $\mathrm{dgm}_{k}^{\mathrm{VR}}(X)$ and $\mathrm{dgm}_{k}^{\mathrm{VR}}(Y)$ compare. This is done by defining a notion of dissimilarity between metric spaces, as well as between persistence diagrams. Therefore, we now define a notion of distance between metric spaces, and a notion of distance between persistence vector spaces as well as between persistence diagrams.

5.1 Gromov-Hausdorff Distance

In thie section, we define a notion of distance between two finite metric spaces. Let $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ . We say that $(X,d_{X})$ and $(Y,d_{Y})$ are identical if they are isometric.

Definition 5.1 (Isometry).

An isometry between $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ is a map $\phi:X\rightarrow Y$ such that $\phi$ is surjective, and for all $x,x^{\prime}\in X$ , $d_{X}(x,x^{\prime})=d_{Y}(\phi(x),\phi(x^{\prime}))$ .

Note that the condition $d_{X}(x,x^{\prime})=d_{Y}(\phi(x),\phi(x^{\prime}))$ ensures that $\phi$ is injective. Thus, if $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ are isometric, then there exists a bijective and distance preserving map $\phi:X\rightarrow Y$ . Since $\phi$ is a bijection between finite metric spaces, it has an inverse, say $\psi:Y\rightarrow X$ , with $\phi\circ\psi=\mathrm{id}_{Y}$ and $\psi\circ\phi=\mathrm{id}_{X}$ . We now define the distortion and co-distortion of maps $\phi$ and $\psi$ .

Definition 5.2 (Distortion).

Given $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , the distortion of a map $f:X\rightarrow Y$ is defined as

[TABLE]

Definition 5.3 (Co-distortion).

Given $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , and maps $f:X\rightarrow Y$ , $g:Y\rightarrow X$ , the co-distortion of $f$ and $g$ is defined as

[TABLE]

We observe that if $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ are isometric, then there exist maps $\phi:X\rightarrow Y$ and $\psi:Y\rightarrow X$ such that $\mathrm{dis}(\phi)=0,~{}\mathrm{dis}(\psi)=0$ and $C(\phi,\psi)=0$ . We now want to relax the notion of isometry between metric spaces to the notion of $\eta$ -isometry, for some $\eta>0$ .

Definition 5.4 ( $\eta$ -isometry).

Given $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ and $\eta>0$ , maps $\phi:X\rightarrow Y$ and $\psi:Y\rightarrow X$ constitute an $\eta$ -isometry between $(X,d_{X})$ and $(Y,d_{Y})$ if $\mathrm{dis}(\phi)\leq\eta$ , $\mathrm{dis}(\psi)\leq\eta$ and $C(\phi,\psi)\leq\eta$ .

We observe that if $\phi:X\rightarrow Y$ and $\psi:Y\rightarrow X$ constitute an $\eta$ -isometry between $(X,d_{X})$ and $(Y,d_{Y})$ , then $\phi\circ\psi$ is an “approximate identity” on $Y$ and $\psi\circ\phi$ is an “approximate identity” on $X$ . This means the following: for every $x\in X$ , we have

[TABLE]

Similarly, for every $y\in Y$ , we have $d_{Y}(y,\phi\circ\psi(y))\leq\eta$ . Now, we are ready to define a notion of distance between metric spaces.

Definition 5.5 (Gromov-Hausdorff distance).

The Gromov-Hausdorff distance between $(X,d_{X}),~{}(Y,d_{Y})\in\mathcal{M}$ is defined as

[TABLE]

Theorem 5.6 ([BBI01, Theorem 7.3.30]).

The function $d_{\mathrm{GH}}:\mathcal{M}\times\mathcal{M}\rightarrow\mathbb{R}_{+}$ is non-negative, symmetric and satisfies the triangle inequality; moreover $d_{\mathrm{GH}}((X,d_{X}),(Y,d_{Y}))=0$ if and only if $X$ and $Y$ are isometric.

Thus, we now have a notion of distance between finite metric spaces. The next step is to define a notion of distance between persistence vector spaces.

5.2 Interleaving Distance

We recall that $P\mathcal{V}(\mathbb{F})$ denotes the category of all persistence vector spaces over field $\mathbb{F}$ . For $\mathbb{V},\mathbb{W}\in P\mathcal{V}(\mathbb{F})$ , $\mathbb{V}=\{V_{\delta}\xrightarrow{v_{\delta,\delta^{\prime}}}V_{\delta^{\prime}}\}_{\delta\leq\delta^{\prime}}$ and $\mathbb{W}=\{W_{\delta}\xrightarrow{w_{\delta,\delta^{\prime}}}W_{\delta^{\prime}}\}_{\delta\leq\delta^{\prime}}$ , we recall that an isomorphism $\alpha:\mathbb{V}\rightarrow\mathbb{W}$ consists of maps $\alpha_{\delta}:V_{\delta}\rightarrow W_{\delta}$ for all $\delta$ , such that the following diagram commutes for all $\delta\leq\delta^{\prime}$ ,

[TABLE]

and $\alpha_{\delta}:V_{\delta}\rightarrow W_{\delta}$ is an isomorphism of vector spaces for all $\delta\in\mathbb{R}_{+}$ . We now relax this notion of isomorphism between persistence vector spaces.

Definition 5.7 ( $\eta$ -interleaving).

Let $\eta\geq 0$ be fixed. Given a field $\mathbb{F}$ , an $\eta$ -interleaving between $\mathbb{V},\mathbb{W}\in P\mathcal{V}(\mathbb{F})$ , $\mathbb{V}=\{V_{\delta}\xrightarrow{v_{\delta,\delta^{\prime}}}V_{\delta^{\prime}}\}_{\delta\leq\delta^{\prime}}$ and $\mathbb{W}=\{W_{\delta}\xrightarrow{w_{\delta,\delta^{\prime}}}W_{\delta^{\prime}}\}_{\delta\leq\delta^{\prime}}$ consists of maps

[TABLE]

such that the following diagrams commute for all $\delta$ :

[TABLE]

The above conditions are referred to as the triangle conditions. We also want the following diagrams to commute for all $\delta^{\prime}\geq\delta$ . These conditions are referred to as the parallelogram conditions.

[TABLE]

We set $\alpha=\{\alpha_{\delta}\}_{\delta}$ and $\beta=\{\beta_{\delta}\}_{\delta}$ , and say that $(\alpha,\beta)$ is an $\eta$ -interleaving between $\mathbb{V}$ and $\mathbb{W}$ .

We observe that an $\eta$ -interleaving is indeed a generalization of isomorphism between persistence vector spaces. In fact, if $(\alpha,\beta)$ is a [math]-interleaving between $\mathbb{V}$ and $\mathbb{W}$ , then it is straightforward to see that for every $\delta$ , $\beta_{\delta}\circ\alpha_{\delta}=\mathrm{id}_{V_{\delta}}$ and $\alpha_{\delta}\circ\beta_{\delta}=\mathrm{id}_{W_{\delta}}$ . Thus, $\alpha_{\delta}$ and $\beta_{\delta}$ become isomorphisms of vector spaces for all $\delta$ . We are now ready to define the interleaving distance between persistence vector spaces.

Definition 5.8 (Interleaving distance).

Given a field $\mathbb{F}$ , the interleaving distance between $\mathbb{V},\mathbb{W}\in P\mathcal{V}(\mathbb{F})$ is defined as

[TABLE]

Proposition 5.9 ([COGDS16]).

The function $d_{\mathrm{I}}:P\mathcal{V}(\mathbb{F})\times P\mathcal{V}(\mathbb{F})\rightarrow\mathbb{R}_{+}\cup\infty$ is non-negative, symmetric and satisfies the triangle inequality. However, $d_{\mathrm{I}}(\mathbb{V},\mathbb{W})$ may take value $\infty$ and $d_{\mathrm{I}}(\mathbb{V},\mathbb{W})$ might be zero even if $\mathbb{V}$ and $\mathbb{W}$ are not isomorphic.

We are now ready to prove the following stability theorem. For $(X,d_{X})\in\mathcal{M}$ , we recall that $\mathrm{K}_{\bullet}^{\mathrm{VR}}(X)$ is the Vietoris-Rips filtered simplicial complex associated with $(X,d_{X})$ .

Theorem 5.10 (Stability of Vietoris-Rips persistent homology).

For all $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ and $k\in\mathbb{N}$ ,

[TABLE]

Proof.

Let $\eta\geq 0$ be fixed. We show that if $X$ and $Y$ are $\eta$ -isometric, then $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X)$ and $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(Y)$ are $\eta$ -interleaved. Suppose that $(X,d_{X})$ and $(Y,d_{Y})$ are $\eta$ -isometric. Then, there exist maps $\phi:X\rightarrow Y$ and $\psi:Y\rightarrow X$ such that $\mathrm{dis}(\phi)\leq\eta$ , $\mathrm{dis}(\psi)\leq\eta$ and $C(\phi,\psi)\leq\eta$ . For some $\delta\geq 0$ , let $\sigma\in\mathrm{K}_{\delta}^{\mathrm{VR}}(X)$ . This implies that for all $x,x^{\prime}\in\sigma$ , $d_{X}(x,x^{\prime})\leq\delta$ . Since $\mathrm{dis}(\phi)\leq\eta$ , we obtain that $d_{Y}(\phi(x),\phi(x^{\prime}))\leq\delta+\eta$ for all $x,x^{\prime}\in\sigma$ . Thus, $\phi(\sigma)\in\mathrm{K}_{\delta+\eta}^{\mathrm{VR}}(Y)$ . Thus, for every $\delta>0$ , $\phi$ induces a map

[TABLE]

Similarly, for every $\delta>0$ , we obtain maps

[TABLE]

Thus, we obtain the following diagrams:

[TABLE]

If the above four diagrams were to commute, then, since $H_{k}(\ast,\mathbb{F}):\mathcal{S}\rightarrow\mathcal{V}(\mathbb{F})$ is a functor for every $k\in\mathbb{N}$ , the following diagrams will also commute: fix $k\in\mathbb{N}$ , and let $V^{X}_{\delta}:=H_{k}(\mathrm{K}_{\delta}^{\mathrm{VR}}(X),\mathbb{F})$ , $V^{Y}_{\delta}:=H_{k}(\mathrm{K}_{\delta}^{\mathrm{VR}}(Y),\mathbb{F})$ , $\phi_{\delta}^{\ast}:=H_{k}(\phi_{\delta})$ and $\psi_{\delta}^{\ast}:=H_{k}(\psi_{\delta})$ .

[TABLE]

We recall that given simplicial complexes $\mathrm{K}$ and $\mathrm{L}$ , simplicial maps $f,g:\mathrm{K}\rightarrow\mathrm{L}$ are called contiguous if for every simplex $\sigma\in\mathrm{K}$ , $f(\sigma)\cup g(\sigma)$ is a simplex in $\mathrm{L}$ . We have from Lemma 3.21 that for such maps, $H_{k}(f)=H_{k}(g):H_{k}(\mathrm{K},\mathbb{F})\rightarrow H_{k}(\mathrm{L},\mathbb{F})$ for all $k\in\mathbb{N}$ . Therefore, it suffices to show that the following maps in the four diagrams on $\mathrm{K}^{\mathrm{VR}}(X)$ and $\mathrm{K}^{\mathrm{VR}}(Y)$ are contiguous [Mun96]:

$i^{X}_{\delta,\delta+2\eta},~{}\psi_{\delta+\eta}\circ\phi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\rightarrow\mathrm{K}_{\delta+2\eta}^{\mathrm{VR}}(X).$ 2. 2.

$i^{Y}_{\delta,\delta+2\eta},~{}\phi_{\delta+\eta}\circ\psi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(Y)\rightarrow\mathrm{K}_{\delta+2\eta}^{\mathrm{VR}}(Y).$ 3. 3.

$\phi_{\delta^{\prime}}\circ i^{X}_{\delta,\delta^{\prime}},~{}i^{Y}_{\delta+\eta,\delta^{\prime}+\eta}\circ\phi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\rightarrow\mathrm{K}_{\delta^{\prime}+\eta}^{\mathrm{VR}}(Y).$ 4. 4.

$\psi_{\delta^{\prime}}\circ i^{Y}_{\delta,\delta^{\prime}},~{}i^{X}_{\delta+\eta,\delta^{\prime}+\eta}\circ\psi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(Y)\rightarrow\mathrm{K}_{\delta^{\prime}+\eta}^{\mathrm{VR}}(X).$

We first consider the pair of maps $i^{X}_{\delta,\delta+2\eta},~{}\psi_{\delta+\eta}\circ\phi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\rightarrow\mathrm{K}_{\delta+2\eta}^{\mathrm{VR}}(X)$ . Let $\sigma\in\mathrm{K}_{\delta}^{\mathrm{VR}}(X)$ be a simplex. In order to show that $i^{X}_{\delta,\delta+2\eta}(\sigma)\cup\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma)$ is a simplex in $\mathrm{K}_{\delta+2\eta}^{\mathrm{VR}}(X)$ , we need to show that $\mathrm{diam}(i^{X}_{\delta,\delta+2\eta}(\sigma)\cup\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma))\leq\delta+2\eta$ . Let $x,x^{\prime}\in i^{X}_{\delta,\delta+2\eta}(\sigma)\cup\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma)$ . If both $x,x^{\prime}\in i^{X}_{\delta,\delta+2\eta}(\sigma)$ , then $d_{X}(x,x^{\prime})\leq\delta\leq\delta+2\eta$ . If both $x,x^{\prime}\in\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma)$ , then there exist $a,a^{\prime}\in\sigma$ such that $x=\psi_{\delta+\eta}\circ\phi_{\delta}(a)$ and $x^{\prime}=\psi_{\delta+\eta}\circ\phi_{\delta}(a^{\prime})$ . Then,

[TABLE]

Here, the first and second inequalities hold because $\mathrm{dis}(\phi),~{}\mathrm{dis}(\psi)\leq\eta$ , while the third inequality hold because $a,a^{\prime}\in\sigma\in\mathrm{K}_{\delta}^{\mathrm{VR}}(X)$ . Now, suppose that $x\in i^{X}_{\delta,\delta+2\eta}(\sigma)$ and $x^{\prime}\in\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma)$ . This implies that $x\in\sigma$ , and there exists $a^{\prime}\in\sigma$ such that $x^{\prime}=\psi_{\delta+\eta}\circ\phi_{\delta}(a^{\prime})$ . We have

[TABLE]

Here, the first inequality holds because $C(\phi,\psi)\leq\eta$ , the second inequality holds because $\mathrm{dis}(\phi)\leq\eta$ and the third inequality holds because $x,a^{\prime}\in\sigma$ . Thus, we have shown that $\mathrm{diam}(i^{X}_{\delta,\delta+2\eta}(\sigma)\cup\psi_{\delta+\eta}\circ\phi_{\delta}(\sigma))\leq\delta+2\eta.$ This implies that the maps $i^{X}_{\delta,\delta+2\eta},~{}\psi_{\delta+\eta}\circ\phi_{\delta}:\mathrm{K}_{\delta}^{\mathrm{VR}}(X)\rightarrow\mathrm{K}_{\delta+2\eta}^{\mathrm{VR}}(X)$ are contiguous. Thus, we have that for every $k\in\mathbb{N}$ ,

[TABLE]

The last equality holds because $H_{k}(\ast,\mathbb{F}):\mathcal{S}\rightarrow\mathcal{V}(\mathbb{F})$ is a functor for every $k\in\mathbb{N}$ .

We can similarly show that the remaining three pairs of maps are also contiguous. Thus, we obtain that the four diagrams on $V_{\delta}^{X}$ and $V_{\delta}^{Y}$ also commute. Thus, we have shown that the persistence vector spaces $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X)$ and $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(Y)$ are $\eta$ -interleaved.

Now, let $d_{\mathrm{GH}}((X,d_{X}),(Y,d_{Y}))\leq\eta$ . This implies that there exists a $2\eta$ -isometry between $(X,d_{X})$ and $(Y,d_{Y})$ . By the above arguments, we obtain that $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X)$ and $H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(Y)$ are $2\eta$ -interleaved for all $k\in\mathbb{N}$ . Thus, $d_{\mathrm{I}}(H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X),H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(Y))\leq 2\eta$ . This implies that $2\cdot d_{\mathrm{GH}}((X,d_{X}),(Y,d_{Y}))\geq d_{\mathrm{I}}(H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(X),H_{k}\circ\mathrm{K}^{\mathrm{VR}}_{\bullet}(Y))$ , and proves the theorem. ∎

We recall that for $(X,d_{X})\in\mathcal{M}$ and $\delta\geq 0$ , the Čech complex is defined as

[TABLE]

Let $\check{\mathrm{C}}_{\bullet}(X)=\{\check{\mathrm{C}}_{\delta}(X)\subseteq\check{\mathrm{C}}_{\delta^{\prime}}(X)\}_{\delta\leq\delta^{\prime}}$ . Then, we have the following theorem.

Theorem 5.11 (Stability of Čech complex).

For all $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ , $k\in\mathbb{N}$ and $\eta\geq 0$ , if $X$ and $Y$ are $\eta$ -isometric, then the persistence vector spaces $H_{k}\circ\check{\mathrm{C}}_{\bullet}(X)$ and $H_{k}\circ\check{\mathrm{C}}_{\bullet}(Y)$ are $\eta$ -interleaved.

The proof of the above theorem is similar to that of Theorem 5.10. The next step is to define a notion of distance between persistence diagrams called the bottleneck distance.

5.3 Bottleneck Distance

We recall that given $n\in\mathbb{Z}_{+}$ and a field $\mathbb{F}$ , $P\mathcal{V}_{n}(\mathbb{F})$ denotes the category of pfd persistence vector spaces of length $n$ , and $\mathcal{D}$ denotes the collection of all persistence diagrams $\mathrm{dgm}(\mathbb{V})$ , $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ . In the last section, we showed that for every $\mathbb{V}\in P\mathcal{V}_{n}(\mathbb{F})$ , there exists a multiset of intervals $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ such that $\mathbb{V}\cong\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})$ . A trivial fact is the following.

Fact 5.12.

Given a multiset of intervals $\{\!\{(b_{i},d_{i})\}\!\}_{i\in I}$ , we can construct a persistence vector space $\mathbb{V}$ such that $\mathbb{V}\cong\bigoplus_{i\in I}\mathbb{I}(b_{i},d_{i})$ .

We now define the bottleneck distance on $\mathcal{D}$ . We recall that every element of $\mathcal{D}$ is a collection of finite multisets of points $(b,d)$ , where $0\leq b\leq d\leq\infty$ .

Definition 5.13 (Bottleneck distance).

Let $D=\{\!\{(b_{\alpha},d_{\alpha})~{}|~{}\alpha\in A\}\!\}$ and $D^{\prime}=\{\!\{(b^{\prime}_{\beta},d^{\prime}_{\beta})~{}|~{}\beta\in B\}\!\}$ be elements of $\mathcal{D}$ . A partial matching $m:A\rightarrow B$ is a bijection between a subset of $A$ and a subset of $B$ , which are then the domain and co-domain of $m$ respectively. Let $M(A,B)$ denote the set of all partial matchings between $A$ and $B$ . Given $m\in M(A,B)$ , the cost of $m$ is defined as follows:

[TABLE]

The bottleneck distance between $D$ and $D^{\prime}$ is then defined as

[TABLE]

The definition implies that the bottleneck distance is symmetric, non-negative and vanishes if $D=D^{\prime}$ . The next theorem shows that the bottleneck distance satisfies the triangle inequality.

Theorem 5.14 ([COGDS16]).

For any $D,D^{\prime},D^{\prime\prime}\in\mathcal{D}$ , we have

[TABLE]

We now have the following theorem.

Theorem 5.15 (Isometry Theorem[Les15]).

Given $n\in\mathbb{Z}_{+}$ and $\mathbb{V},\mathbb{W}\in\mathcal{V}_{n}(\mathbb{F})$ , we have

[TABLE]

A direct corollary of the above theorem is the following.

Corollary 5.16.

For all $(X,d_{X}),(Y,d_{Y})\in\mathcal{M}$ and $k\in\mathbb{N}$ ,

[TABLE]

It is known that while computing $d_{\mathrm{GH}}(X,Y)$ is NP-hard, there is a polynomial time algorithm [EH10] for computing $d_{\mathcal{B}}(\mathrm{dgm}_{k}^{\mathrm{VR}}(X),\mathrm{dgm}_{k}^{\mathrm{VR}}(Y))$ for all $k\in\mathbb{N}$ .

Furthermore, the inequality $2\,d_{\mathrm{GH}}(X,Y)\geq d_{\mathcal{B}}(\mathrm{dgm}_{k}^{\mathrm{VR}}(X),\mathrm{dgm}_{k}^{\mathrm{VR}}(Y))$ is tight. This is depicted by the following examples: for $\delta\geq 0$ , let $X_{\delta}=\{a,b\}$ be the metric space with $d(a,b)=1+\delta$ . Clearly, $d_{\mathrm{GH}}(X_{\delta},X_{0})=\frac{\delta}{2}$ . We also observe that $\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{\delta})=\{\!\{[0,1+\delta],[0,\infty)\}\!\}$ . Thus, $\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{0})=\{\!\{[0,1],[0,\infty)\}\!\}$ , and we obtain that $d_{\mathcal{B}}(\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{\delta}),\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{0}))=\delta$ . Therefore, we have

[TABLE]

Now consider the metric spaces $X$ and $X_{0}$ , where $X$ is the one point metric space and $X_{0}$ is as defined above. Then, $d_{\mathrm{GH}}(X,X_{0})=\frac{\mathrm{diam}(X_{0})}{2}=\frac{1}{2}$ . We have $\mathrm{dgm}_{0}^{\mathrm{VR}}(X)=\{\!\{[0,\infty)\}\!\}$ and $\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{0})=\{\!\{[0,1],[0,\infty)\}\!\}$ . Thus, we have $d_{\mathcal{B}}(\mathrm{dgm}_{0}^{\mathrm{VR}}(X),\mathrm{dgm}_{0}^{\mathrm{VR}}(X_{0}))=\frac{1}{2}$ . Therefore, we obtain

[TABLE]

6 Applications of Persistent Homology

In this section, we first describe two problems in neuroscience that have been studied using persistent homology. We state their problem definitions, the experimental procedures that generate a finite data set, the method used to construct an abstract simplicial complex from the data set, and finally the results obtained using persistent homology. In the third and fourth subsections, we describe further applications of persistent homology to biology as well as to other areas.

6.1 A Topological Paradigm for Hippocampal Spatial Map Formation using Persistent Homology

This subsection describes article [DMFC12] of the same title. The problem is that of identifying the topological features of an environment using the hippocampal activity of a rat moving in that environment. In every animal, the hippocampus is the region of the brain responsible for creating a mental map of the animal’s environment. This mental map is made possible by activity of the neurons in the hippocampus called place cells. As an animal explores a given environment, different place cells fire a series of action potentials in different, discrete regions of the environment. Each region, referred to as that cell’s place field, is defined by the pattern of neuronal firing, most intense at the center and attenuated towards the edges of the field. The cell remains silent when the animal is outside of the cell’s place field. Experiments on rats suggest that the information contained in place cell firing patterns encodes spatial navigation routes and somehow represents the spatial environment [BFT*+*98, MBO83, ZGMS98]. Now, suppose that spatial location is the primary determinant of each place cell’s firing. Then, co-firing of several place cells indicates that the corresponding place fields overlap, See Figure 4. Thus, the mental map formed by co-firing will be based on the properties of connectivity, adjacency and containment of place fields, and therefore will be a topological map of the environment.

A basic theorem of algebraic topology is the so called Nerve Theorem [Hat00] which we paraphrase here as: if a space $X$ is covered with a sufficient number of regions, then it is possible to reconstruct the topology of $X$ using the intersection information of the regions. This theorem and the assumption that the place fields cover the environment leads to the hypothesis that the overlaps between the place fields, as represented by temporal overlap of spike trains (an ordered list of times at which a place cell fires) provide a connectivity map that retains the topological features of the environment. Thus, the authors of [DMFC12] investigate whether a topological connectivity map can be effectively and reliably derived from neuronal spiking patterns using computational tools in the field of algebraic topology.

We now briefly describe the details of the experiments performed in [DMFC12]. The authors simulated map formation times (minimal time required to produce the correct topological signature of an environment) using different place cell parameters and three separate planar $2\times 2$ meter areas with $1$ or $2$ holes. The place cell parameters are the firing rates, the place field sizes and the number of place cells. The firing rates and the place field sizes are described by $\log$ -normal distributions, with $\overline{f}$ and $\overline{s}$ being the respective peak values, and $1.2\overline{f}$ and $1.7\overline{s}$ being the respective standard deviations. The question was for which parameters the place cell spiking signals would be able to produce a temporal simplicial complex with the correct number of topological loops, or Betti numbers, in every dimension. The authors probed ten distributions of firing rates, with $\overline{f}$ ranging from $2$ to $40$ Hz, and ten distributions of place field sizes, with $\overline{s}$ ranging from $10$ to $90$ cm. The number of place cells varied independently from $N=50$ to $N=400$ . In each case, the centers of the place fields were scattered randomly and uniformly over the environment. For each combination of the parameters, $\overline{f},\overline{s}$ and $N$ , the computation was repeated $10$ times, through which the authors computed the average time $T_{min}$ required for the emergence of correct topological features for each specific choice of ensemble parameters $\overline{f},\overline{s}$ and $N$ . The authors fix the simulated trajectory, but choose a new set of place field centers for each set of $\overline{f},\overline{s},N$ for each repetition.

We now describe the mechanism of generating a filtered simplicial complex from the experimental data. Let $\{PF_{1},PF_{2},\ldots,PF_{n}\}$ , $n\in\mathbb{N}$ be a set of place fields with specified shapes and locations. A simplicial complex $S$ with vertex set $\{v_{1},v_{2},\ldots,v_{n}\}$ , one $v_{i}$ for each place field, can be constructed as follows: given $k\in\mathbb{N}$ , a simplex $[v_{i_{1}},v_{i_{2}},\ldots,v_{i_{k}}]\in S$ if $PF_{i_{1}}\cap PF_{i_{2}}\cap\ldots\cap PF_{i_{k}}\neq\emptyset$ . We recall that this coincides with the Čech complex, also known as the nerve complex. The Nerve Theorem [Hat00] states that if there is a space $X$ such that $X=\cup_{i=1}^{n}PF_{i}$ and each finite intersection of the place fields is contractible, then under fairly general conditions, the nerve complex $S$ has the same homotopy type as the underlying space $X$ , and so the topological invariants computed from $S$ will agree with those corresponding to $X$ . We saw that the experimental data does not consist of the place fields, but of the spike trains of the place cells. Thus, an overlap of the place fields is identified by co-firing of the corresponding place cells. Let $\{c_{1},c_{2},\ldots,c_{n}\}$ denote the place cells corresponding to the place fields $\{PF_{1},\ldots,PF_{n}\}$ respectively, and let $\{s_{1},s_{2},\ldots,s_{n}\}$ denote the corresponding spike trains. We recall that for $i\in[1:n]$ , a spike train $s_{i}$ is an ordered list of times at which the place cell $c_{i}$ fires. We fix an $\epsilon>0$ and an $m\in\mathbb{N}$ . We define a filtered simplicial complex as follows: given a simplex $\sigma=[i_{1},i_{2},\ldots,i_{k}]$ , we define a function $f$ on $\sigma$ as

[TABLE]

By definition, we have $f(\sigma)\leq f(\tau)$ if $\sigma\subseteq\tau$ . Thus, we start with an empty simplicial complex, and then add simplices to this complex, according to the values of the simplices on the function $f$ . The homology functor $H_{k}(\ast,\mathbb{Z}_{2})$ is applied to this filtered simplicial complex, for $k=0,1$ . For each $k=0,1$ , this produces a persistence vector space, and thus a barcode. Barcodes are used to determine the first two Betti numbers, $\beta_{0}$ and $\beta_{1}$ . We recall that $\beta_{0}$ tells the number of connected components, and $\beta_{1}$ tells the number of $1$ -dimensional holes. The software used to analyze the data is jPLEX [SVJ08], a collection of MATLAB functions for computational topology that implements the concepts described above.

The results obtained in [DMFC12] and their interpretation are depicted in Figure 5. The authors observed that the place cell parameters of firing rate and place field size for which a reliable topological map of the environment is produced correspond well with experimentally observed place cell firing rates and place field sizes. Thus, the fact that these parameters fall into the biological range lends support to this topological paradigm.

6.2 Topological Analysis of Population Activity in Visual Cortex

This subsection describes article [SMI*+*08] of the same title. This work studies some basic aspects of the patterns of activity in the primary visual cortex (V1) evoked by natural images and during spontaneous activity. The authors focus on a topological characterization of population activity in visual cortex. The reason behind this approach is the following: it has been observed that spontaneous cortical states tend to reproduce the patterns evoked by oriented stimuli [KBT*+*03]. Now, if cortical activity is restricted to patterns evoked by an oriented stimulus, then considering that orientation is a circular variable, this leads to the hypothesis that the activity patterns of the cortical cells must have a topological structure equivalent to that of a circle. This implies that the basic question about the structure of the cortical activity data is topological in nature. This work offers the first estimate of the underlying topological structure of V1 activity.

We now describe the experimental procedures adopted in [SMI*+*08]. The authors first validate their method on simulated data by recovering the topological structure of data sets where the “ground truth” is known. The validation is done for a circle as well as torus. We refer the readers to the original paper [SMI*+*08] for details of the validation methods. The experimental studies were performed on three old-world monkeys (Macaca fascicularis), See Figure 6. The database considered in this study was obtained using micro-machined electrode arrays consisting of a square grid of $10\times 10$ electrodes $1.5$ mm long. The distance between neighboring electrodes was $400$ $\mu$ m. Spike sorting was performed online using principal component analysis on the waveform shapes. In the spontaneous condition, the eyes were covered. The stimuli in the evoked condition were image sequences generated by digitally sampling commercially available videotapes in VHS format. The selected movies included both man-made and natural landscape scenes, and $6$ segments of $30$ seconds duration were shown.

We now describe how the data points were generated from the experiment described above. The preparation of the data points for both the spontaneous and driven activity during natural image simulation was identical. After spike-sorting signals from each electrode, the authors sub-selected a group of $5$ neurons that showed the highest firing rates. Then, a point cloud was generated by binning spikes in $50$ ms windows. The spontaneous and evoked activity segments were collected in lengths of 10 s each. Thus, each of these segments contain $200$ points living in $\mathbb{R}^{5}$ , each neuron corresponding to a dimension. The statistical package PLEX was used with a weak witness complex construction which will be explained in the next paragraph. PLEX is a MATLAB collection of functions for computational topology. The authors recorded the maximal length of persistence intervals in the $1$ -dimensional and $2$ -dimensional barcodes.

We now describe the weak witness complex construction [SMI*+*08]. Given a finite metric space $(X,d_{X})$ , a set of points $L\subset X$ called the landmark set, and $\epsilon>0$ , a point $x\in X$ is called an $\epsilon$ -witness for a $k+1$ -tuple $\{l_{0},l_{1},\ldots,l_{k}\}$ of points in $L$ if $\max_{i}d_{X}(x,l_{i})\leq\epsilon+m_{x}$ , where $m_{x}$ denotes the $k+1$ smallest value of $d_{X}(x,l)$ as $l$ varies over all of $L$ . Now, a simplicial complex $W_{\epsilon}(X,L)$ is associated to $X,L$ and $\epsilon$ by fixing the vertex set of $W_{\epsilon}(X,L)$ to be $L$ , and declaring that a collection $\{l_{0},l_{1},\ldots,l_{k}\}$ spans a $k$ -simplex in $W_{\epsilon}(X,L)$ if and only if there is an $\epsilon$ -witness in $X$ for the collection $\{l_{0},l_{1},\ldots,l_{k}\}$ and for all its faces. Clearly, if there is an $\epsilon$ -witness for the simplex $\sigma=\{l_{0},l_{1},\ldots,l_{k}\}$ , then there is an $\epsilon^{\prime}$ -witness for $\sigma$ , $\epsilon^{\prime}\geq\epsilon$ . Thus, we obtain that for $\epsilon\leq\epsilon^{\prime}$ , $W_{\epsilon}(X,L)\subseteq W_{\epsilon^{\prime}}(X,L)$ and this results in a filtered simplicial complex. In [SMI*+*08], out of the $200$ data points in $\mathbb{R}^{5}$ , a landmark set of $35$ points is chosen by the max-min procedure as follows: first a random point, say $x_{1}$ from $X$ is picked. Then, the point $x_{2}$ is chosen such that $d_{X}(x_{1},x_{2})$ is maximized. The point $x_{3}$ is chosen such that $d_{X}(x_{3},\{x_{1},x_{2}\})$ is maximized, and so on. The weak witness construction was used because, unlike the Vietoris-Rips simplicial complexes, the construction of weak witness simplicial complexes for large data sets is much more computationally tractable.

We now describe the results obtained in [SMI*+*08]. In Figure 7, different topological signatures observed in $10$ s segments of the data labeled by the first three Betti numbers $(b_{0},b_{1},b_{2})$ are illustrated. Each row of Figure 7 represents a different “threshold” for the length of the interval of the signature (in the barcode) as a fraction of the covering radius of the data. The covering radius is defined as $R_{0}=\max_{x\in X}\min_{l\in L}d_{X}(x,l)$ , where $X$ is the data set and $L$ is the set of landmarks. Larger thresholds represent instances where the signature was long-lived and likely to represent a salient feature of the data.

6.3 Further Applications to Biology

This paragraph describes some more applications of persistent homology to neuroscience. In [BSH0], the authors propose a method based on persistent homology to automatically classify neuronal network dynamics using topological features of spaces built from spike-train distances. The dynamics of a neuronal network are believed to be indicative of the computations it can perform, and thus, understanding the neuronal network dynamics enables understanding of how neuronal networks perform computations and process information. The paper [CDM18] is an extension to [DMFC12], wherein the authors use the concept of zig-zag persistent homology [CdS10, CdSM09] to account for the possibility of forgetting information in the model for memory. The results obtained in [CDM18] show that in order to achieve the best possible results in “learning” an arena, the rodent needs a balance between remembering and forgetting information. These results are in accordance with recent findings in neuroscience, where it has been proposed that forgetting is an important step in the learning process. The work by Giusti et. al. in [GGB16] explores the method of persistent homology over the traditional graph-theoretic methods, for understanding neural data.

In [XW14], persistent homology is used for the first time for protein characterization, identification and classification. The authors extracted molecular topological fingerprints based on the persistence of molecular topological invariants. In [ESR16], persistent homology is used to characterize the complex structure of chromatin inside cell nucleus. The authors apply persistent homology to human cell line data and show how this method captures complex multiscale folding methods.

In [CCR13], persistent homology is used to study evolutionary events. The authors consider a set of genomes and calculate the genetic distance between each pair of sequences. Using these distances, they calculate the homology groups across all genetic distances $\epsilon$ in different dimensions. They observe that the zero-dimensional homology provides information about vertical evolution, i.e. at a particular $\epsilon$ , the Betti number $b_{0}$ represents the number of different strains or subclades. The one-dimensional homology provides information about horizontal evolution since reticulate events (merging of different clades to form a new hybrid lineage) are represented by loops in phylogenetic networks. Some examples of reticulate events include recombination and reassortment of genomes. The genomic datasets used are those of influenza strains, HIV, rabies, dengue, flaviviruses, West Nile virus and Newcastle virus. In a follow-up paper [CLR16], persistent homology is used to study the specific evolutionary event of recombination. In [CCR13], the relation between persistent homology and explicit evolutionary histories incorporating recombination events was not studied. Therefore, in [CLR16], persistent homology is applied on appropriate genomic sets in order to characterize the genomic regions where recombination takes place and identify the gametes involved in particular recombination events. The persistent homology barcodes derived from each of these sets are structured as a “barcode ensemble” where each bar captures a recombination event. A software called TARGet is developed that generates a graph in polynomial time, capturing ensembles of minimal recombination histories. The evolutionary event of recombination has been further studied in [LRR18] where the authors introduce “novelty profiles” of evolutionary histories. The novelty profile of an evolutionary history is a list of $k$ monotonically decreasing numbers, where $k$ is the number of recombination events in the history and each number roughly measures the contribution every recombination makes to the genetic diversity in the population. Persistent homology of sampled data is used to obtain information about a novelty profile. The authors of [LRR18] provide mathematical foundation for several works that have used persistent homology to study recombination. Some other articles showing the use of persistent homology for studying recombination events are [ER14, CRE*+*16, ER16].

In a different direction, another topological method for studying finite metric spaces is Mapper [SMC07]. It is a computational method for extracting simple descriptions of high dimensional data sets in the form of simplicial complexes. This method has been widely used for analysis of biological data sets as seen in [NLC11, YSH*+*09, LWS*+*17, dNG*+*15, TOTT*+*16, OHC*+*18, SSGC*+*18, FPT*+*18, RFH*+*14, BYCH*+*12, PPIM*+*18, KPC*+*15, STGM*+*14, Cám17, PIP17, SNM17].

6.4 Applications to Other Domains

Persistent homology has also been used for shape classification. The authors of [CCSG*+*09] use persistent homology to identify signatures of finite metric spaces that are stable under the Gromov-Hausdorff distance. The signatures are nothing but metric invariants obtained using persistent homology along with attributes of the metric spaces like diameter and eccentricity. These signatures are computed and then used to measure the degree of dissimilarity of a pair of metric spaces. The authors adapt this method to compare shapes, by first uniformly sampling points from each shape to generate a finite metric space and then comparing the finite metric spaces using the identified signatures.

In this paragraph, we provide two examples where persistent homology has been used for studying chemical compounds. In [XFTW15], persistent homology is used for studying fullerenes, which are special molecules consisting of only carbon atoms. Here, the point cloud is given by the atoms of the fullerenes, and a Vietoris-Rips filtration is constructed by the usual process of assigning radii to the point cloud. The authors thus study the stability of the fullerene molecules by observing that the total curvature energies of the fullerene isomers can be well represented with the lengths of their long-lived Betti $2$ -bars. In [LBD*+*18], persistent homology is used to build a descriptor for identifying and comparing zeolites, according to their pore shapes. Zeolites are nanoporous materials made of silica. The authors performed high-throughput screening of zeolites based on this descriptor and identified best zeolites for methane storage and carbon capture applications. The results obtained in [LBD*+*18] match the existing results on top-performing zeolites for these applications.

7 Software Packages for Persistent Homology

There are various open source softwares available for computing persistent homology. These are available in R, Python, C $++$ , Java as well as in MATLAB. The softwares are Perseus [Nan], PHAT [BKR12], DIPHA [Ren], CTL [Lew14], Ripser [Bau15], TDA [FKL*+*], javaPlex [TVJA14], Dionysus [Mor], Gudhi [gud14], TDAstats [WDWS18], Scikit-TDA [NS19] and the Topology Toolkit [TFL*+*17].

Bibliography69

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Azu 50] G. Azumaya. Corrections and supplementaries to my paper concerning krull-remak-schmidt’s theorem. Nagoya Mathematical Journal , 1:117–124, 1950.
2[Bau 15] U. Bauer. Ripser. https://github.com/Ripser/ripser , 2015.
3[BBI 01] D. Burago, Y. Burago, and S. Ivanov. A Course in Metric Geometry , volume 33 of AMS Graduate Studies in Math. American Mathematical Society, 2001.
4[BFT + 98] E. N. Brown, L. M. Frank, D. Tang, M. C. Quirk, and M. A. Wilson. A statistical paradigm for neural spike train decoding applied to position prediction from ensemble firing patterns of rat hippocampal place cells. Journal of Neuroscience , 18(18):7411–7425, 1998.
5[BKR 12] U. Bauer, M. Kerber, and J. Reininghaus. PHAT (Persistent Homology Algorithm Toolbox). https://bitbucket.org/phat-code/phat , 2012.
6[BSH 0] J. Bardin, G. Spreemann, and K. Hess. Topological exploration of artificial neuronal network dynamics. Network Neuroscience , 0(0):1–19, 0.
7[BYCH + 12] C. W. Bartlett, S. Yeon Cheong, L. Hou, J. Paquette, P. Yee Lum, G. Jäger, F. Battke, C. Vehlow, J. Heinrich, K. Nieselt, R. Sakai, J. Aerts, and W. C. Ray. An eqtl biological data visualization challenge and approaches from the visualization community. BMC Bioinformatics , 13(8):S 8, May 2012.
8[Cám 17] P. G. Cámara. Topological methods for genomics: Present and future directions. Current Opinion in Systems Biology , 1:95 – 101, 2017. Future of Systems Biology • Genomics and epigenomics.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Code & Models

Videos

A Primer on Persistent Homology of Finite Metric Spaces

1 Introduction

Organization.

Acknowledgements.

Contents

2 Clustering

Definition 2.1** (Clustering Method).**

Example 2.2

Example 2.3

Definition 2.4** (Category of Finite Metric Spaces).**

Definition 2.5** (Category of Partitions of Finite Sets ).**

Example 2.6** **(Vietoris-Rips clustering functor)

Theorem 2.7** ([CM13, Theorem 6.4]).**

2.1 Hierarchical Clustering

Definition 2.8** (Dendrogram).**

Definition 2.9** (Hierarchical Clustering).**

Definition 2.10** (Ultra-metric).**

Definition 2.11** (Ultra-metric induced by CVR\mathfrak{C}^{\mathrm{VR}}CVR).**

3 Simplicial Homology

Definition 3.1** (Simplicial Complex).**

Definition 3.2** (Subcomplex).**

Definition 3.3** (Simplex of a complex).**

Definition 3.4** (Face of a simplex).**

Definition 3.5** (Dimension of a complex).**

Definition 3.6** (Vertices of a complex).**

Definition 3.7** (nnn-skeleton of a complex).**

Definition 3.8** (Connected component).**

Definition 3.9** (Simplicial map).**

Definition 3.10** (Quotient vector space).**

Definition 3.11** (Isomorphic vector spaces).**

Definition 3.12** (Chain Complex).**

Lemma 3.13**.**

Proof.

Definition 3.14** (nnn-Cycle and nnn-Boundary).**

Definition 3.15** (Simplicial Homology).**

Definition 3.16** (Betti numbers).**

Lemma 3.17**.**

Proof.

Proposition 3.18**.**

Proof.

Corollary 3.19**.**

Definition 3.20** (Contiguous Simplicial Maps).**

Lemma 3.21** ([Mun96]).**

4 Persistent Homology

Definition 4.1** (Filtered simplicial complex).**

Definition 4.2** (Vietoris-Rips Complex).**

Proposition 4.3**.**

Definition 4.4** (Čech Complex).**

Definition 4.5** (Persistence Vector Space[Car14]).**

Definition 4.6** (Morphisms of Persistence Vector Spaces[Car14]).**

Definition 4.7** (Sampling map).**

Persistence diagrams of persistence vector spaces of length nnn.

Example 4.8

Definition 4.9** (Interval persistence vector space).**

Theorem 4.10** ([CB12]).**

Theorem 4.11** (Krull-Remak-Schmidt-Azumaya [Azu50]).**

Definition 4.12** (Persistence Diagram).**

Corollary 4.13**.**

Vietoris-Rips persistence diagrams of finite metric spaces.

Example 4.14

Definition 4.15** (kkk-th Vietoris-Rips Persistence Diagram).**

Example 4.16

Example 4.17

4.1 Interpretation of Clustering via 0-Dimensional Persistence Diagram

Proposition 4.18**.**

Proposition 4.19**.**

Proof.

Claim 4.20**.**

Proof of Claim.

5 Stability of Invariants

5.1 Gromov-Hausdorff Distance

Definition 5.1** (Isometry).**

Definition 5.2** (Distortion).**

Definition 2.1 (Clustering Method).

Definition 2.4 (Category of Finite Metric Spaces).

Definition 2.5 (Category of Partitions of Finite Sets ).

Example 2.6 (Vietoris-Rips clustering functor)

Theorem 2.7 ([CM13, Theorem 6.4]).

Definition 2.8 (Dendrogram).

Definition 2.9 (Hierarchical Clustering).

Definition 2.10 (Ultra-metric).

Definition 2.11 (Ultra-metric induced by $\mathfrak{C}^{\mathrm{VR}}$ ).

Definition 3.1 (Simplicial Complex).

Definition 3.2 (Subcomplex).

Definition 3.3 (Simplex of a complex).

Definition 3.4 (Face of a simplex).

Definition 3.5 (Dimension of a complex).

Definition 3.6 (Vertices of a complex).

Definition 3.7 ( $n$ -skeleton of a complex).

Definition 3.8 (Connected component).

Definition 3.9 (Simplicial map).

Definition 3.10 (Quotient vector space).

Definition 3.11 (Isomorphic vector spaces).

Definition 3.12 (Chain Complex).

Lemma 3.13.

Definition 3.14 ( $n$ -Cycle and $n$ -Boundary).

Definition 3.15 (Simplicial Homology).

Definition 3.16 (Betti numbers).

Lemma 3.17.

Proposition 3.18.

Corollary 3.19.

Definition 3.20 (Contiguous Simplicial Maps).

Lemma 3.21 ([Mun96]).

Definition 4.1 (Filtered simplicial complex).

Definition 4.2 (Vietoris-Rips Complex).

Proposition 4.3.

Definition 4.4 (Čech Complex).

Definition 4.5 (Persistence Vector Space[Car14]).

Definition 4.6 (Morphisms of Persistence Vector Spaces[Car14]).

Definition 4.7 (Sampling map).

Persistence diagrams of persistence vector spaces of length $n$ .

Definition 4.9 (Interval persistence vector space).

Theorem 4.10 ([CB12]).

Theorem 4.11 (Krull-Remak-Schmidt-Azumaya [Azu50]).

Definition 4.12 (Persistence Diagram).

Corollary 4.13.

Definition 4.15 ( $k$ -th Vietoris-Rips Persistence Diagram).

Proposition 4.18.

Proposition 4.19.

Claim 4.20.

Definition 5.1 (Isometry).

Definition 5.2 (Distortion).

Definition 5.3 (Co-distortion).

Definition 5.4 ( $\eta$ -isometry).

Definition 5.5 (Gromov-Hausdorff distance).

Theorem 5.6 ([BBI01, Theorem 7.3.30]).

Definition 5.7 ( $\eta$ -interleaving).

Definition 5.8 (Interleaving distance).

Proposition 5.9 ([COGDS16]).

Theorem 5.10 (Stability of Vietoris-Rips persistent homology).

Theorem 5.11 (Stability of Čech complex).

Fact 5.12.

Definition 5.13 (Bottleneck distance).

Theorem 5.14 ([COGDS16]).

Theorem 5.15 (Isometry Theorem[Les15]).

Corollary 5.16.