Persistent homology detects curvature

Peter Bubenik; Michael Hull; Dhruv Patel; and Benjamin Whittle

arXiv:1905.13196·cs.CG·February 19, 2025

Persistent homology detects curvature

Peter Bubenik, Michael Hull, Dhruv Patel, and Benjamin Whittle

PDF

TL;DR

This paper demonstrates that persistent homology, traditionally viewed as capturing topological features, also encodes geometric information such as curvature, by analyzing sampled points from curved disks.

Contribution

It provides theoretical evidence that short persistent homology intervals contain geometric information and introduces a computational framework for inverse problems using average persistence landscapes.

Findings

01

Persistent homology detects curvature of sampled disks.

02

Short intervals encode geometric, not just noise.

03

A framework for learning curvature from persistence landscapes.

Abstract

In topological data analysis, persistent homology is used to study the "shape of data". Persistent homology computations are completely characterized by a set of intervals called a bar code. It is often said that the long intervals represent the "topological signal" and the short intervals represent "noise". We give evidence to dispute this thesis, showing that the short intervals encode geometric information. Specifically, we prove that persistent homology detects the curvature of disks from which points have been sampled. We describe a general computational framework for solving inverse problems using the average persistence landscape, a continuous mapping from metric spaces with a probability measure to a Hilbert space. In the present application, the average persistence landscapes of points sampled from disks of constant curvature results in a path in this Hilbert space which may be…

Tables2

Table 1. Table 1. The root mean squared errors of the estimated curvature using pairwise distances.

		$H_{0}$	$H_{1}$	$H_{0}$ -and- $H_{1}$
Supervised learning	Nearest neighbors	0.032	0.070	0.056
	Support Vector Regression	0.027	0.038	0.017
Unsupervised learning	First Principal Component	0.091	0.139	0.128

Table 2. Table 2. The root mean squared errors of the estimated curvatures upon replacing distances with their ordinal numbers.

		$H_{0}$	$H_{1}$	$H_{0}$ -and- $H_{1}$
Supervised learning	Nearest neighbors	0.631	0.260	0.262
	Support Vector Regression	0.541	0.171	0.171
Unsupervised learning	First Principal Component	0.615	0.393	0.392

Equations55

i = 0 ⋂ p B_{t} (x_{i}) \neq = \emptyset,

i = 0 ⋂ p B_{t} (x_{i}) \neq = \emptyset,

λ : N \times R \to R : (k, t) \mapsto sup {m \geq 0 : β_{t - m}^{t + m} \geq k} .

λ : N \times R \to R : (k, t) \mapsto sup {m \geq 0 : β_{t - m}^{t + m} \geq k} .

(λ (1, a), λ (1, a + δ), \dots, λ (1, a + m δ), λ (2, a), λ (2, a + δ), \dots, λ (N, a + m δ)),

(λ (1, a), λ (1, a + δ), \dots, λ (1, a + m δ), λ (2, a), λ (2, a + δ), \dots, λ (N, a + m δ)),

d s^{2} = \frac{4 ( d x ^{2} + d y ^{2} )}{( 1 - \frac{x ^{2} + y ^{2}}{R ^{2}} ) ^{2}} .

d s^{2} = \frac{4 ( d x ^{2} + d y ^{2} )}{( 1 - \frac{x ^{2} + y ^{2}}{R ^{2}} ) ^{2}} .

A (r) = ⎩ ⎨ ⎧ \frac{4 π}{- K} sinh^{2} (\frac{r - K}{2}) π r^{2} \frac{4 π}{K} sin^{2} (\frac{r K}{2}) if K < 0 if K = 0 if K > 0.

A (r) = ⎩ ⎨ ⎧ \frac{4 π}{- K} sinh^{2} (\frac{r - K}{2}) π r^{2} \frac{4 π}{K} sin^{2} (\frac{r K}{2}) if K < 0 if K = 0 if K > 0.

\frac{s in ( α )}{a} = \frac{s in ( β )}{b} = \frac{s in ( γ )}{c} .

\frac{s in ( α )}{a} = \frac{s in ( β )}{b} = \frac{s in ( γ )}{c} .

\frac{K s in ( α )}{s in ( a K )} = \frac{K s in ( β )}{s in ( b K )} = \frac{K s in ( γ )}{s in ( c K )} .

\frac{K s in ( α )}{s in ( a K )} = \frac{K s in ( β )}{s in ( b K )} = \frac{K s in ( γ )}{s in ( c K )} .

\frac{- K s in ( α )}{s inh ( a - K )} = \frac{- K s in ( β )}{s inh ( b - K )} = \frac{- K s in ( γ )}{s inh ( c - K )} .

\frac{- K s in ( α )}{s inh ( a - K )} = \frac{- K s in ( β )}{s inh ( b - K )} = \frac{- K s in ( γ )}{s inh ( c - K )} .

c^{2} = a^{2} + b^{2} + ab cos (γ)

c^{2} = a^{2} + b^{2} + ab cos (γ)

cos (c K) = cos (a K) cos (b K) + sin (a K) sin (b K) cos (γ)

cos (c K) = cos (a K) cos (b K) + sin (a K) sin (b K) cos (γ)

cosh (c - K) = cosh (a - K) cosh (b - K) - sinh (a - K) sinh (b - K) cos (γ) .

cosh (c - K) = cosh (a - K) cosh (b - K) - sinh (a - K) sinh (b - K) cos (γ) .

\frac{1}{2} ∥ w ∥^{2} + C i = 1 \sum N (ζ_{1, i} + ζ_{2, i})

\frac{1}{2} ∥ w ∥^{2} + C i = 1 \sum N (ζ_{1, i} + ζ_{2, i})

⎩ ⎨ ⎧ y_{i} - (⟨ w, x_{i} ⟩ + b) \leq ε + ζ_{1, i} (⟨ w, x_{i} ⟩ + b) - y_{i} \leq ε + ζ_{2, i} ζ_{1, i}, ζ_{2, i} \geq 0

L_{ε - ins} = {0 ∣ y_{i} - f (x_{i}) ∣ - ε if ∣ y_{i} - f (x_{i}) ∣ \leq ε otherwise.

L_{ε - ins} = {0 ∣ y_{i} - f (x_{i}) ∣ - ε if ∣ y_{i} - f (x_{i}) ∣ \leq ε otherwise.

L_{τ - pin} = {(τ - 1) (y_{i} - f (x_{i})) τ (y_{i} - f (x_{i})) if y_{i} < f (x_{i}) if y_{i} \geq f (x_{i}),

L_{τ - pin} = {(τ - 1) (y_{i} - f (x_{i})) τ (y_{i} - f (x_{i})) if y_{i} < f (x_{i}) if y_{i} \geq f (x_{i}),

b (T) = min {r ∣ B_{r} (X) \cap B_{r} (Y) \neq = \emptyset \forall X, Y \in {A, B, C}} .

b (T) = min {r ∣ B_{r} (X) \cap B_{r} (Y) \neq = \emptyset \forall X, Y \in {A, B, C}} .

d (T) = min {r ∣ B_{r} (A) \cap B_{r} (B) \cap B_{r} (C) \neq = \emptyset} .

d (T) = min {r ∣ B_{r} (A) \cap B_{r} (B) \cap B_{r} (C) \neq = \emptyset} .

p(T_{K,a})=\begin{cases}\dfrac{2}{a\sqrt{-K}}\sinh^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sinh\bigg{(}\dfrac{a\sqrt{-K}}{2}\bigg{)}\bigg{)}&\text{if }K<0\\ \dfrac{2}{\sqrt{3}}&\text{if }K=0\\ \dfrac{2}{a\sqrt{K}}\sin^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sin\bigg{(}\dfrac{a\sqrt{K}}{2}\bigg{)}\bigg{)}&\text{if }K>0.\end{cases}

p(T_{K,a})=\begin{cases}\dfrac{2}{a\sqrt{-K}}\sinh^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sinh\bigg{(}\dfrac{a\sqrt{-K}}{2}\bigg{)}\bigg{)}&\text{if }K<0\\ \dfrac{2}{\sqrt{3}}&\text{if }K=0\\ \dfrac{2}{a\sqrt{K}}\sin^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sin\bigg{(}\dfrac{a\sqrt{K}}{2}\bigg{)}\bigg{)}&\text{if }K>0.\end{cases}

\frac{d ( T )}{b ( T )} = \frac{d ( A , P )}{d ( A , M )} = \frac{sin ( ∠ A M P )}{sin ( ∠ A P M} = \frac{sin π /2}{sin π /3} = \frac{2}{3} .

\frac{d ( T )}{b ( T )} = \frac{d ( A , P )}{d ( A , M )} = \frac{sin ( ∠ A M P )}{sin ( ∠ A P M} = \frac{sin π /2}{sin π /3} = \frac{2}{3} .

\frac{sin ( d ( T ) K )}{sin ( b ( T ) K )} = \frac{2}{3} .

\frac{sin ( d ( T ) K )}{sin ( b ( T ) K )} = \frac{2}{3} .

d(T)=\dfrac{1}{\sqrt{K}}\sin^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sin\bigg{(}\dfrac{a\sqrt{K}}{2}\bigg{)}\bigg{)}.

d(T)=\dfrac{1}{\sqrt{K}}\sin^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sin\bigg{(}\dfrac{a\sqrt{K}}{2}\bigg{)}\bigg{)}.

d(T)=\dfrac{1}{\sqrt{-K}}\sinh^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sinh\bigg{(}\dfrac{a\sqrt{-K}}{2}\bigg{)}\bigg{)}.\qed

d(T)=\dfrac{1}{\sqrt{-K}}\sinh^{-1}\bigg{(}\dfrac{2}{\sqrt{3}}\sinh\bigg{(}\dfrac{a\sqrt{-K}}{2}\bigg{)}\bigg{)}.\qed

F (r) = \frac{π r ^{2}}{π 1 ^{2}} = r^{2}

F (r) = \frac{π r ^{2}}{π 1 ^{2}} = r^{2}

r = F^{- 1} (u) = u .

r = F^{- 1} (u) = u .

F (r) = \frac{\frac{4 π}{K} sin ^{2} ( \frac{r K}{2} )}{\frac{4 π}{K} sin ^{2} ( \frac{1 K}{2} )},

F (r) = \frac{\frac{4 π}{K} sin ^{2} ( \frac{r K}{2} )}{\frac{4 π}{K} sin ^{2} ( \frac{1 K}{2} )},

r = F^{- 1} (u) = \frac{2}{K} sin^{- 1} (u sin (\frac{K}{2})) .

r = F^{- 1} (u) = \frac{2}{K} sin^{- 1} (u sin (\frac{K}{2})) .

F (r) = \frac{\frac{4 π}{- K} sinh ^{2} ( \frac{r - K}{2} )}{\frac{4 π}{- K} sinh ^{2} ( \frac{1 - K}{2} )}

F (r) = \frac{\frac{4 π}{- K} sinh ^{2} ( \frac{r - K}{2} )}{\frac{4 π}{- K} sinh ^{2} ( \frac{1 - K}{2} )}

r = F^{- 1} (u) = \frac{2}{- K} sinh^{- 1} (u sinh (\frac{- K}{2})) .

r = F^{- 1} (u) = \frac{2}{- K} sinh^{- 1} (u sinh (\frac{- K}{2})) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Persistent homology detects curvature

Peter Bubenik

Department of Mathematics, University of Florida

[email protected] https://people.clas.ufl.edu/peterbubenik/ ,

Michael Hull

Department of Mathematics & Statistics, University of North Carolina at Greensboro

[email protected] https://mathstats.uncg.edu/people/directory/michael-hull/ ,

Dhruv Patel

Department of Statistics, University of North Carolina – Chapel Hill

[email protected]

and

Benjamin Whittle

Abstract.

In topological data analysis, persistent homology is used to study the “shape of data”. Persistent homology computations are completely characterized by a set of intervals called a bar code. It is often said that the long intervals represent the “topological signal” and the short intervals represent “noise”. We give evidence to dispute this thesis, showing that the short intervals encode geometric information. Specifically, we prove that persistent homology detects the curvature of disks from which points have been sampled. We describe a general computational framework for solving inverse problems using the average persistence landscape, a continuous mapping from metric spaces with a probability measure to a Hilbert space. In the present application, the average persistence landscapes of points sampled from disks of constant curvature results in a path in this Hilbert space which may be learned using standard tools from statistical and machine learning.

Key words and phrases:

topological data analysis, persistent homology, average persistence landscape

2010 Mathematics Subject Classification:

55N99

1. Introduction

Persistent homology is an important tool of topological data analysis (TDA). A goal of TDA is to summarize and learn from the “shape of data”. Often this “shape” is interpreted as the topological structure, such as the number of connected components and other homological features such as holes and voids. However, persistent homology is also sensitive to geometry.

The result of a persistent homology computation may be summarized as a set of intervals called a bar code or a set of points $(x,y)$ with $x<y$ called a persistence diagram. These give the parameter values for which a homological feature persists. In either case, one hopes to use this summary to make inferences on the underlying object from which the data has been sampled. An oft-repeated philosophy is that the long intervals in the bar code or the points distant to the diagonal in the persistence diagram represent the “topological signal” while the short intervals or the points close to the diagonal represent “noise”.

However, TDA has been used to understand geometric structures in many applications, such as: force networks in particulate systems [29, 27]; protein compressibility [19]; fullerene molecules [38]; amorphous solids [22]; the dynamics of flow patterns [30]; phase transitions [16]; sphere packing and colloids [34]; brain arteries [4]; craze formation in glassy polymers [23]; branching neuronal morphologies [25]; and pores in rocks [24].

In these examples, the relevant geometry is the local embedding of the underlying object or the local spatial arrangement of the analyzed object.

Here we will consider the curvature of the underlying object. We will prove that the short intervals in the bar code can be used to infer the curvature of the underlying object that has been sampled. Furthermore, we will present a general framework for solving inverse problems using a continuous mapping of bar codes or persistence diagrams to a Hilbert space, called the average persistence landscape [6, 11]. We will apply this framework to learning curvature.

1.1. Theoretical results: short bars detect geometry

Let $D_{K}$ denote the unit disk in the surface of constant curvature $K$ , with $K\in[-2,2]$ . For $K=0$ , $K=1$ , and $K=-1$ , these surfaces are the Euclidean plane, the unit sphere, and the hyperbolic plane. All of these disks are contractible, so their reduced singular homology is trivial, and thus homology is unable to distinguish between them. In fact, the spaces are homeomorphic. Endow $D_{K}$ with the probability measure proportional to the surface area measure. We will show that the persistent homology of points sampled from $D_{K}$ can both recover $K$ in theory and effectively estimate $K$ in practice.

We prove that for three points sampled from $D_{K}$ the persistence of the corresponding cycle in the Čech complex is largest when the points are pairwise equidistant (Theorem 3.6).

Furthermore if this pairwise distance is fixed then we derive an analytic expression for the corresponding persistence (Theorem 3.7), which is continuous and increasing as a function of the curvature $K$ (Corollary 3.8). Combining these results, we have the following.

Theorem 1.1.

Let $p(K)$ denote the maximum (Čech) persistence for three points on a surface of constant curvature $K$ with pairwise distances at most some fixed constant. Then $p(K)$ is an invertible function.

We will also give several procedures for estimating $K$ from the persistent homology of the Vietoris-Rips complex on points sampled from $D_{K}$ . Before we summarize our computational results we describe our general framework.

1.2. A framework for solving inverse problems: inference using average persistence landscapes

Consider a compact metric space $(\mathbb{X},d)$ together with a Borel probability measure $\mu$ with full support. Call $(\mathbb{X},d,\mu)$ a metric measure space. Let $T$ be the diameter of $\mathbb{X}$ . Let $m\in\mathbb{N}$ . Sample $X=(x_{1},\ldots,x_{m})\in\mathbb{X}$ independently according to $\mu$ and consider the pairwise distances $\{d(x_{i},x_{j})\ |\ 1\leq i\leq j\leq m\}$ . From this data one may compute the persistent homology of the corresponding Vietoris-Rips complex, which may be represented by the corresponding persistence landscape $\lambda_{X}$ [6]. Sampling $X$ is equivalent to sampling a point in $\mathbb{X}^{m}$ according to $\mu^{\otimes m}$ [11]. Let $\Psi_{\mu}^{m}$ be the measure induced by $\mu^{\otimes m}$ on $\mathcal{L}$ , the convex hull of the persistence landscapes of persistence diagrams consisting of at most $m$ points $(x,y)$ with $0\leq x<y\leq T$ . The average persistence landscape is $\mathbb{E}_{\Psi_{\mu}^{m}}[\lambda_{X}]$ , the expectation of the random variable $\lambda_{X}$ with respect to the probability measure $\Psi_{\mu}^{m}$ .

We may estimate the average persistence landscape as follows. If we sample $X=(x_{1},\ldots,x_{m})$ as above $n$ times and average the resulting persistence landscapes, we obtain the empirical average persistence landscape $\bar{\lambda}^{m}_{n}=\frac{1}{n}\sum_{i=1}^{n}\lambda_{X^{(i)}}$ . The empirical average persistence landscape converges to the average persistence landscape (pointwise [6] and uniformly [12]).

Now assume that $C\subset\mathbb{R}^{d}$ is a compact subset and that we have a continuous map $\varphi$ from $C$ to metric measure spaces with the Gromov-Wasserstein metric [31]. Fix $m\in\mathbb{N}$ . By [11, Remark 6], the map from metric measure spaces with the Gromov-Wasserstein metric to their average persistence landscapes is continuous. Thus, composing $\varphi$ with the average persistence landscape we have a continuous map from $C$ to $L^{2}(\mathbb{N}\times\mathbb{R})$ , a Hilbert space containing the persistence landscapes and average persistence landscapes [6].

Assume that for some unknown $c\in C$ , we are able to sample points from the metric measure space $\phi(c)$ and compute their pairwise distances. In this case we can compute the empirical average persistence landscape $\bar{\lambda}_{n}^{m}(c)$ . We now have the following inverse problem. Given training data $\{c_{i},\bar{\lambda}_{n}^{m}(c_{i})\}$ , can we estimate $c$ from $\bar{\lambda}_{n}^{m}(c)$ ?

We will demonstrate the feasibility of solving this inverse problem for the case in which $K\in[-2,2]\subset\mathbb{R}$ and $\varphi(K)$ is the unit disk in the surface of constant curvature $K$ with probability measure proportional to the surface area measure. In this case, the composition of $\varphi$ with the average persistence landscape is a parametrized path in $L^{2}(\mathbb{N}\times\mathbb{R})$ . Our goal is to learn this parametrized path and to use it to estimate curvatures from empirical average persistence landscapes.

*Remark 1.2**.*

It would be great to have an analytic derivation of the average persistence landscape for the Vietoris-Rips complex for $m$ points sampled from the unit disk in a surface of constant curvature. Unfortunately, not much is known in this direction. The expected persistence diagram for the Vietoris-Rips complex for $m$ points sampled from the circle is known [8]. In addition, the order of the maximally persistent degree- $k$ cycle for the Vietoris-Rips complex for $m$ points sampled from the $d$ -dimensional cube as $m\to\infty$ is known [5].

There is also a Vietoris-Rips complex for the unit disk in a surface of constant curvature [10]. This is a simplicial complex with uncountably many $k$ -simplices for all $k\geq 0$ . The persistent homology of the Vietoris-Rips complex for the circle has been derived analytically [1]. Note that the persistence landscape of such Vietoris-Rips complexes is not the same as the average persistence landscape for samples of $m$ points.

1.3. Computational results

We apply the framework in the previous section to estimating curvature from sampled points and pairwise distance data.

We estimate curvature in the supervised and unsupervised settings. In the supervised setting we start with training data given by curvatures $K=\{-2,-1.96,-1.92,\ldots,1.96,2\}$ and corresponding empirical average persistence landscapes for homology in degree [math] and homology in degree $1$ , for $m=1000$ points. In both settings, we sample $100$ values of $K$ iid from $[-2,2]$ and compute the corresponding empirical average persistence landscapes. Using these empirical average persistence landscapes, we estimate the corresponding curvatures: using both nearest neighbors and support vector regression in the supervised setting; and using principal components analysis in the unsupervised setting. See Figure 1, where we use the concatenations of the degree [math] and degree $1$ persistence landscapes. The root mean squared error in our estimates is 0.056 for nearest neighbors, 0.017 for support vector regression, and 0.128 for principal components analysis. For more computational results, see Table 1. Furthermore, we estimate the fifth and ninety-fifth percentiles using quantile support vector regression. See Figure 9.

We also repeat most of the above estimates for the much more difficult computational setting in which all nonzero pairwise distances are sorted and replaced with their corresponding ordinal numbers. This is appropriate for neuroscience data in which the distances are only known up to rescaling by an unknown monotonic function [21]. In this case, the set of nonzero pairwise distances is the same for all curvatures. Nevertheless, we are still able to provide reasonable curvature estimates. See Figure 11. The root mean squared error in our estimates is 0.262 for nearest neighbors, 0.171 for support vector regression, and 0.392 for principal components analysis. For more computational results, see Table 2. This example makes it clear that the short bars in persistent homology do indeed encode subtle geometric information.

1.4. Expected impact

Our theoretical work

showing that persistent homology detects curvature may be used to help justify the use of persistent homology to study other geometric structures in applications, such as those listed in the start of the introduction.

We have outlined a framework for using topological data analysis for solving inverse problems. Persistent homology together with the average persistence landscape gives a continuous mapping from metric spaces with a probability measure to a Hilbert space. In situations in which it is easy to sample or subsample points and measure pairwise distances one may compute empirical average persistence landscapes. Convergence results are known [11] and in practice, they quickly converge with little noise. Furthermore this mapping is sensitive to the starting metric structure. Finally, as our constructions lie a Hilbert space, one can apply tools from statistical and machine learning. This approach should facilitate learning geometric structures in a broad range of applications.

1.5. Related work

Persistence landscapes have been used to study the geometry of microstructures [15]; protein conformations [28]; and financial times series [20]. Average persistence landscapes and average death vectors were used to detect differences in images of leaves in [33]. B. Schweinhart recently proved that persistent homology of random samples may be used to determine the fractal dimension of certain metric spaces [35].

2. Background

In this section we provide some necessary background from persistent homology, geometry, and statistics. For details, we refer the reader to [17, 18, 32] for persistent homology, [13, 3, 9] for geometry, and [37, 36] for statistics.

2.1. Filtered simplicial complexes from points

A simplicial complex is a collection $K$ of subsets of a set $V$ of vertices, such that if $\sigma\in K$ and $\tau\subset\sigma$ then $\tau\in X$ . A filtered simplicial complex is a collection of simplicial complexes $\{K_{t}\;|\;t\in\mathbb{R},t\geq 0\}$ with the property that whenever $s\leq t$ , there is an inclusion $K_{s}\subseteq K_{t}$ .

Let $Y$ be a metric space and let $X\subset Y$ be a finite subset. There are two common ways ways to turn $X$ into a filtered simplicial complex and we will use of both of them. First, for $t\geq 0$ let $\check{C}_{t}(X)$ be the simplicial complex where the 0-simplices of $\check{C}_{t}(X)$ are the points of $X$ and for $p\geq 1$ , $\check{C}_{t}(X)$ contains a $p$ –simplex $[x_{0},...,x_{p}]$ if and only if

[TABLE]

where $B_{r}(x)\subset Y$ denotes the closed ball of radius $r$ centered at the point $x\in X$ . The collection $\{\check{C}_{t}(X):t\geq 0\}$ forms a filtered simplicial complex, called the Čech complex of $X$ .

Now for $t\geq 0$ , let $R_{t}(X)$ be the simplicial complex whose 0-simplices are the points of $X$ and which includes the $p$ -simplex $[x_{0},...,x_{p}]$ if and only if for all $1\leq i,j\leq p$ , $d(x_{i},x_{j})\leq t$ . This filtered simplicial complex is called the Vietoris-Rips Complex. Notice that unlike the Čech complex, which depends on $Y$ , the Vietoris-Rips complex depends only on $X$ .

2.2. Persistent homology

Let $K$ be a simplicial complex. Taking reduced simplicial homology in degree $d$ with coefficients in some fixed field yields a vector space $H_{d}(K)$ . Furthermore an inclusion of simplicial complexes induces a linear map between the corresponding vector spaces [2, Chapter 8]. Let $\{K_{t}\}$ be a filtered simplicial complex. Taking homology in degree $d$ with coefficients in some fixed field yields a persistence module, $M$ , given by the collection of vector spaces $\{H_{d}(K_{t})\;|\;t\in\mathbb{R},t\geq 0\}$ and linear maps $f_{s}^{t}\colon H_{d}(K_{s})\rightarrow H_{d}(K_{t})$ induced by the inclusions $K_{s}\subseteq K_{t}$ whenever $s\leq t$ . As a special case, one has the interval persistence modules which are one dimensional on an interval, zero outside the interval, and all linear maps are the identity whenever not forced to be zero. The structure theorem of persistent homology says that under mild hypotheses, every persistence module $M$ is isomorphic to a direct sum of interval modules. The collection of these intervals is called the bar code of $M$ . Replacing an interval with its ordered pair of endpoints, we instead obtain the persistence diagram of $M$ . To enable us to use ideas from statistics and machine learning, we construct the following vector summaries.

For homology in degree [math] of both the Čech complex and the Vietoris-Rips complex, all of the intervals in the bar code have left endpoint [math]. In this case we can represent the bar code by a sorted list of the right end points in decreasing order. We call this order statistic a death vector. Note that since we are using reduced homology and all of our complexes are eventually connected, all of the values in the death vector are finite.

In other cases, we need a more sophisticated vector encoding. The persistent Betti number of M corresponding to $s\leq t$ is defined to be $\beta_{s}^{t}=$ dim(image( $f^{t}_{s}$ )). The persistence landscape of $M$ [6] is the function

[TABLE]

We discretize this function to obtain a vector,

[TABLE]

which we also call the persistence landscape. The persistence landscape can be efficiently computed from the bar code [7]. Note that since we are using reduced homology and all of our simplicial complexes are eventually contractible, all of the values in the persistence landscape are finite.

For homology in degree [math], we prefer the death vector to the persistence landscape since it provides a sparser encoding of the same information.

2.3. Geometries of constant curvature

Let $M_{K}$ be the complete, simply-connected 2-dimensional Riemannian manifold of constant Gaussian curvature $K$ . Note that $M_{K}$ is unique up to isometry by the Killing-Hopf Theorem. When $K=0$ , we can identify $M_{0}$ with $\mathbb{R}^{2}$ with the standard Euclidean metric. When $K>0$ we can identify $M_{K}$ with the sphere of radius $R:=\frac{1}{\sqrt{K}}$ centered at the origin in $\mathbb{R}^{3}$ , that is $M_{K}=\{(x,y,z)\in\mathbb{R}^{3}\;|\;x^{2}+y^{2}+z^{2}=R^{2}\}$ . When $K<0$ , we identify $M_{K}$ with the Poincaré disk model of the hyperbolic plane of curvature $K$ . That is, for $R=\frac{1}{\sqrt{-K}}$ , $M_{K}=\{(x,y)\in\mathbb{R}^{2}\;|\;x^{2}+y^{2}<R\}$ with Riemannian metric

[TABLE]

The geodesics in this model correspond to the intersection of $M_{K}$ and a (Euclidean) line through the origin in $\mathbb{R}^{2}$ or a (Euclidean) circle which is orthogonal to the boundary circle $\{(x,y)\in\mathbb{R}^{2}\;|\;x^{2}+y^{2}=R\}$ .

We think of $M_{K}$ as a model for hyperbolic, Euclidean, and spherical geometry when $K<0$ , $K=0$ , and $K>0$ respectively. The results in Section 3 will be derived using only elementary properties of these geometries. We review some of these properties next. First, however, we note that if $S$ is a surface with a Riemannian metric of constant Gaussian curvature $K$ , then we can naturally identify the universal cover $\widetilde{S}$ with $M_{K}$ . Hence $S$ will be locally isometric to $M_{K}$ . So while the model spaces $M_{K}$ that we work with are all simply-connected, we will see the same behavior locally on any surface of constant curvature. Note also that by the Uniformization Theorem, every orientable surface admits a Riemannian metric of constant Gaussian curvature.

2.4. Triangles

Let $P,Q$ be distinct points in $M_{K}$ . Unless $K>0$ and $P$ and $Q$ are antipodal, there is a unique line $\overleftrightarrow{PQ}$ containing $P$ and $Q$ and a unique shortest geodesic between $P$ and $Q$ whose image $\overline{PQ}$ is a subset of $\overleftrightarrow{PQ}$ .

Let $A$ , $B$ , and $C$ be three points in $M_{K}$ which are assumed to not be collinear. If $K>0$ , then this implies that no pair of these points is a pair of antipodal points on the sphere. It follows that there is a unique shortest geodesic segment between each pair of points. Let $T=\overline{AB}\cup\overline{AC}\cup\overline{BC}$ called the triangle with vertices $A$ , $B$ , $C$ , and edges or sides $\overline{AB}$ , $\overline{AC}$ , $\overline{BC}$ . The subspace $M_{K}\setminus T$ has two components. If $K\leq 0$ then exactly one of these has finite area, called the interior of $T$ . If $K>0$ then the component with smaller area is called the interior of $T$ .

2.5. Circumcircles

A circumcircle of a triangle $T$ is a circle containing the vertices of $T$ . A center of this circle is called a circumcenter and the corresponding radius is a called a circumradius. In $M_{0}$ , every triangle has a unique circumcircle with a unique circumcenter. If $K>0$ , then each triangle in $M_{K}$ has a unique circumcircle with two circumcenters. If $K<0$ , then a triangle in $M_{K}$ may or may not have a circumcircle, but if it does then the circumcenter is unique.

Lemma 2.1.

Let $P$ and $Q$ be points in $M_{K}$ . Then the perpendicular bisector of a line segment $\overline{PQ}$ consists of those points equidistant to $P$ and $Q$ .

Proof.

Suppose $A$ is equidistant from $P$ and $Q$ . Let $l$ be the line through $A$ which bisects the angle $\angle PAQ$ , and let $D$ be the point where $l$ intersects $\overline{PQ}$ . Then $\triangle PAD\cong\triangle QAD$ by Side-Angle-Side. Hence $\overline{PD}\cong\overline{DQ}$ , so $D$ is the midpoint of $\overline{PQ}$ . Also $\angle PDA\cong\angle QDA$ , and since these angles sum to $\pi$ they must both be right angles. Hence $l$ is the perpendicular bisector of $\overline{PQ}$ .

Conversely, if $A$ lies on the perpendicular bisector $l$ of $\overline{PQ}$ and $D$ is the midpoint of $\overline{PQ}$ , then triangles $\triangle PDA$ and $\triangle QDA$ are congruent by Side-Angle-Side, so $\overline{PA}\cong\overline{QA}$ . ∎

Theorem 2.2.

For a triangle in $M_{K}$ , the following statements are equivalent.

(a)

The perpendicular bisectors of two of the sides intersect. 2. (b)

The triangle has a circumcircle. 3. (c)

The perpendicular bisectors of the sides have a common intersection.

*Moreover, when at least one of these equivalent statements holds then the intersection point of the perpendicular bisectors of the sides is the circumcenter of the triangle. *

Proof.

Let $A$ , $B$ , $C$ be the vertices of triangle $T$ .

(a) implies (b). Assume that a point $P$ is in the intersection of the perpendicular bisectors of two of the sides of $T$ . Then $P$ is equidistant from $A$ , $B$ , and $C$ . So $P$ is a circumcenter of $T$ .

(b) implies (c). Let $P$ be a circumcenter. Then $P$ is equidistant from $A$ , $B$ , $C$ . So $P$ lies on the perpendicular bisector of each side.

(c) implies (a) is immediate. ∎

2.6. Areas of disks

We will use the following basic fact. The area of a disk of radius $r$ on a surface of constant curvature $K$ is given by

[TABLE]

2.7. Distances between points on a unit disk

We will want to compute the distances between points sampled from a disk of radius one on $M_{K}$ . We will represent the points in this disk using polar coordinates $(r,\theta)$ , where $0\leq r\leq 1$ and $0\leq\theta<2\pi$ .

For the Euclidean case, $K=0$ , we convert to Cartesian coordinates $(r\cos\theta,r\sin\theta)$ and compute the Euclidean distance.

In the spherical case, $K>0$ , $M_{K}$ is realized as the sphere of radius $R$ centered at the origin in $\mathbb{R}^{3}$ , where $R=\frac{1}{\sqrt{K}}$ . We consider our disk to be a spherical cap of this sphere. The point on the disk corresponding to $(r,\theta)$ can be written in spherical coordinates as $(R,\theta,\frac{r}{R})$ . Converting to Cartesian coordinates, we have $(R\sin(\frac{r}{R})\cos\theta,R\sin(\frac{r}{R})\sin\theta,R\cos(\frac{r}{R}))$ . The distance between two such points $x$ and $y$ is given by $R\cos^{-1}(\frac{x\cdot y}{R^{2}})$ . However, $\cos^{-1}(t)$ is not numerically stable near zero, so instead we use the following robust formula, $R\tan^{-1}(\frac{|x\times y|}{x\cdot y})$ . More specifically, we will use the two-argument arctangent function $R\operatorname{atan}\!2(|x\times y|,x\cdot y)$ .

For the hyperbolic case, $K<0$ , $M_{K}$ is realized as the Poincaré disk, with $R=\frac{1}{\sqrt{-K}}$ . We consider our disk of hyperbolic radius one to be centered at the origin. The point on the disk corresponding to $(r,\theta)$ can be written in Cartesian coordinates as $\left(R\tanh\left(\frac{r}{2R}\right)\cos\theta,R\tanh\left(\frac{r}{2R}\right)\sin\theta\right)$ . The hyperbolic distance between between to points $u$ and $v$ in the Poincaré $R$ -disk is given by $2R\tanh^{-1}\frac{|z-w|}{|1-z\bar{w}|}$ where $z=u/R$ and $w=v/R$ are thought of as complex numbers.

2.8. Laws of sines and cosines

We will need the laws of sines and cosines for a triangle on a surface of constant curvature $K$ .

Theorem 2.3.

Generalized Law of Sines

Let $\Delta ABC$ be a triangle in $M_{K}$ with lengths $a,b,c$ and angles $\alpha,\beta,\gamma$ respectively. When K=0,

[TABLE]

When $K>0$ ,

[TABLE]

When $K<0$ ,

[TABLE]

Theorem 2.4.

Generalized Law of Cosines

Let $\Delta ABC$ be a triangle in $M_{K}$ with lengths $a,b,c$ and angles $\alpha,\beta,\gamma$ respectively. When $K=0$ ,

[TABLE]

When $K>0$ ,

[TABLE]

When $K<0$ ,

[TABLE]

2.9. Inversion sampling

The following theorem allows us to sample points from a distribution knowing only the inverse of the cumulative distribution $F$ by sampling a point $u$ uniformly from $[0,1]$ and then calculating $F^{-1}(u)$ . This method of sampling from F is called inversion sampling.

Theorem 2.5.

[14]** Let F be an invertible continuous cumulative distribution function on some domain D. If U is distributed uniformly on $[0,1]$ , then $F^{-1}(U)$ has cumulative distribution function F.

2.10. Support vector regression

We assume that our data $(x_{1},y_{1}),\ldots,(x_{N},y_{N})$ with $x_{i}\in\mathbb{R}^{d}$ and $y_{i}\in\mathbb{R}$ is drawn from some unknown joint distribution on $\mathbb{R}^{d}\times\mathbb{R}$ . Our goal is to estimate a functional relationship between the variables.

(Linear) support vector regression (SVR) is an approach to this problem which computes a predictor $f(x)=\langle w,x\rangle+b$ by solving the following convex optimization problem:

[TABLE]

for each $i\in\{1,...,N\}$ . The slack variables $\zeta_{1,i}$ and $\zeta_{2,i}$ and cost parameter $C$ allow for some errors among the training data. A larger value of $C$ increases the penalty for an error in the training data. If $\varepsilon=0$ then this problem corresponds to using the linear loss function $L=|y_{i}-f(x_{i})|$ . For $\varepsilon>0$ , we instead have the $\varepsilon$ -insensitive loss function given by

[TABLE]

This function ignores errors within $\varepsilon$ of the true values.

In Section 4.3.2, in which the data seems to have little noise, we are able to set $\varepsilon=0$ and $C=100$ . In Section 4.5, in which the data seems to be noisier, we take $\varepsilon=1$ or $0.2$ and $C=10$ to avoid overfitting.

For quantile regression (Section 4.3.3) we will use the pinball loss function,

[TABLE]

where $0<\tau<1$ . This loss function allows us to estimate the $\tau$ -quantile.

3. Persistence of triangles

In this section we study how triangles contribute to the persistent homology of the Čech complex formed from points on $M_{K}$ . Specifically, we show the maximal interval of parameter values for which three points contribute a non-trivial element to the homology in degree one of the Čech complex depends on $K$ . Moreover, we will show that for all $K$ this interval is maximized by the vertices of an equilateral triangle $T$ and give formulas depending on $K$ and the length of the sides of $T$ .

3.1. Triangles and their persistent homology

Let $X$ be a finite set of points on $M_{K}$ . Let $A$ , $B$ and $C$ be points in $X$ which we assume are not collinear. When $K>0$ , we will also assume that no pair of these points is antipodal, or equivalently the pairwise distances are all less then $\frac{\pi}{\sqrt{K}}$ . There are two triangles of interest corresponding to the vertices $A$ , $B$ , and $C$ . There is the (geometric) triangle $T$ , which is a subset of $M_{K}$ (Section 2.4). There is also the abstract triangle $\{A,B,C\}\subset X$ which may be an element of the Čech complex on $X$ . It will be convenient to refer to both of these as the triangle $T$ corresponding to the vertices $A$ , $B$ , and $C$ . It should be clear from the context which of these we mean.

The boundary of the triangle $T$ contributes a 1–cycle in $\check{C}_{t}(X)$ in the Čech complex whenever $t\geq b(T)$ , where

[TABLE]

The value $b(T)$ is called the birth of the triangle $T$ . In the other direction, $T$ contributes a 2–simplex to $\check{C}_{t}(X)$ whenever $t\geq d(T)$ , where

[TABLE]

The value of $d(T)$ is called the death of $T$ . Hence $T$ induces an element of $H_{1}(\check{C}_{t}(X))$ for all $t\geq b(T)$ , and this element is trivial for all $t\geq d(T)$ . In particular, if $b(T)=d(T)$ , then $T$ does not contribute any non-trivial elements to the persistent homology.

The persistence of an interval $[b(T),d(T))$ is usually given by the difference $d(T)-b(T)$ , the length of the time that $T$ is contributing to homology. However, in situations where a scale-free version is desired [5], it is preferable to use logarithmic coordinates and to instead consider the ratio $\frac{d(T)}{b(T)}$ , which we will refer to as the persistence of $T$ and denote by $p(T)$ .

We now we fix notation that will be used for the rest of this section. Let $T$ denote a triangle with vertices $A$ , $B$ , and $C$ . Let $a$ , $b$ , and $c$ be the lengths of the sides of $T$ opposite $A$ , $B$ , and $C$ , respectively. We assume $T$ is labeled such that $a\geq b\geq c$ . See Figure 2. When $K>0$ , we let $R=\frac{1}{\sqrt{K}}$ , that is $R$ is the radius of the sphere realizing $M_{K}$ . Recall that in this case we are also assuming that $a<\pi R$ . Note that the birth of $T$ is simply half the length of the longest side, so with this notation we have $b(T)=\frac{a}{2}$ . We will also let $M$ denote the midpoint of the side $\overline{BC}$ , and let $m$ denote the distance from $A$ to $M$ . If $T$ has a circumcircle then we denote the corresponding circumcenter by $P$ .

In this section we will prove the following.

Proposition 3.1.

The following are equivalent:

(a)

$T$ * produces persistent $H_{1}$ in the Čech complex. That is, $b(T)<d(T)$ .* 2. (b)

$\frac{a}{2}<m$ . 3. (c)

$T$ * has a circumcircle and the circumcenter $P$ is in the interior of $T$ .*

Furthermore, if these equivalent conditions hold, then $b(T)$ equals $\frac{a}{2}$ and $d(T)$ equals the circumradius.

Lemma 3.2.

Let $P$ and $Q$ be points in $M_{K}$ and let $l$ be a line through $Q$ which is perpendicular to $\overleftrightarrow{PQ}$ . If $K>0$ , then we also assume that $d(P,Q)<\frac{\pi}{2}R$ . Let $t\geq 0$ . Let $H$ be a half plane bounded by $\overleftrightarrow{PQ}$ and let $Q_{t}$ be the point in $H$ on $l$ such that $d(Q,Q_{t})=t$ . Then $d(P,Q_{t})$ is a strictly increasing function of $t$ .

Proof.

Since $\cos\angle PQQ_{t}=0$ , this follows from the Generalized Law of Cosines (Theorem 2.4). ∎

The next lemma explains the role of the distance $m$ from $A$ to the midpoint $M$ of $\overline{BC}$ .

Lemma 3.3.

$b(T)<d(T)$ * if and only if $\frac{a}{2}<m$ .*

Proof.

Note that $M$ is the unique element of $B_{\frac{a}{2}}(B)\cap B_{\frac{a}{2}}(C)$ . Hence $B_{\frac{a}{2}}(A)\cap B_{\frac{a}{2}}(B)\cap B_{\frac{a}{2}}(C)\neq\emptyset$ if and only if $M\in B_{\frac{a}{2}}(A)$ , thus $b(T)=d(T)$ if and only if $m\leq\frac{a}{2}$ . Therefore, $b(T)<d(T)$ if and only if $\frac{a}{2}<m$ . ∎

Lemma 3.4.

Suppose that $\frac{a}{2}<m$ . Then the triangle $T$ has a circumcenter $P$ , and $P$ is contained in the interior of $T$ .

Proof.

Assume that $\frac{a}{2}<m$ . Note that the distance from $M$ to $C$ is $\frac{a}{2}$ , so $d(M,A)>d(M,C)$ . Let $l$ denote the perpendicular bisector of $\overline{BC}$ . Then $l$ must intersect one of the other two sides of $T$ . Since $b\geq c$ , $l$ intersects $\overline{AC}$ in a point $N$ . If $K>0$ , then $d(M,C)=\frac{a}{2}<\frac{\pi}{2}R$ . Hence for any $K$ we can apply Lemma 3.2 to get that $d(N,C)>d(M,C)=\frac{a}{2}$ . Since the length of $\overline{AC}$ is $b\leq a$ , we must have $d(A,N)=d(A,C)-d(N,C)<a-\frac{a}{2}=\frac{a}{2}$ . Thus $d(N,A)<d(N,C)$ .

Now, as a point moves along $l$ from $M$ to $N$ , by the continuity of the distance function and the intermediate value theorem there must exist a point $P$ in the interior of $\overline{MN}$ where the $d(P,A)=d(P,C)$ . Since $P$ is on the perpendicular bisector of $\overline{BC}$ , we also have the distance from $P$ to $B$ is equal to the distance from $P$ to $C$ . Thus, $P$ is a circumcenter of $T$ . Since $P$ is in the interior of $\overline{MN}$ and this segment is contained in $T$ by construction, we have that $P$ is in the interior of $T$ . ∎

Lemma 3.5.

Suppose $\frac{a}{2}<m$ and there exists an $r>0$ and a point $D$ such that $D\in B_{r}(A)\cap B_{r}(B)\cap B_{r}(C)$ but $D$ is not the circumcenter of triangle $\triangle ABC$ . Then there exists $r^{\prime}<r$ such that $B_{r^{\prime}}(A)\cap B_{r^{\prime}}(B)\cap B_{r^{\prime}}(C)\neq\emptyset$ .

Proof.

Since $D$ is not the circumcenter, there must exist at least one vertex whose distance to $D$ is less then $r$ . Suppose without loss of generality that $d(D,A)<r$ . First, suppose that $D\notin\overleftrightarrow{BC}$ . Let $l$ be a line that contains $D$ and is perpendicular to $\overleftrightarrow{BC}$ . Let $D^{\prime}\neq D$ be a point on the segment of $l$ from $D$ to $\overleftrightarrow{BC}$ such that $d(D,D^{\prime})<r-d(D,A)$ . Hence $d(D^{\prime},A)<r$ . Also, by construction and Lemma 3.2 $D^{\prime}$ is closer to $B$ and $C$ than $D$ , hence $d(D^{\prime},B)<d(D,B)\leq r$ and similarly $d(D^{\prime},C)<r$ . Letting $r^{\prime}=\max\{d(D^{\prime},A),d(D^{\prime},B),d(D^{\prime},C)\}$ , we get that $r^{\prime}<r$ and $D^{\prime}\in B_{r^{\prime}}(A)\cap B_{r^{\prime}}(B)\cap B_{r^{\prime}}(C)$ .

Now suppose $D\in\overleftrightarrow{BC}$ . Suppose without loss of generality that $d(D,B)\leq d(D,C)$ . Then either $d(D,B)<d(D,C)\leq r$ , or $d(D,B)=d(D,C)=\frac{a}{2}<m\leq r$ since $D$ is the midpoint of $\overline{BC}$ in this case. Either way we get that $d(D,B)<r$ , so we can repeat the same proof as above using the point $B$ instead of $A$ . ∎

Proof of Proposition 3.1.

(a) and (b) are equivalent by Lemma 3.3. Lemma 3.4 shows that (b) implies (c). Assume now that (c) holds, that is $T$ has a circumcircle and the circumcenter $P$ is in the interior of $T$ . Since $P$ lies on the perpendicular bisector $l$ of $\overline{BC}$ , by Lemma 3.2 we get that $\frac{a}{2}=d(B,M)<d(B,P)=d(A,P)$ .

Now let $Q$ be a point on $l$ such that $\overleftrightarrow{AQ}$ is perpendicular to $l$ . Then $\overleftrightarrow{AQ}$ and $\overleftrightarrow{BC}$ are both perpendicular to $l$ . When $K\leq 0$ , this means that $\overleftrightarrow{AQ}$ and $\overleftrightarrow{BC}$ are parallel. When $K>0$ , this means that the two intersection points of $\overleftrightarrow{AQ}$ and $\overleftrightarrow{BC}$ both have distance $\frac{\pi}{2}R$ from $l$ . Furthermore, the line $l$ must intersect either $\overline{AB}$ or $\overline{AC}$ . Since $b\geq c$ , $l$ intersects $\overline{AC}$ . So $d(A,Q)\leq d(A,C)\leq a<\frac{\pi}{2}R$ . Thus, for all $K$ we get that $\overline{AQ}$ does not intersect $\overleftrightarrow{BC}$ . Thus, $Q$ and $A$ are on the same side of $\overleftrightarrow{BC}$ . By a similar argument, it also follows that $\overleftrightarrow{AQ}$ does not intersect $\overline{BC}$ .

We also note that $Q$ cannot be in the interior of $T$ . Indeed, suppose $Q$ is in the interior of $T$ , and let $S$ be the point where $l$ and $\overline{AC}$ intersect. Hence $Q$ lies in the interior of the segment $\overline{MS}$ . Since $\overleftrightarrow{AQ}$ intersects one side of triangle $\triangle MSC$ , it must intersect one of the other two sides. We have already shown that $\overleftrightarrow{AQ}$ does not intersect $\overline{MC}\subseteq\overline{BC}$ , hence it must intersect $\overline{SC}\subseteq\overleftrightarrow{AC}$ . But this means that $\overleftrightarrow{AQ}$ and $\overleftrightarrow{AC}$ must intersect in two non-antipodal points, a contradiction.

Since $Q$ and $P$ are on the same side of $\overleftrightarrow{BC}$ and $P$ is in the interior of $T$ and $Q$ is not, we must have $P\in\overline{QM}$ . Thus, we get that $d(P,Q)<d(M,Q)$ , and so by Lemma 3.2 $d(A,P)<d(A,M)=m$ . Combining this with the previous inequality gives that $\frac{a}{2}<m$ .

Finally, suppose (a), (b), and (c) hold. Let $r=d(T)$ . By definition, $B_{r}(A)\cap B_{r}(B)\cap B_{r}(C)\neq\emptyset$ . Lemma 3.5 then implies that $P$ must be the unique element of $B_{r}(A)\cap B_{r}(B)\cap B_{r}(C)$ , and hence $r=d(P,A)=d(P,B)=d(P,C)$ , that is $r$ is the circumradius of $T$ . ∎

3.2. The most persistent triangles

In this section we show that among triangles $T$ with fixed birth $b(T)$ , those with maximal persistence $p(T)=\frac{d(T)}{b(T)}$ are the equilateral triangles.

Let $T$ be a triangle with vertices $A$ , $B$ , and $C$ , and corresponding edge lengths $a\geq b\geq c$ . Assume that $b(T)<d(T)$ . If $K>0$ then we also assume that $a<\frac{2\pi}{3}R$ , where $R=\frac{1}{\sqrt{K}}$ . This assumption is necessary for an equilateral triangle with side lengths $a$ to exist on $M_{K}$ .

Theorem 3.6.

Suppose $T$ is not an equilateral triangle. Then there exists an equilateral triangle $T^{\prime}$ such that $b(T^{\prime})=b(T)$ and $d(T^{\prime})>d(T)$ .

Proof.

We will first show that $T$ can be replaced by an isosceles triangle with two sides of length $a$ . If $T$ is not already of this form, then longest side of $T$ is strictly bigger then the length of the other two sides, that is $a>b\geq c$ . Let $l_{1}$ , $l_{2}$ , and $l_{3}$ be the perpendicular bisectors to $\overline{BC}$ , $\overline{AB}$ ,and $\overline{AC}$ respectively. By Proposition 3.1, these bisectors intersect in the point $P$ , that is the circumcenter of $T$ , which is in the interior of $T$ .

Let $A^{\prime}$ be the point on $\overleftrightarrow{AB}$ such that $A$ is between $A^{\prime}$ and $B$ and $\max\{d(B,A^{\prime}),d(C,A^{\prime}\}=a$ . Let $T^{\prime}$ be the triangle formed by $A^{\prime}$ , $B$ , and $C$ . See Figure 3. By construction, $T$ has two sides of length $a$ , and $a$ is still the length of the longest side of $T^{\prime}$ . Thus $b(T^{\prime})=b(T)$ .

We will show that $T^{\prime}$ satisfies $d(T^{\prime})>b(T^{\prime})$ using Proposition 3.1. Let $M$ be the midpoint of $\overline{BC}$ . Since $a>b$ , $l_{2}$ intersects $\overline{BC}$ at a point $N$ . Since $P$ is inside $T$ , $N$ is on the opposite side of $l_{1}$ as $B$ .

This means that $M$ and $B$ are on the same side of $l_{2}$ , and hence $M$ and $A$ are on opposite sides of $l_{2}$ .

Now let $Q$ be the point on $\overleftrightarrow{AB}$ such that $\overleftrightarrow{MQ}$ is perpendicular to $\overleftrightarrow{AB}$ . Then $\overleftrightarrow{MQ}$ and $l_{2}$ are both perpendicular to $\overleftrightarrow{AB}$ . When $K\leq 0$ , this means that $\overleftrightarrow{MQ}$ and $l_{2}$ are parallel. When $K>0$ , this means that the two intersection points of $l_{2}$ and $\overleftrightarrow{MQ}$ both have distance $\frac{\pi}{2}R$ from $\overleftrightarrow{AB}$ . In this case, $d(M,Q)\leq d(M,B)=\frac{a}{2}<\frac{\pi}{2}R$ . Hence for all $K$ we get that the segment $\overline{MQ}$ does not intersect $l_{2}$ . Thus $M$ and $Q$ are on the same side of $l_{2}$ , which means that $Q$ and $A$ are on opposite sides of $l_{2}$ .

Since $A$ is closer to $l_{2}$ than $A^{\prime}$ , it follows that $A$ is closer to $Q$ than $A^{\prime}$ . Hence Lemma 3.2 implies that $d(A^{\prime},M)>d(A,M)>\frac{a}{2}$ , which means that the conclusions of Proposition 3.1 hold for $T^{\prime}$ .

Let $P^{\prime}$ be the circumcenter of triangle $T^{\prime}$ and $l^{\prime}_{2}$ be the perpendicular bisector of $\overline{BA^{\prime}}$ . Then $P^{\prime}$ lies on $l_{1}$ and $d(M,P^{\prime})>d(M,P)$ . Since $l_{1}$ is perpendicular to $\overleftrightarrow{BM}=\overleftrightarrow{BC}$ , the distance from a point on $l_{1}$ to $B$ increases as that point moves away from $M$ . Hence $d(B,P^{\prime})>d(B,P)$ , or equivalently $d(T^{\prime})>d(T)$ .

Thus, we can assume $T$ has two sides of length $a$ , that is $a=b>c$ .

Consider the circles of radius $a$ centered at $B$ and $C$ , see Figure 4. When $K\leq 0$ , it is easy to see that they intersect at two points, one on each side of $\overleftrightarrow{BC}$ . When $K>0$ , let $M^{\prime}$ denote the point on the sphere that is antipodal to $M$ . Then $d(B,M)=\frac{a}{2}<a$ and since we assumed that $a<\frac{2\pi}{3}R$ , $d(B,M^{\prime})=\pi R-\frac{a}{2}>\frac{2\pi}{3}R>a$ . Note that $M$ and $M^{\prime}$ both lie on $l_{1}$ , hence there exist two points on $l_{1}$ , one on each side of $\overleftrightarrow{BC}$ , whose distance to $B$ is equal to $a$ . Since these points lie on $l_{1}$ , they also have distance $a$ to $C$ , and hence they lie on the intersection of the two circles.

Let $A^{\prime}$ be the intersection point of these two circles on the same side of $\overleftrightarrow{BC}$ as $A$ . Again, let $T^{\prime}$ be the triangle with vertices $A^{\prime}$ , $B$ , and $C$ . By construction $T^{\prime}$ is an equilateral triangle with side lengths $a$ , hence $b(T^{\prime})=\frac{a}{2}=b(T)$ .

Let $l_{2}^{\prime}$ be the perpendicular bisector of $\overline{A^{\prime}B}$ . By construction, the angle of $T$ at vertex $C$ is smaller then the angle of $T^{\prime}$ at vertex $C$ . Since these are both isosceles triangles, $l_{2}$ and $l_{2}^{\prime}$ bisect these angles respectively. Hence, the angle formed by $\overline{BC}$ and $l_{2}$ is smaller then the angle formed by $\overline{BC}$ and $l_{2}^{\prime}$ . It follows that the point $P$ where $l_{2}$ intersects $l_{1}$ is closer to $\overline{BC}$ then the point $P^{\prime}$ where $l_{2}^{\prime}$ intersects $l_{2}$ . As before, this means that $d(B,P^{\prime})>d(B,P)$ . Since $P$ and $P^{\prime}$ are the circumcenters of $T$ and $T^{\prime}$ , we get that $d(T^{\prime})>d(T)$ . ∎

3.3. Persistence of equilateral triangles

In this section, we give formulas for the persistence $p(T)$ where $T$ is an equilateral triangle in $M_{K}$ . In general it is possible to give formulas for the persistence of arbitrary triangles in $M_{K}$ in terms of the side lengths of $T$ and $K$ since $b(T)$ is half the length of the longest side of $T$ and $d(T)$ is the circumradius of $T$ . In the general case these formulas are not particularly enlightening, however for equilateral triangles the generalized law of sines (Theorem 2.3) allows us to simplify the formulas considerably.

Theorem 3.7.

Let $T_{K,a}$ be an equilateral triangle in $M_{K}$ with side length $a$ .

[TABLE]

Proof.

Let $T$ be a equilateral triangle in $M_{K}$ with vertices $A$ , $B$ , and $C$ and side lengths a. Let $M$ be the midpoint of $AB$ , and let $P$ be the circumcenter of $T$ . See Figure 5. Since $P$ is the circumcenter, it is the intersection of the perpendicular bisectors of the sides of $T$ . Thus, $\angle AMP=\pi/2$ . Moreover, these perpendicular bisector split $T$ into 6 congruent triangles which all contain and surround the vertex $P$ . It follows that the angles of these triangles at the vertex $P$ sum to $2\pi$ , and since the angles are all congruent we get $\angle APM=\pi/3$ . Furthermore, the length of $\overline{AM}$ is $b(T)$ and the length of $\overline{AP}$ is $d(T)$ .

We apply the generalized law of sines (Theorem 2.3) to the triangle $\Delta AMP$ . For $K=0$ , we have

[TABLE]

For $K>0$ , we have

[TABLE]

Then

[TABLE]

Similarly when $K<0$ ,

[TABLE]

From these formulas, one can easily compute that for any fixed $a$ , the function which assigns to $K$ the persistence of an equilateral triangle of side length $a$ in $M_{K}$ is an increasing and continuous function. Indeed, the fact that this function converges to $\dfrac{2}{\sqrt{3}}$ as $K\to 0$ is straightforward application of l’Hôpital’s rule. To get a sense of scale, if $a=1$ then the values for $p(T)$ for $K=-2$ , $-1$ , [math], $1$ , and $2$ are approximately $1.1294$ , $1.1406$ , $1.1547$ , $1.1733$ , and $1.1996$ .

Corollary 3.8.

Let $a>0$ . Let $p_{a}(K)$ denote the persistence of an equilateral triangle of side length $a$ in a surface of constant curvature $K$ . Then $p_{a}(K)$ is a continuous and increasing function.

Combining Theorems 3.6 and 3.7

and Corollary 3.8,

we obtain Theorem 1.1.

4. Estimating curvature using persistence

In this section, we demonstrate that using the persistent homology of the Vietoris-Rips complex of points sampled on disks of constant curvature we are able to produce good estimates of the curvature.

4.1. Sampling points uniformly for a unit disk of constant curvature

We need to sample points uniformly (with respect to the area measure) from disks of constant curvature with radius one. See Figure 6.

4.1.1. Euclidean case

We start with the Euclidean case, $K=0$ . Consider the disk of radius one centered at the origin. Parametrize points on this disk by an angle, $0\leq\theta<2\pi$ , and a radius, $0\leq r\leq 1$ . We will sample $\theta$ and $r$ independently. For $\theta$ , sample uniformly, drawing from the uniform distribution on $[0,2\pi]$ . For $r$ , the probability of a point lying within the disk of radius $r$ should equal the proportion to the area of that disk relative to the area of the disk of radius 1. The area of a disk of radius $r$ equals $\pi r^{2}$ . So the cumulative distribution function of $r$ is given by

[TABLE]

and the inverse cumulative distribution function is given by

[TABLE]

So we can sample $u$ uniformly on $[0,1]$ and use $F^{-1}(u)$ to obtain the desired sample of $r$ (see Section 2.9).

4.1.2. Spherical case

Next, consider the spherical case, $K>0$ . We will assume that $K\leq 2$ which will ensure that we are able to embed a disk with radius one on the upper hemisphere of a sphere with constant curvature $K$ .

Our sampling procedure follows the Euclidean case. We parametrize points on a disk of radius one with an angle $0\leq\theta\leq 2\pi$ and a radius $0\leq r\leq 1$ . We sample $\theta$ and $r$ independently, sampling $\theta$ from uniform distribution on $[0,2\pi]$ . For $r$ , the disk of radius $r$ has area $\frac{4\pi}{K}\sin^{2}(\frac{r\sqrt{K}}{2})$ . So the cumulative distribution for $r$ is given by

[TABLE]

and the inverse cumulative distribution is given by

[TABLE]

So we can sample $u$ uniformly on $[0,1]$ and use $F^{-1}(u)$ to sample $r$ .

4.1.3. Hyperbolic case

It remains to consider the hyperbolic case, $K<0$ . We parametrize points on this disk by an angle $0\leq\theta<2\pi$ and a radius $0\leq r\leq 1$ . As before, we sample $\theta$ and $r$ independently, taking $\theta$ from the uniform distribution on $[0,2\pi]$ . The area of a hyperbolic disk of hyperbolic radius $r$ is given by $\frac{4\pi}{-K}\sinh^{2}(\frac{r\sqrt{-K}}{2})$ . Thus the cumulative distribution of $r$ is given by

[TABLE]

and the inverse cumulative distribution is given by

[TABLE]

4.2. Average death vectors and average persistence landscapes

For a given curvature $K$ , we independently sample $1000$ points from the unit disk in the surface of constant curvature $K$ , uniformly with respect to the area measure (Section 4.1) and compute the pairwise distances between such points (Section 2.7). From this pairwise distance data, we compute the persistent homology of the corresponding Vietoris-Rips complex (Section 2.1 and 2.2). We encode the persistent (reduced) homology in degree [math] as a death vector and the persistent homology in degree $1$ as a persistence landscape (Section 2.2). We then repeat this $100$ times and average the vectors to obtain an average death vector and average persistence landscape. See Figures 7 and 8.

4.3. Supervised learning

As training data, for each $K\in\{-2,-1.96,-1.92,\ldots,1.96,2\}$ we compute the average death vector and average persistence landscape as in Section 4.2. Call the average death vectors the $H_{0}$ training vectors, call the average persistence landscapes the $H_{1}$ training vectors, and call the concatenations of the average death vectors and the average persistence landscapes the $H_{0}$ -and- $H_{1}$ training vectors.

For testing data, sample 100 curvatures uniformly in $[-2,2]$ and compute their corresponding average death vectors and average persistence landscapes as in Section 4.2. Call the average death vectors the $H_{0}$ testing vectors, call the average persistence landscapes the $H_{1}$ testing vectors, and call the concatenations of the average death vectors and the average persistence landscapes the $H_{0}$ -and- $H_{1}$ testing vectors. Now we assume that the testing curvatures are unknown.

4.3.1. Nearest neighbors

For each testing vector, find the three nearest training vectors using the Euclidean distance. Estimate the curvature of the testing vector to be the weighted average of the curvatures of the three nearest training vectors, with the weighting given by the reciprocal of the distance. Results are given in Figure 1 and Table 1.

4.3.2. Support vector regression

We apply support vector regression to the training data to construct a model. We use a linear loss function and the dot product on the training vectors. This dot product corresponds to the inner product on the space of persistence landscapes [6]. We use the ksvm function in the kernlab package [26] in R with cost $100$ (and $\varepsilon=0$ ). Apply the testing vectors to the linear model computed using support vector regression to estimate the corresponding curvature. Results are given in Figure 1 and Table 1.

4.3.3. Quantile regression

We use the pinball loss function (Section 2.10) and the dot product on the training $H_{0}$ -and- $H_{1}$ vectors to construct models that estimate the $\tau$ -quantiles for $\tau=0.05$ , $0.5$ , and $0.95$ . We use the kqr function in the kernlab package [26] in R with cost $100$ . Applying the testing vectors to these models we obtain the curves given in Figure 9.

4.4. Unsupervised learning

Remarkably, we are still able to provide reasonable curvature estimates (up to sign) without any training data.

Sample 100 curvatures uniformly in $[-2,2]$ . For each of these compute the average death vectors and average persistence landscapes as in Section 4.2. Call the average death vectors the $H_{0}$ vectors, call the average persistence landscapes the $H_{1}$ vectors, and call the concatenations of the average death vectors and the average persistence landscapes the $H_{0}$ -and- $H_{1}$ vectors.

For each of these three sets of vectors apply principal components analysis (PCA). The projections onto the first two PCA coordinates for the $H_{0}$ -and- $H_{1}$ vectors are given in Figure 10. Rescale the first principal component axis to [-2,2] and use this to estimate the curvature. Results are given in Figure 1 and Table 1. Note that with probability $\frac{1}{2}$ this procedure will choose the wrong sign for the estimated curvature.

4.5. Using ordinals of sorted pairwise distances

In this section we show that our methods do not depend on differences in distributions of the pairwise distances.

In neuroscience [21], certain observed correlations are believed to be given by an unknown monotonic function on an underlying distance in the relevant stimulus space. Therefore, the distance data should only be used up to monotone transformations. This can be done by replacing scalar values with ordinal values.

We sort the nonzero pairwise distances and replace them with their corresponding ordinal numbers. Thus, for each curvature, the set of nonzero pairwise distances is the set $\{1,2,3,\ldots,\binom{m}{2}\}$ . We redo all of the computations in Sections 4.3 and 4.4 in this setting. For nearest neighbors we use the five nearest neighbors. For support vector regression we use cost 10 and an $\varepsilon$ -insensitive linear loss function with $\varepsilon=1$ for $H_{0}$ , $\varepsilon=0.2$ for $H_{1}$ , and $\varepsilon=0.2$ for $H_{0}$ and $H_{1}$ . We choose different hyper-parameters to avoid over-fitting due to the greater variance in the data. The results are given in Figure 11 and Table 2.

Acknowledgments

This research was partially supported by the Southeast Center for Mathematics and Biology, an NSF-Simons Research Center for Mathematics of Complex Biological Systems, under National Science Foundation Grant No. DMS-1764406 and Simons Foundation Grant No. 594594. This material is based upon work supported by, or in part by, the Army Research Laboratory and the Army Research Office under contract/grant number W911NF-18-1-0307. We would also like to thank the anonymous referees for their helpful comments.

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] MichałAdamaszek and Henry Adams. The Vietoris-Rips complexes of a circle. Pacific J. Math. , 290(1):1–40, 2017.
2[2] Mark Anthony Armstrong. Basic topology . Undergraduate Texts in Mathematics. Springer-Verlag, New York-Berlin, 1983.
3[3] Alan F. Beardon. The geometry of discrete groups , volume 91 of Graduate Texts in Mathematics . Springer-Verlag, New York, 1995.
4[4] Paul Bendich, J. S. Marron, Ezra Miller, Alex Pieloch, and Sean Skwerer. Persistent homology analysis of brain artery trees. Ann. Appl. Stat. , 10(1):198–218, 03 2016.
5[5] Omer Bobrowski, Matthew Kahle, and Primoz Skraba. Maximally persistent cycles in random geometric complexes. Ann. Appl. Probab. , 27(4):2032–2060, 2017.
6[6] Peter Bubenik. Statistical topological data analysis using persistence landscapes. J. Mach. Learn. Res. , 16:77–102, 2015.
7[7] Peter Bubenik and Pawel Dlotko. A persistence landscapes toolbox for topological statistics. Journal of Symbolic Computation , 78:91 – 114, 2017.
8[8] Peter Bubenik and Peter T. Kim. A statistical approach to persistent homology. Homology, Homotopy Appl. , 9(2):337–362, 2007.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Persistent homology detects curvature

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

1.1. Theoretical results: short bars detect geometry

Theorem 1.1**.**

1.2. A framework for solving inverse problems: inference using average persistence landscapes

Remark 1.2*.*

1.3. Computational results

1.4. Expected impact

1.5. Related work

2. Background

2.1. Filtered simplicial complexes from points

2.2. Persistent homology

2.3. Geometries of constant curvature

2.4. Triangles

2.5. Circumcircles

Lemma 2.1**.**

Proof.

Theorem 2.2**.**

Proof.

2.6. Areas of disks

2.7. Distances between points on a unit disk

2.8. Laws of sines and cosines

Theorem 2.3**.**

Theorem 2.4**.**

2.9. Inversion sampling

Theorem 2.5**.**

2.10. Support vector regression

3. Persistence of triangles

3.1. Triangles and their persistent homology

Proposition 3.1**.**

Lemma 3.2**.**

Proof.

Lemma 3.3**.**

Proof.

Lemma 3.4**.**

Proof.

Lemma 3.5**.**

Proof.

Proof of Proposition 3.1.

3.2. The most persistent triangles

Theorem 3.6**.**

Proof.

3.3. Persistence of equilateral triangles

Theorem 3.7**.**

Proof.

Corollary 3.8**.**

4. Estimating curvature using persistence

4.1. Sampling points uniformly for a unit disk of constant curvature

4.1.1. Euclidean case

4.1.2. Spherical case

4.1.3. Hyperbolic case

4.2. Average death vectors and average persistence landscapes

4.3. Supervised learning

4.3.1. Nearest neighbors

4.3.2. Support vector regression

4.3.3. Quantile regression

4.4. Unsupervised learning

4.5. Using ordinals of sorted pairwise distances

Acknowledgments

Theorem 1.1.

*Remark 1.2**.*

Lemma 2.1.

Theorem 2.2.

Theorem 2.3.

Theorem 2.4.

Theorem 2.5.

Proposition 3.1.

Lemma 3.2.

Lemma 3.3.

Lemma 3.4.

Lemma 3.5.

Theorem 3.6.

Theorem 3.7.

Corollary 3.8.