Persistence Steenrod modules
Umberto Lupo, Anibal M. Medina-Mardones, Guillaume Tauzin

TL;DR
This paper introduces a new family of computable invariants called $Sq^k$-barcodes for mod 2 persistent cohomology, enhancing the discriminative power of barcode invariants in topological data analysis.
Contribution
It presents the first complete algorithmic pipeline for computing $Sq^k$-barcodes and demonstrates their applicability on molecular conformation data.
Findings
$Sq^k$-barcodes improve topological data analysis.
Algorithm successfully applied to cyclo-octane molecule.
Enhances the discriminative power of persistent invariants.
Abstract
It has long been envisioned that the strength of the barcode invariant of filtered cellular complexes could be increased using cohomology operations. Leveraging recent advances in the computation of Steenrod squares, we introduce a new family of computable invariants on mod 2 persistent cohomology termed -barcodes. We present a complete algorithmic pipeline for their computation and illustrate their real-world applicability using the space of conformations of the cyclo-octane molecule.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Data Visualization and Analytics · Digital Image Processing Techniques
Persistence Steenrod modules
Umberto Lupo
U.L., Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
,
Anibal M. Medina-Mardones
A.M-M., Max Planck Institute for Mathematics and University of Notre Dame
and
Guillaume Tauzin
G.T., INAIT SA
Abstract.
It has long been envisioned that the strength of the barcode invariant of filtered cellular complexes could be increased using cohomology operations. Leveraging recent advances in the computation of Steenrod squares, we introduce a new family of computable invariants on mod 2 persistent cohomology termed -barcodes. We present a complete algorithmic pipeline for their computation and illustrate their real-world applicability using the space of conformations of the cyclo-octane molecule.
Key words and phrases:
Persistent homology, Steenrod squares, Steenrod barcode, cyclo-octane molecule
2020 Mathematics Subject Classification:
55N31, 55S10, 62R40, 68T09
1. Introduction
Persistent homology is one of the primary tools in the rapidly developing field of topological data analysis. A motivating example for this technique is the study of a finite point cloud of data embedded in Euclidean space. From it, we can produce a collection of nested simplicial complexes
[TABLE]
For example, by taking the nerve of balls of uniformly increasing diameter whose centers are the given points. The homology construction provides us with a collection of linear maps
[TABLE]
This is an example of a (graded) persistence module and its barcode, a summary of the way Betti numbers are shared by consecutive simplicial complexes, serves as a principled and robust feature of the data. This invariant is effectively computable and some of the open source software developed for this end can be found in [Tau+21, The20, Bau21, Pér+21]. For an expository treatment of persistent homology, we refer the reader to [Car09] or [EH10].
Assuming is a field, a straightforward duality argument shows that the barcode of (1) is equivalent to the barcode of its persistent cohomology
[TABLE]
When is the field with elements, we can define for each an additional barcode naturally associated to (2) by consistently recording the ranks of Steenrod’s cohomology operation in the commutative diagram
[TABLE]
In this work we focus on the case where Steenrod operations are denoted and referred to as Steenrod squares, a term that comes from the fact that where is the cohomological degree of and denotes the cup product on cohomology. The ranks of Steenrod operations and in particular of Steenrod squares are able to detect finer information beyond the Betti numbers of a space. For example:
- (1)
The real projective plane and the wedge of a circle and a sphere have, with -coefficients, the same Betti numbers, yet the rank of tells them apart. 2. (2)
Similarly, the complex projective plane and the wedge of a 2-sphere and a 4-sphere have the same Betti numbers with any coefficients, yet the rank of distinguishes them. 3. (3)
The suspensions of the two spaces above have the same Betti numbers and also isomorphic cohomology rings, yet the rank of tells them apart.
The main contribution of this work is the theoretical and algorithmic developments needed to use the finer discriminatory power of Steenrod squares in persistent cohomology. Specifically, we define -barcodes and introduce a method for their computation. Using a performance oriented implementation of our methods, we present examples showing that the finer information they reveal is non-trivially present in the point cloud of conformations of the cyclo-octane molecule .
Outline
We begin in Section 2 with an overview of the basic notions used in the rest of this article. They concern homological algebra, simplicial topology, and persistence theory. In Section 3 we introduce the Steenrod squares through explicit formulas at the cochain level and discuss their relevance. In Section 4 we introduce persistence Steenrod modules and their -barcodes, and show how to computationally incorporate these invariants into the persistent cohomology pipeline. We present three examples in Section 5, including one obtained from data sampled from the space of conformations of . We close by providing conclusions in Section 6.
Acknowledgments
We would like to thank Ulrich Bauer, Prasit Bhattacharya, Matteo Caorsi, Kathryn Hess, Maximilian Schmahl, Gard Spreemann, Dennis Sullivan, and Ulrike Tillmann for their suggestions, questions, and comments about this project. We thank the reviewers for their careful reading and many keen suggestions. We are grateful to Ingrid Membrillo Solis for providing useful datasets and insight on the topic of molecular conformational spaces. We thank the hospitality of the Laboratory for Topology and Neuroscience at EPFL and acknowledge partial support from Innosuisse grant 32875.1 IP-ICT-1.
2. Conventions and preliminaries
We assume familiarity with the notions of chain complex over a ring and of its associated homology graded -module.
2.1. Tensor and hom complexes
In this subsection we review two natural chain complexes associated to any pair of chain complexes and .
The tensor product is the chain complex whose degree- part is
[TABLE]
where is the tensor product of -modules, and whose boundary map is defined by
[TABLE]
The hom complex is the chain complex whose degree- part is the subset of linear maps between them that increase degree by , i.e.,
[TABLE]
and boundary map defined by
[TABLE]
A chain map is a [math]-cycle in this chain complex, and two chain maps are chain homotopy equivalent if they are homologous cycles. We extend this terminology and say that two maps are homotopic if their difference is nullhomologous, referring to a map such that as a homotopy between them.
Regarding as a chain complex with [math]-part equal to and all other parts [math], the linear dual of a chain complex is the chain complex . For historical reasons we will use cohomological grading for the dual of a chain complex, placing the dual of a chain in degree also in degree instead of as would be more appropriate.
For any three chain complexes, there is a natural adjunction isomorphism:
[TABLE]
2.2. Invariants and coinvariants
Symmetries on chain complexes play an important role in this work. Let be a finite group. We will later focus solely on the symmetric group . We denote by the group ring of , i.e., the free -module generated by together with the ring product defined by linearly extending the group structure on . We refer to a chain complex of -modules as a chain complex with a -action.
To any chain complex with a -action we naturally associate the following two chain complexes. The subcomplex of invariant chains of , denoted by , contains all elements satisfying for every . The quotient complex of coinvariant chains of , denoted by , is the chain complex obtained by identifying elements if there exists such that .
Let and be chain complexes and assume has a -action. The chain complex has a -action induced from and there is an isomorphism:
[TABLE]
2.3. Simplicial complexes
Simplicial complexes are used to combinatorially encode the topology of spaces and occur naturally on real-world data.
An abstract and ordered simplicial complex, or a simplicial complex for short, is a pair of sets where is a poset and the elements of are non-empty finite subsets of , such that:
- (1)
The restriction of the partial order of to any element in defines a total order on it. 2. (2)
For every in , the singleton is in . 3. (3)
If is in and is a subset of , then is in .
We abuse notation and denote the pair simply by .
The elements of are called simplices and the dimension of a simplex is defined by subtracting from its cardinality. Simplices of dimension are called -simplices. We abuse terminology and refer to the elements of and to their associated [math]-simplices both as vertices.
A simplicial complex is a subcomplex of a simplicial complex if every simplex of is a simplex of . In this case we say that is a simplicial complex pair and write if and .
A filtered simplicial complex is a simplicial complex together with subcomplexes
[TABLE]
2.4. Simplicial cohomology
Let be a simplicial complex pair. Denoting the subsets of -dimensional simplices by and , the chain complex of relative chains of the pair is defined as follows: Its degree- part is
[TABLE]
i.e., the -module freely generated by the -dimensional simplices in modulo those in , and its differential, referred to as boundary map, is defined on basis elements by
[TABLE]
where is the operator that removes the element in with respect to the induced total order. We refer to simply as the absolute chains of and use the notation .
The relative cochains of the pair is the cochain complex defined explicitly by
[TABLE]
with
[TABLE]
We refer to as the absolute cochains of and use the notation for it. Notice that is isomorphic to the subspace of that vanish on . The cohomology of this pair, denoted by , is defines as the cohomology of .
Given pairs there are natural maps:
[TABLE]
respectively defined and induced by restriction.
Relative and absolute homology in the simplicial context is defined similarly but we do not use them in this work.
2.5. Persistence theory
In this subsection is assumed to be a field. We will now review the basic concepts of the theory of persistence over from a point of view that prioritizes persistent cohomology. We refer to [ZC05] or [SMV11] for a more detailed exposition.
The totally ordered set , known as extended integers, is the union of with two elements and such that
[TABLE]
for any integer .
A persistence module (over ) is a diagram of vector spaces and linear maps
[TABLE]
For in we denote by the unique composition in the diagram. We say that is pointwise finite-dimensional (p.f.d.) if the dimension of is finite for each .
A graded persistence module is a collection of persistence modules indexed by the integers. We say is p.f.d. if each is.
A morphism of persistence modules is a diagram of vector spaces and linear maps
[TABLE]
To any morphism of persistence modules we can naturally associate persistence modules corresponding to its kernel and image .
A multiset is a pair where is a set and is a function attaining only values greater than [math]. We refer to as the multiplicity of and define the cardinality of a multiset as
[TABLE]
if this sum is defined and otherwise. We sometimes regard sets as multisets with multiplicity function constant and equal to .
Let be a p.f.d. persistence module, its barcode is the multiset of pairs such that for any two extended integers
[TABLE]
The barcode is a complete invariant of p.f.d. persistence modules. We will sometimes use the notation . The finite and infinite parts of the barcode are defined by
[TABLE]
The barcode of a graded persistence module is the collection
[TABLE]
Given a filtered simplicial complex , define respectively its persistent relative and absolute cohomology by
[TABLE]
with linear maps induced by restriction. When and are clear from the context we omit them from the notation. We say is p.f.d. if either, and therefore both, of these are.
The barcodes of these persistence modules contain equivalent information. More precisely, as shown in [SMV11] or more categorically in [BS20], the finite parts of these are equal as graded multisets after a degree shift and there is a bijection of multisets between their infinite parts. Explicitly,
[TABLE]
Although we do not use them in this work, persistent relative and absolute homology can be defined similarly and shown to have barcodes containing the same information encoded by those above.
3. Steenrod squares
In this section we introduce the cohomology operations
[TABLE]
defined for any simplicial complexes pair and every integer . These operations are natural. In particular, for the diagram
[TABLE]
commutes. Therefore, as will be developed in Section 4, any defines an endomorphism of the persistent (absolute and relative) cohomology of a filtered complex.
3.1. History and definition
The diagonal map of spaces
[TABLE]
induces a product in cohomology with field coefficients
[TABLE]
which is (graded) commutative, since the diagonal is invariant under the transposition
[TABLE]
One can then ask if this product be defined with integer coefficients. During the mid 1930’s Alexander, Kolmogorov, Čech and Whitney [Whi88] defined the cup product dualizing a simplicial chain approximation to given by
[TABLE]
The chain map is not invariant under the transposition map
[TABLE]
that is to say, .
In 1947, Steenrod published his seminal paper [Ste47] introducing the square operations through an effective construction of “coherent homotopies” correcting the broken symmetry of (denoted by from now on). To explain this, let us consider the map as a [math]-cycle in , a chain complex with an -action induced from . The cup- coproduct, defined explicitly by
[TABLE]
is a boundary for this cycle (). The cup- coproduct corrects the lack of symmetry of homologically, but it is itself not symmetric. Steenrod gave formulae for higher corrections, the cup- coproducts , satisfying
[TABLE]
More abstractly, if is the minimal resolution of by free -modules
[TABLE]
he effectively constructed a natural equivariant chain map
[TABLE]
where denotes the chains of a simplicial complex. Passing to mod coefficients, Steenrod extracted from this construction finer invariants on the cohomology of spaces which we now review.
Using the linear duality functor on the map (7) and passing to invariant chains we have a chain map
[TABLE]
which we can complete, using the isomorphisms (3) and (4) of Section 2, to a commutative diagram
[TABLE]
where the choice of coefficients ensures the doubling map is linear. Using the adjunction isomorphism (3), the dashed arrow defines a linear map
[TABLE]
descending to mod homology, and the Steenrod square operations are defined by reindexing this map. Explicitly,
[TABLE]
The importance of Steenrod operations in stable homotopy theory is hard to overstate, see for example [Ada74]. For a more leisure exposition of the construction and properties of Steenrod squares we refer to, for example, [MT68].
Remark**.**
The name of these operations comes from the fact that where is the cohomological degree of and denotes the cup product on cohomology. The non-triviality of Steenrod squares is an obstruction to the existence of a commutative product of cocycles lifting .
Remark**.**
The operation agrees with the Bockstein homomorphism. Namely, the connecting homomorphism induced from the following exact sequence of coefficient
[TABLE]
Remark**.**
Steenrod square operations are parameterized by classes on the mod homology of . From this viewpoint, Steenrod defined operations at odd primes non-constructively using the mod homology of [SE62]. We do not treat these operations in the present paper.
3.2. Cup- formulas
Throughout the rest of this article we set the ground ring to be the field with two elements . We will describe explicitly a natural equivariant chain map
[TABLE]
or, equivalently, an equivariant chain map
[TABLE]
Let be a simplicial complex and . For a set
[TABLE]
we use the notation .
Definition 1** ([Med21b]).**
The simplicial cup- coproduct
[TABLE]
is the linear map defined on a basis element in dimension by
[TABLE]
where the sum is taken over all sets with and
[TABLE]
These formulas are in a sense dual to Steenrod’s original in [Ste47] but, as shown in [Med22], they are equivalent. We have the homological relation
[TABLE]
for any integer , and naturality for pairs making the diagram
[TABLE]
commute.
Definition 2**.**
Let be a pair of complexes. The -th Steenrod square
[TABLE]
is the linear map sending a class represented by a cocycle to the class represented by the cocycle whose value on is
[TABLE]
We notice that thanks to (8), the Steenrod square operations are natural for pairs , i.e., diagram (6) commutes.
Example 3**.**
Let us consider the model of the real projective plane presented in Figure 1 together with the cocycle , dual to
[TABLE]
representing the generator of . According to Definition 2, the cocycle represents the class Sq^{1}\big{(}[\alpha]\big{)}\in H^{2}(\mathbb{R}\mathrm{P}^{2}). Using Definition 1 and bilinearity, we are looking for basis elements appearing in with and such that . The cocycle is given by adding together for each such basis element. In our case, out of 25 basis elements appearing in only contributes a non-zero term and, therefore, Sq^{1}\big{(}[\alpha]\big{)} is represented by the cocycle dual to , and
[TABLE]
Remark**.**
The maps are deeply rooted in the combinatorics of simplices. To illustrate their primitive nature we mentioned that another fundamental construction can derive from them: the nerve of -categories [Str87, Med20a]. This is a reflection of a profound connection between convex geometry, higher category theory and Steenrod higher diagonals [KV91, LMV22].
Remark**.**
We have focused on simplicial complexes since they are better known and lead to faster computations, but there are also effective constructions of Steenrod cup- coproducts for cubical complexes [Kad03, KP16, KM21]. Our algorithms, presented in § 4.4, can be adapted using these to compute Steenrod barcodes of cubical complexes.
Remark**.**
To define Steenrod operations effectively at any prime , the cup- coproducts where generalized in [KM21a] to cup- coproducts for simplicial and cubical chains using the operadic methods of P. May [May70] and the model of the -operad introduced by the second named author [Med20, Med21a]. These have been implemented in the computer algebra system ComCH [Med21], and the incorporation of Steenrod operations at odd primes into the persistence pipeline is left to future work.
3.3. Self-intersections
From a geometric viewpoint, the cup product can be interpreted in terms of intersections of cycles in certain cases. For any space, Thom showed that every mod homology class is represented by the push-forward of the fundamental class of a closed manifold along some map to the space. Furthermore, if the target is a closed -manifold, and therefore satisfies Poincaré duality
[TABLE]
The cohomology class dual to the homology class represented by the intersection of two transverse maps and , or more precisely their pull-back , is the cohomology class where and are respectively dual to the homology classes represented by and .
By taking of cohomological degree , we have that Sq^{k}\big{(}[\alpha]\big{)}=[\alpha]\smile[\alpha] is represented by the transverse self-intersection of , that is, the intersection of this map and a generic perturbation of itself.
For example, let us consider the Torus and the Klein bottle . Two manifolds with the same mod 2 Betti numbers. These surfaces are distinguished by the fact that
[TABLE]
which we can verify geometrically, as depicted in Figure 2, by noticing that the transverse self-intersection of a map parallel to any boundary component is always an even number of disjoint points, whereas that of a map parallel to the untwisted boundary component is always an odd number.
In manifold topology, the relationship at the (co)homology level between cup product and intersection is classical. For a comparison between these at the level of (co)chain see [FMS21].
3.4. Relations and further structure
The cup- coproducts or, equivalently, their linear dual products, arise from effectively constructing coboundaries that coherently enforce the commutativity relation of the cup product in cohomology. This is an example of a general principle: constructing cochains enforcing cohomological relations lead to further cohomological structures. In our case, the commutativity relation of cup product gives rise to Steenrod operations.
There are two notable relations satisfied by the Steenrod square operations. The first one, known as the Cartan relation, expresses the interaction between these operations and the cup product:
[TABLE]
whereas the second, the Adem relation [Ade52], expresses dependencies appearing among the iteration of operations:
[TABLE]
where denotes the integer part function and the binomial coefficient is reduced mod .
To tap into the secondary structure associated with these relations, one needs to provide effective proofs for them, that is to say, construct explicit cochains that enforce the relations when passing to cohomology. Such effective proofs were recently given respectively in [Med20b] and [BMM21], and we expect that the additional structure they unlock will also play a role in applied topology.
4. Persistence Steenrod modules
In this section we define persistence Steenrod modules and their associated barcodes. We also introduce, for finite filtered simplicial complexes, a complete algorithmic pipeline for their computation.
Definition 4**.**
A persistence Steenrod module (over ) is a graded persistence module of vector spaces together with a degree- endomorphism
[TABLE]
for each integer , such that satisfies the Adem relations (9) for every .
Similar to how persistence modules can be thought of as modules over the polynomial algebra , persistence Steenrod modules correspond to modules over the algebra where is the Steenrod algebra, the free algebra generated by symbols modulo the ideal generated by the Adem relations.
4.1. Steenrod barcode
The following computable invariant of persistence Steenrod modules is central to applications.
Definition 5**.**
For any integer , the -barcode of a persistence Steenrod module , denoted by , is the barcode of the image persistence module . We refer to the collection of all of these as the Steenrod barcode of .
The following example illustrates that, unlike barcodes of regular persistence modules, Steenrod barcodes of persistence Steenrod modules are not a complete invariant. Let be the graded vector space given by
[TABLE]
and equal to [math] in all other degrees. Let be the graded persistence module
[TABLE]
We make into a persistence Steenrod module in two non-isomorphic ways, but with the same Steenrod barcodes, by defining
[TABLE]
and
[TABLE]
We thank Prasit Bhattacharya for suggesting this example.
The most prominent examples of persistence Steenrod modules are given by persistent relative and absolute cohomology of a filtered complex . In this case, denoting both and by , we have
[TABLE]
since is the identity.
4.2. Duality
The following example illustrates that, unlike the case of regular barcodes discussed in (5), the Steenrod barcode of persistent relative and absolute cohomology need not completely determine each other. Let be the Möbius band and consider the filtration where the circle is included as the boundary of . Given that is isomorphic to the reduced absolute cohomology of the real projective plane , and that is homotopy equivalent to its central circle, one can verify that the Steenrod barcode of absolute cohomology is empty but that of relative cohomology is not.
An important case where the Steenrod barcodes of persistent relative and absolute cohomology determine each other is when there are only finite bars in their regular barcodes. More precisely, let be a p.f.d. filtered complex such that for some integer either
[TABLE]
or, equivalently,
[TABLE]
Then, for every integer there is a bijection of multisets
[TABLE]
We illustrate the argument in an example that contains all the ideas of the proof. For complete details we refer to the study of dualities in the categorical framework presented in [BS20]. Consider and a two stage filtration or, more explicitly,
[TABLE]
and the diagram
[TABLE]
where the horizontal maps are part of the long exact sequence of the pair . Consider with . Since all regular bars are finite so and we have an -bar . By exactness, for some where is the connecting homomorphism. Since these commute with Steenrod squares, we have . Furthermore, is not in the image of since otherwise would be [math]. Therefore, there is a -bar .
Conversely, given with the finiteness assumption implies that so we have a -bar . Denote by the element and notice that with exactness implying , so we have a -bar .
4.3. Truncations
Given a filtered complex and an integer there are two naturally associated filtered complexes and defined respectively by
[TABLE]
and referred to as the above and below truncations at . Persistent relative (resp. absolute) cohomology behaves well with respect to above (resp. below) truncations. Explicitly, there exist canonical inclusions
[TABLE]
and
[TABLE]
We remark that this form of “stability” of Steenrod barcodes may fail when considering persistent relative (resp. absolute) cohomology and below (resp. above) truncations. For example, consider the filtration , where is the cone on the Möbius band. The Steenrod barcode of the relative absolute cohomology of this filtration is empty whereas, as discussed at the beginning of § 4.2, its below truncation at is not.
4.4. Computing the Steenrod barcode
In this subsection we provide algorithms to compute the Steenrod barcode of the persistent relative cohomology of a finite filtered simplicial complex
[TABLE]
together with a total order of its elements
[TABLE]
such that for all we have
[TABLE]
Most of this pipeline is applicable to other filtered cellular complexes, with the exception of Algorithm 2.
4.4.1. Regular barcode
Let us begin by reviewing an effective construction of the barcode of the persistent relative cohomology of . Let be the matrix representing
[TABLE]
in the canonical ordered basis . We index columns and rows in this matrix starting at [math], and denote for all . Consider defined by
[TABLE]
Notice that represents the coboundary of .
Applying to a version of Algorithm 1 in Figure 3 that remembers the performed operations we produce a reduced matrix and an upper triangular invertible matrix satisfying
[TABLE]
Denoting the -th column of by , let
[TABLE]
There exists a canonical bijection between the union of and , and the barcode of persistent relative cohomology given by
[TABLE]
that provides a preferred cocycle representative for each of these bars:
[TABLE]
More specifically, a basis for thought of as a subspace in the direct sum
[TABLE]
is given by the set of cochains corresponding to the vectors in the union of
[TABLE]
and a basis for is given by
[TABLE]
4.4.2. Steenrod barcode
We now describe an effective construction of the Steenrod barcode of the persistent relative cohomology of . For any integer , let be an algorithm taking as input a vector corresponding to a cochain and producing the vector corresponding to the cochain
[TABLE]
Such an algorithm, based on the explicit formulas of Section 3, is presented as Algorithm 2 in Figure 4. Let be the square matrix with columns given by
[TABLE]
For matrices and of dimensions and we define the matrix by
[TABLE]
We now have all the elements needed to introduce Algorithm 3 in Figure 5 whose output is the -barcode of the persistent relative cohomology of . Intuitively, the step from to either adds a new non-zero coboundary (which implies ) or the image of a persistent cocycle generator (which implies ). In either case, we need to reduce with respect to the subspace of coboundaries, generated by , the image of , which is generated by . This process is done keeping track of when columns in become zero and extracting from this information the -barcode of the filtration.
We leave the development of a pipeline for persistent absolute cohomology to future work, remarking that, as described in (10), its associated Steenrod barcode is equal to that of persistent relative cohomology if all regular bars are finite, a situation often countered in practice.
5. Examples
To demonstrate the feasibility of extracting Steenrod barcodes from realistic datasets using the computational pipeline described in Section 4, we have produced two open-source software implementations: one111Available at https://github.com/Steenroder/steenroder. is a Python package optimized by means of the Numba library [LPS15], and the other222Available at https://github.com/Steenroder/steenroder_cpp is a performance-oriented C**+****+** package inspired by the PHAT library [Bau+14].
While detailed performance benchmarking is beyond the scope of this paper, some remarks are in order. First, we note that both our implementations apply the clearing optimization [CK11] to Algorithm 1. It is well-known (see e.g. the discussion in [Bau21]) that clearing is particularly effective when computing relative persistent cohomology – and even more so when the filtration is constructed via a Vietoris–Rips process. Second, although the computation of matrix (input to Algorithm 3) is in principle embarrassingly parallelizable by tasking fully independent threads with the calculation of different columns, we have not yet pursued this path in our code.
Third, we expect (and observe experimentally) that “sparsifying” our filtrations via simplicial collapses [Pri20, BP20], operations that preserve the homotopy type of each complex, can lead to a cascade of space and time improvements across our computational pipeline. This is presumably because:
- (1)
the run-time and memory usage in Algorithm 1 is reduced, yielding sparser and matrices and hence cocycle representatives with smaller sizes on average; 2. (2)
the outer forall loop in Algorithm 2 becomes faster for smaller cocycles; 3. (3)
the leaner matrix reduces the computational run-time and memory usage once again in the final Steenrod barcode computation, Algorithm 3.
Item 2 above deserves more emphasis: our experiments suggest that, in typical datasets, the main bottleneck in the entire pipeline is the computation of for a few exceptionally sizable cocycle representatives. This is due to the quadratic complexity of the forall loop in Algorithm 2. By replacing the largest cocycle representatives returned by (any implementation of) Algorithm 1 with cohomologous ones with a smaller size, one could presumably alleviate this problem. Our preliminary attempts using right-to-left reductions on the matrix output by Algorithm 1 have yielded promising results; in the future, we hope to further improve our implementations in this direction, as well as making it easily accessible through its incorporation into giotto-tda [Tau+21].
We now report the results of computing -barcodes in a synthetic and a natural dataset. In both cases we start from a point cloud and construct an associated filtered simplicial complexes through the Vietoris–Rips process with a fixed simplex dimension threshold of 3 (simplices with 4 or less vertices) and some distance threshold. We close this section with a comparison of the -barcode of two filtered complex models of the cone on the suspension of, respectively, and . Our experiments are fully reproducible as Jupyter notebooks.1
5.1. Flat Klein bottle
Our first example is constructed from a matrix of geodesic distances among points in a metrically flat Klein bottle. This is the Riemannian manifold obtained from with its usual metric via the equivalence relation . To define this point cloud we selected points corresponding to the vertices of a square grid inside the unit square .
5.1.1. Persistent relative cohomology
Let be the Vietoris–Rips filtered complex associated to this point cloud with distance threshold . We apply our pipeline to compute the regular barcode of for and their associated -barcode. The results are presented in Figure 6. There are three infinite bars: two in degree – which happen to have identical birth and death due to the symmetry in our construction – and one in degree . Our implementation detects an infinite bar in . This cohomological profile agrees with that expected from a filtered Klein bottle.
5.1.2. Persistent absolute cohomology
Let be the Vietoris–Rips complex obtained from the same point cloud with no distance threshold. In Figure 7 we present the regular barcode of for and associated -barcode, obtained using our pipeline and the duality of Steenrod persistent cohomology (§ 4.2). Our implementation detects a single Steenrod bar in , which is born with the bars and dies with the bar. Once again, this cohomological profile is consistent with that of a filtered Klein bottle.
5.2. Conformational space of cyclo-octane
Our second example involves a sampling of the conformational space of the cyclo-octane molecule . We started with a dataset, originally from [Mar+10], which consists of vectors in . Each of these vectors collects the 3D coordinates of all carbon atoms in a given cyclo-octane conformation after alignment to a reference one. In [Mar+10], this dataset was used to argue that the full conformational space of cyclo-octane is not a manifold, being in fact the union of a -sphere with a Klein bottle glued together along two circles of singularities. The reader can consult the papers [Mem+19, AM21] for further details and references.
Candidate singular points in this dataset can be identified in a variety of ways; we used a set of singular points isolated in [Sto+20] via local persistent cohomology.333Data retrieved from https://github.com/stolzbernadette/Geometric-Anomalies. We removed these points from the dataset, and clustered the remaining points using the HDBSCAN algorithm [CMS13] to obtain (samplings of) four -strata – presumably corresponding to a dense open subset of the Klein bottle, and three open connected subsets of the -sphere.
As computed in [Mem+19], the persistent absolute homology of the Vietoris–Rips filtered complex associated to the point cloud with elements supported on the presumed Klein bottle has, excluding the basic bar from , three prominent bars in its barcode. Their birth and death values occur respectively before and after the value and two come from while the other from .
5.2.1. Persistent relative cohomology
We study the persistent relative cohomology of , the Vietoris–Rips filtered complex associated to this point cloud with distance thresholds . In Figure 8 we show, discarding short-lived bars () for ease of visualization, the regular barcode of for and their associated -barcode. Our implementation detects two -bars. One is infinite and born with one of the two infinite bars, while the other is born with the other bar and dies with the most prominent finite bar. The infinite parts of these barcodes are consistent with a filtered Klein bottle, where one of the infinite degree 1 bars interacts non-trivially with the degree 2 one. The finite Steenrod bar adds extra information revealing a non-trivial interaction between the other infinite degree bar and a finite degree bar. Refinements to the model for the conformation space of the cyclo-octane molecule resulting from the incorporation of this finer feature go beyond the scope of this example and are left unexplored.
5.2.2. Persistent absolute cohomology
For this data set we will also compute a Steenrod barcode for absolute persistent cohomology. We will use a distance threshold chosen to be larger than the death value of all prominent features. To do so we consider a subsample consisting of randomly selected points. The persistent homology barcode of the associated Vietoris–Rips filtered complex with no distance threshold contains three prominent bars, and their death values are all less than . As expected, two of these are associated to and the other to . Let be the the Vietoris–Rips filtered complex with distance thresholds obtained from this subsample. We remark that the threshold chosen ensures a correspondence between persistent relative and absolute cohomology of the Steenrod bars associated to their prominent features (§ 4.2). In Figure 9 we show, discarding no bars, the regular barcode of for and associated -barcode. As expected, the interaction between the more prominent bars witnessed by the Steenrod barcode is consistent with a filtered Klein Bottle.
5.3. Complex projective space and a wedge of spheres
We conclude this section comparing the -barcodes of the persistent absolute cohomology of two filtered simplicial complexes. On one hand, we have a filtration of the cone on the suspension of the complex projective space , and, on the other, one of , where, as usual, denotes the -dimensional sphere. As mentioned in the introduction, and have isomorphic cohomology rings over any coefficients, but they can be distinguished by the action of on their mod 2 cohomology.
Interpreting bars as points in the plane, we plot in Figure 10 the regular and -barcodes of the persistent absolute cohomology of these filtrations. After rescaling by the number of simplices, we can see that the regular barcodes, symbolized by colored circles, are very similar; yet there is a -bar, represented by a brown diamond, present in the second figure only.
6. Conclusion
Steenrod barcodes increase the discriminatory power of traditional barcodes, providing finer computable topological invariants of filtered spaces. Furthermore, as we showed using the conformation space of , the additional information these invariants reveal is non-trivially present in real-world examples.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Ada 74] J.. Adams “Stable homotopy and generalised homology”, Chicago Lectures in Mathematics University of Chicago Press, Chicago, Ill.-London, 1974, pp. x+373 URL: https://www.worldcat.org/title/stable-homotopy-and-generalised-homology/oclc/824645259&referer=brief_results
- 2[Ade 52] José Adem “The iteration of the Steenrod squares in algebraic topology” In Proc. Nat. Acad. Sci. U.S.A. 38 , 1952, pp. 720–726 DOI: 10.1073/pnas.38.8.720 · doi ↗
- 3[AM 21] Henry Adams and Michael Moy “Topology Applied to Machine Learning: From Global to Local” In Frontiers in Artificial Intelligence 4 , 2021, pp. 54 DOI: 10.3389/frai.2021.668302 · doi ↗
- 4[Bau+14] Ulrich Bauer, Michael Kerber, Jan Reininghaus and Hubert Wagner “PHAT – Persistent Homology Algorithms Toolbox” In Mathematical Software – ICMS 2014 Berlin, Heidelberg: Springer Berlin Heidelberg, 2014, pp. 137–143 DOI: 10.1016/j.jsc.2016.03.008 · doi ↗
- 5[Bau 21] Ulrich Bauer “Ripser: efficient computation of Vietoris-Rips persistence barcodes” In J. Appl. Comput. Topol. 5.3 , 2021, pp. 391–423 DOI: 10.1007/s 41468-021-00071-5 · doi ↗
- 6[BMM 21] Greg Brumfiel, Anibal Medina-Mardones and John Morgan “A cochain level proof of Adem relations in the mod 2 Steenrod algebra” In J. Homotopy Relat. Struct. , 2021 DOI: 10.1007/s 40062-021-00287-3 · doi ↗
- 7[BP 20] Jean-Daniel Boissonnat and Siddharth Pritam “Edge Collapse and Persistence of Flag Complexes” In 36th International Symposium on Computational Geometry (So CG 2020) 164 , Leibniz International Proceedings in Informatics (LIP Ics) Dagstuhl, Germany: Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2020, pp. 19:1–19:15 DOI: 10.4230/LIP Ics.So CG.2020.19 · doi ↗
- 8[BS 20] Ulrich Bauer and Maximilian Schmahl “The Structure of Morphisms in Persistent Homology, I. Functorial Dualities” In ar Xiv e-prints , 2020 ar Xiv: 2012.12881
