HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

Sudhanva Manjunath Athreya; Paul Rosen

arXiv:2512.07988·cs.LG·April 8, 2026

HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

Sudhanva Manjunath Athreya, Paul Rosen

PDF

TL;DR

HOLE is a novel topological analysis method using persistent homology to interpret neural network representations, revealing class separation and robustness patterns.

Contribution

Introduces HOLE, a topological approach for analyzing neural network embeddings through persistent homology and visualization tools.

Findings

01

Topological features correlate with class separation.

02

Analysis reveals insights into feature disentanglement.

03

Method enhances understanding of model robustness.

Abstract

Deep learning models have achieved remarkable success across various domains, yet their learned representations and decision-making processes remain largely opaque and hard to interpret. This work introduces HOLE (Homological Observation of Latent Embeddings), a method for analyzing and interpreting discriminative neural networks through persistent homology. HOLE extracts topological features from intermediate activations and presents them using a suite of visualization techniques, including cluster flow diagrams, blob graphs, and heatmap dendrograms. These tools facilitate the examination of representation structure and quality across layers. We evaluate HOLE using a range of discriminative models, focusing on representation quality, interpretability across layers, and robustness to input perturbations and model compression. The results indicate that topological analysis reveals…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.