HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability
Sudhanva Manjunath Athreya, Paul Rosen

TL;DR
HOLE is a novel topological analysis method using persistent homology to interpret neural network representations, revealing class separation and robustness patterns.
Contribution
Introduces HOLE, a topological approach for analyzing neural network embeddings through persistent homology and visualization tools.
Findings
Topological features correlate with class separation.
Analysis reveals insights into feature disentanglement.
Method enhances understanding of model robustness.
Abstract
Deep learning models have achieved remarkable success across various domains, yet their learned representations and decision-making processes remain largely opaque and hard to interpret. This work introduces HOLE (Homological Observation of Latent Embeddings), a method for analyzing and interpreting discriminative neural networks through persistent homology. HOLE extracts topological features from intermediate activations and presents them using a suite of visualization techniques, including cluster flow diagrams, blob graphs, and heatmap dendrograms. These tools facilitate the examination of representation structure and quality across layers. We evaluate HOLE using a range of discriminative models, focusing on representation quality, interpretability across layers, and robustness to input perturbations and model compression. The results indicate that topological analysis reveals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
