Explainable embeddings with Distance Explainer
Christiaan Meijer, E. G. Patrick Bos

TL;DR
Distance Explainer provides a novel, post-hoc method for interpreting the meaning of distances in embedded vector spaces, improving transparency in models like CLIP and cross-modal embeddings.
Contribution
It introduces a new explanation technique for embedded spaces that adapts saliency methods to explain pairwise distances, filling a gap in XAI for vector representations.
Findings
Effectively identifies features influencing similarity/dissimilarity in embeddings
Maintains high robustness and consistency in explanations
Parameter tuning impacts explanation quality
Abstract
While eXplainable AI (XAI) has advanced significantly, few methods address interpretability in embedded vector spaces where dimensions represent complex abstractions. We introduce Distance Explainer, a novel method for generating local, post-hoc explanations of embedded spaces in machine learning models. Our approach adapts saliency-based techniques from RISE to explain the distance between two embedded data points by assigning attribution values through selective masking and distance-ranked mask filtering. We evaluate Distance Explainer on cross-modal embeddings (image-image and image-caption pairs) using established XAI metrics including Faithfulness, Sensitivity/Robustness, and Randomization. Experiments with ImageNet and CLIP models demonstrate that our method effectively identifies features contributing to similarity or dissimilarity between embedded data points while maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Semantic Web and Ontologies
MethodsContrastive Language-Image Pre-training
