Explainable embeddings with Distance Explainer

Christiaan Meijer; E. G. Patrick Bos

arXiv:2505.15516·cs.LG·March 26, 2026

Explainable embeddings with Distance Explainer

Christiaan Meijer, E. G. Patrick Bos

PDF

Open Access

TL;DR

Distance Explainer provides a novel, post-hoc method for interpreting the meaning of distances in embedded vector spaces, improving transparency in models like CLIP and cross-modal embeddings.

Contribution

It introduces a new explanation technique for embedded spaces that adapts saliency methods to explain pairwise distances, filling a gap in XAI for vector representations.

Findings

01

Effectively identifies features influencing similarity/dissimilarity in embeddings

02

Maintains high robustness and consistency in explanations

03

Parameter tuning impacts explanation quality

Abstract

While eXplainable AI (XAI) has advanced significantly, few methods address interpretability in embedded vector spaces where dimensions represent complex abstractions. We introduce Distance Explainer, a novel method for generating local, post-hoc explanations of embedded spaces in machine learning models. Our approach adapts saliency-based techniques from RISE to explain the distance between two embedded data points by assigning attribution values through selective masking and distance-ranked mask filtering. We evaluate Distance Explainer on cross-modal embeddings (image-image and image-caption pairs) using established XAI metrics including Faithfulness, Sensitivity/Robustness, and Randomization. Experiments with ImageNet and CLIP models demonstrate that our method effectively identifies features contributing to similarity or dissimilarity between embedded data points while maintaining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Semantic Web and Ontologies

MethodsContrastive Language-Image Pre-training