When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
Beatrix M. G. Nielsen, Emanuele Marconato, Andrea Dittadi, Luigi Gresele

TL;DR
This paper investigates the relationship between distributional closeness and representational similarity in neural networks, revealing that small distributional divergence does not guarantee similar representations, and proposing a new distributional distance measure.
Contribution
It introduces an identifiability-based framework to analyze when distributional similarity implies representational similarity and proposes a new distance measure that aligns these concepts.
Findings
Small KL divergence does not ensure similar representations.
Models with high likelihood can learn dissimilar representations.
Wider networks tend to have closer distributions and more similar representations.
Abstract
When and why representations learned by different deep neural networks are similar is an active research topic. We choose to address these questions from the perspective of identifiability theory, which suggests that a measure of representational similarity should be invariant to transformations that leave the model distribution unchanged. Focusing on a model family which includes several popular pre-training approaches, e.g., autoregressive language models, we explore when models which generate distributions that are close have similar representations. We prove that a small Kullback--Leibler divergence between the model distributions does not guarantee that the corresponding representations are similar. This has the important corollary that models with near-maximum data likelihood can still learn dissimilar representations -- a phenomenon mirrored in our experiments with models trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
