Comparing the information content of probabilistic representation spaces

Kieran A. Murphy; Sam Dillavou; Dani S. Bassett

arXiv:2405.21042·cs.LG·February 20, 2025

Comparing the information content of probabilistic representation spaces

Kieran A. Murphy, Sam Dillavou, Dani S. Bassett

PDF

Open Access 1 Repo

TL;DR

This paper introduces new information-theoretic measures for comparing probabilistic representation spaces, enabling better understanding of learned representations and their information content across models and datasets.

Contribution

It extends classic comparison methods to probabilistic spaces, proposes a lightweight estimation technique, and demonstrates applications in disentanglement, model comparison, and fusion.

Findings

01

Identified recurring information fragments in latent dimensions of VAEs and InfoGANs.

02

Revealed consistent information content across models and datasets despite training variability.

03

Enabled model fusion by leveraging differentiable information measures.

Abstract

Probabilistic representation spaces convey information about a dataset and are shaped by factors such as the training data, network architecture, and loss function. Comparing the information content of such spaces is crucial for understanding the learning process, yet most existing methods assume point-based representations, neglecting the distributional nature of probabilistic spaces. To address this gap, we propose two information-theoretic measures to compare general probabilistic representation spaces by extending classic methods to compare the information content of hard clustering assignments. Additionally, we introduce a lightweight method of estimation that is based on fingerprinting a representation space with a sample of the dataset, designed for scenarios where the communicated information is limited to a few bits. We demonstrate the utility of these measures in three case…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

murphyka/representation-space-info-comparison
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Numerical Analysis Techniques · Generative Adversarial Networks and Image Synthesis

MethodsHuMan(Expedia)||How do I get a human at Expedia? · *Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Dense Connections · Feedforward Network · InfoGAN · Sparse Evolutionary Training