A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation

Dario Serez; Marco Cristani; Alessio Del Bue; Vittorio Murino; Pietro Morerio

arXiv:2501.13718·cs.CV·January 1, 2026

A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation

Dario Serez, Marco Cristani, Alessio Del Bue, Vittorio Murino, Pietro Morerio

PDF

Open Access 1 Models

TL;DR

This paper introduces a mutual information-based framework to analyze and improve multiple latent variable generative models, enhancing their understanding and application in self-supervised learning with diverse synthetic data generation.

Contribution

It provides a systematic mutual information analysis of MLVGMs, introduces a new data generation method for SSCRL, and proposes a continuous sampling strategy to increase data variability.

Findings

01

Mutual information reveals underutilized latent variables in current models.

02

Generated views from MLVGMs can match or surpass real data in quality.

03

The proposed methods improve self-supervised learning performance.

Abstract

In image generation, Multiple Latent Variable Generative Models (MLVGMs) employ multiple latent variables to gradually shape the final images, from global characteristics to finer and local details (e.g., StyleGAN, NVAE), emerging as powerful tools for diverse applications. Yet their generative dynamics remain only empirically observed, without a systematic understanding of each latent variable's impact. In this work, we propose a novel framework that quantifies the contribution of each latent variable using Mutual Information (MI) as a metric. Our analysis reveals that current MLVGMs often underutilize some latent variables, and provides actionable insights for their use in downstream applications. With this foundation, we introduce a method for generating synthetic data for Self-Supervised Contrastive Representation Learning (SSCRL). By leveraging the hierarchical and disentangled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
SerezD/mi_ml_gen
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques

MethodsDense Connections · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Feedforward Network · R1 Regularization · Adaptive Instance Normalization · StyleGAN