A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation
Dario Serez, Marco Cristani, Alessio Del Bue, Vittorio Murino, Pietro Morerio

TL;DR
This paper introduces a mutual information-based framework to analyze and improve multiple latent variable generative models, enhancing their understanding and application in self-supervised learning with diverse synthetic data generation.
Contribution
It provides a systematic mutual information analysis of MLVGMs, introduces a new data generation method for SSCRL, and proposes a continuous sampling strategy to increase data variability.
Findings
Mutual information reveals underutilized latent variables in current models.
Generated views from MLVGMs can match or surpass real data in quality.
The proposed methods improve self-supervised learning performance.
Abstract
In image generation, Multiple Latent Variable Generative Models (MLVGMs) employ multiple latent variables to gradually shape the final images, from global characteristics to finer and local details (e.g., StyleGAN, NVAE), emerging as powerful tools for diverse applications. Yet their generative dynamics remain only empirically observed, without a systematic understanding of each latent variable's impact. In this work, we propose a novel framework that quantifies the contribution of each latent variable using Mutual Information (MI) as a metric. Our analysis reveals that current MLVGMs often underutilize some latent variables, and provides actionable insights for their use in downstream applications. With this foundation, we introduce a method for generating synthetic data for Self-Supervised Contrastive Representation Learning (SSCRL). By leveraging the hierarchical and disentangled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsDense Connections · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Feedforward Network · R1 Regularization · Adaptive Instance Normalization · StyleGAN
