Toward a Generalization Metric for Deep Generative Models
Hoang Thanh-Tung, Truyen Tran

TL;DR
This paper evaluates existing metrics for measuring the generalization of deep generative models, finds their limitations, and proposes a new, more robust metric based on the Minimum Description Length principle that better detects memorization.
Contribution
It introduces a framework for comparing evaluation metrics, analyzes their robustness, and proposes a novel metric based on MDL to accurately assess model generalization.
Findings
Existing metrics can be fooled by memorization
Better metric scores do not necessarily mean better generalization
The proposed MDL-based metric effectively detects memorization
Abstract
Measuring the generalization capacity of Deep Generative Models (DGMs) is difficult because of the curse of dimensionality. Evaluation metrics for DGMs such as Inception Score, Fr\'echet Inception Distance, Precision-Recall, and Neural Net Divergence try to estimate the distance between the generated distribution and the target distribution using a polynomial number of samples. These metrics are the target of researchers when designing new models. Despite the claims, it is still unclear how well can they measure the generalization capacity of a generative model. In this paper, we investigate the capacity of these metrics in measuring the generalization capacity. We introduce a framework for comparing the robustness of evaluation metrics. We show that better scores in these metrics do not imply better generalization. They can be fooled easily by a generator that memorizes a small subset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Explainable Artificial Intelligence (XAI)
