Toward a Generalization Metric for Deep Generative Models

Hoang Thanh-Tung; Truyen Tran

arXiv:2011.00754·cs.LG·May 25, 2021

Toward a Generalization Metric for Deep Generative Models

Hoang Thanh-Tung, Truyen Tran

PDF

Open Access 1 Repo

TL;DR

This paper evaluates existing metrics for measuring the generalization of deep generative models, finds their limitations, and proposes a new, more robust metric based on the Minimum Description Length principle that better detects memorization.

Contribution

It introduces a framework for comparing evaluation metrics, analyzes their robustness, and proposes a novel metric based on MDL to accurately assess model generalization.

Findings

01

Existing metrics can be fooled by memorization

02

Better metric scores do not necessarily mean better generalization

03

The proposed MDL-based metric effectively detects memorization

Abstract

Measuring the generalization capacity of Deep Generative Models (DGMs) is difficult because of the curse of dimensionality. Evaluation metrics for DGMs such as Inception Score, Fr\'echet Inception Distance, Precision-Recall, and Neural Net Divergence try to estimate the distance between the generated distribution and the target distribution using a polynomial number of samples. These metrics are the target of researchers when designing new models. Despite the claims, it is still unclear how well can they measure the generalization capacity of a generative model. In this paper, we investigate the capacity of these metrics in measuring the generalization capacity. We introduce a framework for comparing the robustness of evaluation metrics. We show that better scores in these metrics do not imply better generalization. They can be fooled easily by a generator that memorizes a small subset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

htt210/GeneralizationMetricGAN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Explainable Artificial Intelligence (XAI)