Gram-MMD: A Texture-Aware Metric for Image Realism Assessment
Jo\'e Napolitano, Pascal Nguyen

TL;DR
Gram-MMD (GMMD) is a new image realism metric that captures fine-grained textural and structural features using Gram matrices and MMD, outperforming existing metrics in various datasets and scenarios.
Contribution
We introduce GMMD, a texture-aware realism metric leveraging Gram matrices and MMD, with a hyperparameter selection protocol and extensive experiments demonstrating its effectiveness.
Findings
GMMD captures finer textural details than semantic-level metrics.
GMMD maintains correct realism rankings in cross-domain scenarios.
Experiments show GMMD's effectiveness across multiple backbone architectures.
Abstract
Evaluating the realism of generated images remains a fundamental challenge in generative modeling. Existing distributional metrics such as the Frechet Inception Distance (FID) and CLIP-MMD (CMMD) compare feature distributions at a semantic level but may overlook fine-grained textural information that can be relevant for distinguishing real from generated images. We introduce Gram-MMD (GMMD), a realism metric that leverages Gram matrices computed from intermediate activations of pretrained backbone networks to capture correlations between feature maps. By extracting the upper-triangular part of these symmetric Gram matrices and measuring the Maximum Mean Discrepancy (MMD) between an anchor distribution of real images and an evaluation distribution, GMMD produces a representation that encodes textural and structural characteristics at a finer granularity than global embeddings. To select…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
