GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation

Nicolas Salvy; Hugues Talbot; Bertrand Thirion

arXiv:2602.16449·cs.LG·February 19, 2026

GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation

Nicolas Salvy, Hugues Talbot, Bertrand Thirion

PDF

Open Access

TL;DR

This paper introduces GICDM, a method to correct hubness in high-dimensional embedding spaces, improving the reliability of distance-based generative model evaluation and aligning better with human judgment.

Contribution

GICDM extends ICDM with a multi-scale approach to mitigate hubness effects, enhancing the accuracy of distance metrics for evaluating generative models.

Findings

01

GICDM effectively resolves hubness-induced failures.

02

It restores reliable metric behavior in high-dimensional spaces.

03

GICDM improves alignment with human judgment in evaluations.

Abstract

Generative model evaluation commonly relies on high-dimensional embedding spaces to compute distances between samples. We show that dataset representations in these spaces are affected by the hubness phenomenon, which distorts nearest neighbor relationships and biases distance-based metrics. Building on the classical Iterative Contextual Dissimilarity Measure (ICDM), we introduce Generative ICDM (GICDM), a method to correct neighborhood estimation for both real and generated data. We introduce a multi-scale extension to improve empirical behavior. Extensive experiments on synthetic and real benchmarks demonstrate that GICDM resolves hubness-induced failures, restores reliable metric behavior, and improves alignment with human judgment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification