A New Family of Near-metrics for Universal Similarity
Chu Wang, Iraj Saniee, William S. Kennedy, Chris A. White

TL;DR
This paper introduces a family of near-metrics based on local graph diffusion that effectively measures similarity across diverse data types, outperforming traditional methods in various applications.
Contribution
The paper presents a novel family of near-metrics derived from local graph diffusion, broadening similarity measurement capabilities beyond traditional metric constraints.
Findings
Normalized forward k-step diffusion performs well on structured data.
Reverse k-step diffusion excels with deep learning vector representations.
Near-metrics outperform traditional similarity measures in experiments.
Abstract
We propose a family of near-metrics based on local graph diffusion to capture similarity for a wide class of data sets. These quasi-metametrics, as their names suggest, dispense with one or two standard axioms of metric spaces, specifically distinguishability and symmetry, so that similarity between data points of arbitrary type and form could be measured broadly and effectively. The proposed near-metric family includes the forward k-step diffusion and its reverse, typically on the graph consisting of data objects and their features. By construction, this family of near-metrics is particularly appropriate for categorical data, continuous data, and vector representations of images and text extracted via deep learning approaches. We conduct extensive experiments to evaluate the performance of this family of similarity measures and compare and contrast with traditional measures of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
