Attributable Visual Similarity Learning
Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu

TL;DR
This paper introduces an attributable visual similarity learning framework that models image similarity using graphs and semantic hierarchies, enhancing accuracy and interpretability over traditional point-based methods.
Contribution
It presents a novel graph-based similarity representation and a hierarchical inference method that improves explainability and accuracy in visual similarity learning.
Findings
Significant performance improvements on benchmark datasets.
Enhanced interpretability through similarity attribution.
Effective correction of unreliable similarity nodes.
Abstract
This paper proposes an attributable visual similarity learning (AVSL) framework for a more accurate and explainable similarity measure between images. Most existing similarity learning methods exacerbate the unexplainability by mapping each sample to a single point in the embedding space with a distance metric (e.g., Mahalanobis distance, Euclidean distance). Motivated by the human semantic similarity cognition, we propose a generalized similarity learning paradigm to represent the similarity between two images with a graph and then infer the overall similarity accordingly. Furthermore, we establish a bottom-up similarity construction and top-down similarity inference framework to infer the similarity based on semantic hierarchy consistency. We first identify unreliable higher-level similarity nodes and then correct them using the most coherent adjacent lower-level similarity nodes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Image Retrieval and Classification Techniques · Machine Learning in Healthcare
