Central Similarity Quantization for Efficient Image and Video Retrieval
Li Yuan, Tao Wang, Xiaopeng Zhang, Francis EH Tay, Zequn Jie, Wei Liu,, Jiashi Feng

TL;DR
This paper introduces a global similarity metric called central similarity for hash learning, which improves efficiency and accuracy in image and video retrieval by encouraging similar data to converge to common centers.
Contribution
It proposes a novel central similarity metric and hash center construction method using Hadamard matrices, enhancing hash code cohesion and dispersion for retrieval tasks.
Findings
Achieves 3-20% mAP improvement over state-of-the-art methods.
Effective in both image and video hashing scenarios.
Provides an efficient way to construct well-separated hash centers.
Abstract
Existing data-dependent hashing methods usually learn hash functions from pairwise or triplet data relationships, which only capture the data similarity locally, and often suffer from low learning efficiency and low collision rate. In this work, we propose a new \emph{global} similarity metric, termed as \emph{central similarity}, with which the hash codes of similar data pairs are encouraged to approach a common center and those for dissimilar pairs to converge to different centers, to improve hash learning efficiency and retrieval accuracy. We principally formulate the computation of the proposed central similarity metric by introducing a new concept, i.e., \emph{hash center} that refers to a set of data points scattered in the Hamming space with a sufficient mutual distance between each other. We then provide an efficient method to construct well separated hash centers by leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Central Similarity Quantization for Efficient Image and Video Retrieval· youtube
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
