A Filter of Minhash for Image Similarity Measures
Jun Long, Qunfeng Liu, Xinpan Yuan, Chengyuan Zhang, Junfeng Liu

TL;DR
This paper introduces a dynamic threshold filter for Minwise Hashing that accelerates large-scale image similarity searches by reducing unnecessary comparisons, and extends to other hashing algorithms with similar statistical properties.
Contribution
It proposes a novel filter for Minwise Hashing that improves efficiency in image similarity measures and can be applied to other binomial distribution-based hashing algorithms.
Findings
The filter significantly reduces computation time in image similarity searches.
The filter is proven correct and effective through experiments on real datasets.
Extension of the filter to other hashing algorithms like b-Bit Minwise Hashing.
Abstract
Image similarity measures play an important role in nearest neighbor search and duplicate detection for large-scale image datasets. Recently, Minwise Hashing (or Minhash) and its related hashing algorithms have achieved great performances in large-scale image retrieval systems. However, there are a large number of comparisons for image pairs in these applications, which may spend a lot of computation time and affect the performance. In order to quickly obtain the pairwise images that theirs similarities are higher than the specific threshold T (e.g., 0.5), we propose a dynamic threshold filter of Minwise Hashing for image similarity measures. It greatly reduces the calculation time by terminating the unnecessary comparisons in advance. We also find that the filter can be extended to other hashing algorithms, on when the estimator satisfies the binomial distribution, such as b-Bit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Video Surveillance and Tracking Methods
