Pigeonring: A Principle for Faster Thresholded Similarity Search
Jianbin Qin, Chuan Xiao

TL;DR
The paper introduces the pigeonring principle, a stronger constraint-based approach that accelerates thresholded similarity search by improving candidate filtering over traditional pigeonhole principle methods.
Contribution
It proposes the pigeonring principle, extending the pigeonhole principle to constrain multiple boxes in a ring, enabling faster similarity search algorithms.
Findings
Stronger filtering conditions improve candidate selection.
Algorithms based on the new principle outperform existing methods.
Minor modifications suffice to adapt current algorithms.
Abstract
The pigeonhole principle states that if items are contained in boxes, then at least one box has no more than items. It is utilized to solve many data management problems, especially for thresholded similarity searches. Despite many pigeonhole principle-based solutions proposed in the last few decades, the condition stated by the principle is weak. It only constrains the number of items in a single box. By organizing the boxes in a ring, we propose a new principle, called the pigeonring principle, which constrains the number of items in multiple boxes and yields stronger conditions. To utilize the new principle, we focus on problems defined in the form of identifying data objects whose similarities or distances to the query is constrained by a threshold. Many solutions to these problems utilize the pigeonhole principle to find candidates that satisfy a filtering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
