$b$-Bit Sketch Trie: Scalable Similarity Search on Integer Sketches
Shunsuke Kanda, Yasuo Tabei

TL;DR
This paper introduces a space-efficient trie data structure for scalable similarity search on integer sketches, significantly improving speed and memory efficiency over existing methods using real-world datasets.
Contribution
The paper proposes the $b$-bit sketch trie, a novel data structure leveraging succinct data structures for efficient similarity search on integer sketches.
Findings
Trie-based index improves search time by up to ten times.
Method reduces memory usage to 10 GiB on billion-scale datasets.
Experimental results outperform state-of-the-art methods in speed and space efficiency.
Abstract
Recently, randomly mapping vectorial data to strings of discrete symbols (i.e., sketches) for fast and space-efficient similarity searches has become popular. Such random mapping is called similarity-preserving hashing and approximates a similarity metric by using the Hamming distance. Although many efficient similarity searches have been proposed, most of them are designed for binary sketches. Similarity searches on integer sketches are in their infancy. In this paper, we present a novel space-efficient trie named -bit sketch trie on integer sketches for scalable similarity searches by leveraging the idea behind succinct data structures (i.e., space-efficient data structures while supporting various data operations in the compressed format) and a favorable property of integer sketches as fixed-length strings. Our experimental results obtained using real-world datasets show that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Analysis and Summarization · Image Retrieval and Classification Techniques
MethodsPruning
