Similarity Search and Locality Sensitive Hashing using TCAMs
Rajendra Shinde, Ashish Goel, Pankaj Gupta, Debojyoti Dutta

TL;DR
This paper introduces a novel approximate nearest neighbor search method using TCAMs, achieving near-linear space and constant query time by leveraging a new variant of locality sensitive hashing called TLSH.
Contribution
The work presents TLSH, a new LSH variant utilizing TCAMs for efficient high-speed approximate nearest neighbor search with near-linear space complexity.
Findings
Achieves O(1) query time for approximate NNS
Uses TCAMs to reduce storage to near linear in database size
Operates within known lower bounds in the cell probe model
Abstract
Similarity search methods are widely used as kernels in various machine learning applications. Nearest neighbor search (NNS) algorithms are often used to retrieve similar entries, given a query. While there exist efficient techniques for exact query lookup using hashing, similarity search using exact nearest neighbors is known to be a hard problem and in high dimensions, best known solutions offer little improvement over a linear scan. Fast solutions to the approximate NNS problem include Locality Sensitive Hashing (LSH) based techniques, which need storage polynomial in with exponent greater than , and query time sublinear, but still polynomial in , where is the size of the database. In this work we present a new technique of solving the approximate NNS problem in Euclidean space using a Ternary Content Addressable Memory (TCAM), which needs near linear space and has O(1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Caching and Content Delivery · Algorithms and Data Compression
