Recurrent Binary Embedding for GPU-Enabled Exhaustive Retrieval from Billion-Scale Semantic Vectors
Ying Shan, Jian Jiao, Jie Zhu, JC Mao

TL;DR
This paper introduces Recurrent Binary Embedding (RBE), a novel model enabling efficient billion-scale exhaustive retrieval on GPUs by refining binary vectors for high accuracy, significantly improving over previous binary and full-precision models.
Contribution
The paper presents RBE, a new binary embedding model with a residual refinement process, and an end-to-end GPU system for real-time billion-scale retrieval, outperforming existing models.
Findings
RBE outperforms state-of-the-art binary models in AUC.
RBE narrows the accuracy gap with full-precision models by over 80%.
The system achieves real-time retrieval from over a billion candidates.
Abstract
Rapid advances in GPU hardware and multiple areas of Deep Learning open up a new opportunity for billion-scale information retrieval with exhaustive search. Building on top of the powerful concept of semantic learning, this paper proposes a Recurrent Binary Embedding (RBE) model that learns compact representations for real-time retrieval. The model has the unique ability to refine a base binary vector by progressively adding binary residual vectors to meet the desired accuracy. The refined vector enables efficient implementation of exhaustive similarity computation with bit-wise operations, followed by a near- lossless k-NN selection algorithm, also proposed in this paper. The proposed algorithms are integrated into an end-to-end multi-GPU system that retrieves thousands of top items from over a billion candidates in real-time. The RBE model and the retrieval system were evaluated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
Methodsk-Nearest Neighbors
