Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search
Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang

TL;DR
This paper introduces a transformer-based feature compression method for Approximate Nearest Neighbor Search that enhances efficiency while maintaining or improving search accuracy across various ANNS techniques.
Contribution
It presents a novel transformer-based network with an INRP loss function for effective feature compression in ANNS, improving speed and accuracy.
Findings
Significantly improves efficiency of ANNS methods
Preserves or enhances search accuracy after compression
Applicable across multiple ANNS algorithms
Abstract
We propose a generic feature compression method for Approximate Nearest Neighbor Search (ANNS) problems, which speeds up existing ANNS methods in a plug-and-play manner. Specifically, based on transformer, we propose a new network structure to compress the feature into a low dimensional space, and an inhomogeneous neighborhood relationship preserving (INRP) loss that aims to maintain high search accuracy. Specifically, we use multiple compression projections to cast the feature into many low dimensional spaces, and then use transformer to globally optimize these projections such that the features are well compressed following the guidance from our loss function. The loss function is designed to assign high weights on point pairs that are close in original feature space, and keep their distances in projected space. Keeping these distances helps maintain the eventual top-k retrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Data Management and Algorithms
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Residual Connection · Softmax · Dropout · Adam
