Boosted Dense Retriever

Patrick Lewis; Barlas Oğuz; Wenhan Xiong; Fabio Petroni; Wen-tau Yih; Sebastian Riedel

arXiv:2112.07771·cs.CL·December 16, 2021·1 cites

TL;DR

DrBoost is a boosting-inspired dense retrieval ensemble. Its component models are trained sequentially, each specializing in the retrieval mistakes of the current ensemble. The resulting representations are more compact, deliver comparable retrieval results, and hold up well under approximate search.

Contribution

The paper presents a dense retrieval ensemble method that yields more compact and more efficient representations while maintaining retrieval quality, which makes deployment cheaper.

Findings

01

Representations are 4x more compact with comparable results.

02

Performs well under approximate search with coarse quantization.

03

Coarse quantization under approximate search reduces latency and bandwidth needs by a further 4x.

Abstract

We propose DrBoost, a dense retrieval ensemble inspired by boosting. DrBoost is trained in stages: each component model is learned sequentially and specialized by focusing only on retrieval mistakes made by the current ensemble. The final representation is the concatenation of the output vectors of all the component models, making it a drop-in replacement for standard dense retrievers at test time. DrBoost enjoys several advantages compared to standard dense retrieval models. It produces representations which are 4x more compact, while delivering comparable retrieval results. It also performs surprisingly well under approximate search with coarse quantization, reducing latency and bandwidth needs by another 4x. In practice, this can make the difference between serving indices from disk versus from memory, paving the way for much cheaper deployments.
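The two mechanics described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the component encoders are replaced by random vectors, the sizes and the gold passage id are hypothetical, and the helper names `concat_components` and `ensemble_mistakes` are invented for illustration. The first part shows why concatenation behaves like an ensemble: the inner product of two concatenated vectors equals the sum of the per-component inner products. The second part shows one plausible reading of "retrieval mistakes made by the current ensemble", namely the top-ranked passages that are not gold, which is an assumption about how the next component's training signal might be collected.

```python
import numpy as np


def concat_components(parts):
    """Concatenate per-component embeddings along the feature axis."""
    return np.concatenate(parts, axis=1)


def ensemble_mistakes(query_vec, passage_mat, gold_ids, k):
    """Return the top-k passage ids scored by the current ensemble that are not gold.

    Assumption: these wrongly retrieved passages are what the next
    component would be trained to push down.
    """
    scores = passage_mat @ query_vec      # one inner-product score per passage
    top_k = np.argsort(-scores)[:k]       # indices of the k highest scores
    return [int(i) for i in top_k if int(i) not in gold_ids]


rng = np.random.default_rng(0)
num_components, dim, num_passages = 4, 8, 100  # hypothetical sizes

# Random stand-ins for the outputs of trained component encoders.
query_parts = [rng.normal(size=(1, dim)) for _ in range(num_components)]
passage_parts = [rng.normal(size=(num_passages, dim)) for _ in range(num_components)]

query_vec = concat_components(query_parts)[0]   # shape (32,)
passage_mat = concat_components(passage_parts)  # shape (100, 32)

# Scoring with concatenated vectors equals summing the component scores.
ensemble_scores = passage_mat @ query_vec
summed_scores = sum(p @ q[0] for p, q in zip(passage_parts, query_parts))

gold_ids = {3}  # hypothetical gold passage for this query
negatives = ensemble_mistakes(query_vec, passage_mat, gold_ids, k=5)
```

Because the ensemble score is a single inner product over the concatenated vector, an index built for a standard dense retriever can serve it unchanged, which matches the abstract's "drop-in replacement" claim.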

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning