Densifying Sparse Representations for Passage Retrieval by   Representational Slicing

Sheng-Chieh Lin; Jimmy Lin

arXiv:2112.04666·cs.IR·December 10, 2021·6 cites

Densifying Sparse Representations for Passage Retrieval by Representational Slicing

Sheng-Chieh Lin, Jimmy Lin

PDF

Open Access 1 Repo 3 Models

TL;DR

This paper introduces a training-free method to densify sparse text representations, making them more effective for retrieval and easily combinable with dense representations, improving retrieval performance.

Contribution

It proposes a novel densification technique for sparse representations that is interpretable, training-free, and compatible with dense models for improved retrieval.

Findings

01

Densified sparse representations improve retrieval effectiveness.

02

Combining DSRs with dense models balances efficiency and accuracy.

03

The approach is interpretable and does not require additional training.

Abstract

Learned sparse and dense representations capture different successful approaches to text retrieval and the fusion of their results has proven to be more effective and robust. Prior work combines dense and sparse retrievers by fusing their model scores. As an alternative, this paper presents a simple approach to densifying sparse representations for text retrieval that does not involve any training. Our densified sparse representations (DSRs) are interpretable and can be easily combined with dense representations for end-to-end retrieval. We demonstrate that our approach can jointly learn sparse and dense representations within a single model and then combine them for dense retrieval. Experimental results suggest that combining our DSRs and dense representations yields a balanced tradeoff between effectiveness and efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

castorini/dhr
jax

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques