Pre-train a Discriminative Text Encoder for Dense Retrieval via   Contrastive Span Prediction

Xinyu Ma; Jiafeng Guo; Ruqing Zhang; Yixing Fan; Xueqi Cheng

arXiv:2204.10641·cs.IR·April 25, 2022

Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction

Xinyu Ma, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Xueqi Cheng

PDF

1 Repo

TL;DR

This paper introduces a novel contrastive span prediction pre-training task for text encoders to improve dense retrieval performance, avoiding decoder bypass effects and enhancing discriminative representations.

Contribution

It proposes a new contrastive span prediction method that pre-trains encoders independently, leading to more discriminative text representations for dense retrieval tasks.

Findings

01

Outperforms existing pre-training methods on retrieval benchmarks.

02

Effectively learns discriminative representations without decoder bypass.

03

Significantly improves dense retrieval accuracy.

Abstract

Dense retrieval has shown promising results in many information retrieval (IR) related tasks, whose foundation is high-quality text representation learning for effective search. Some recent studies have shown that autoencoder-based language models are able to boost the dense retrieval performance using a weak decoder. However, we argue that 1) it is not discriminative to decode all the input texts and, 2) even a weak decoder has the bypass effect on the encoder. Therefore, in this work, we introduce a novel contrastive span prediction task to pre-train the encoder alone, but still retain the bottleneck ability of the autoencoder. % Therefore, in this work, we propose to drop out the decoder and introduce a novel contrastive span prediction task to pre-train the encoder alone. The key idea is to force the encoder to generate the text representation close to its own random spans while far…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

albert-ma/costa
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning