Large Language Models as Foundations for Next-Gen Dense Retrieval: A   Comprehensive Empirical Assessment

Kun Luo; Minghao Qin; Zheng Liu; Shitao Xiao; Jun Zhao; Kang Liu

arXiv:2408.12194·cs.CL·August 26, 2024

Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment

Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu

PDF

Open Access 1 Video

TL;DR

This paper provides a comprehensive empirical assessment of large language models as backbone encoders for dense retrieval, highlighting their advantages in accuracy, generalization, and versatility across various retrieval tasks.

Contribution

It systematically evaluates over 15 models, revealing how size and pretraining influence retrieval performance and generalization capabilities.

Findings

01

Larger models improve in domain accuracy and data efficiency.

02

Extensive pretraining enhances retrieval performance.

03

Larger models excel in zero shot and multi-task retrieval scenarios.

Abstract

Pretrained language models like BERT and T5 serve as crucial backbone encoders for dense retrieval. However, these models often exhibit limited generalization capabilities and face challenges in improving in domain accuracy. Recent research has explored using large language models (LLMs) as retrievers, achieving SOTA performance across various tasks. Despite these advancements, the specific benefits of LLMs over traditional retrievers and the impact of different LLM configurations, such as parameter sizes, pretraining duration, and alignment processes on retrieval tasks remain unclear. In this work, we conduct a comprehensive empirical study on a wide range of retrieval tasks, including in domain accuracy, data efficiency, zero shot generalization, lengthy retrieval, instruction based retrieval, and multi task learning. We evaluate over 15 different backbone LLMs and non LLMs. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment· underline

Taxonomy

TopicsTopic Modeling

MethodsWordPiece · Linear Warmup With Linear Decay · Adam · Weight Decay · Attention Is All You Need · Gated Linear Unit · Dense Connections · Network On Network · Byte Pair Encoding · BERT