Control Token with Dense Passage Retrieval

Juhwan Lee; Jisu Kim

arXiv:2405.13008·cs.CL·May 24, 2024

Control Token with Dense Passage Retrieval

Juhwan Lee, Jisu Kim

PDF

Open Access

TL;DR

This paper improves retrieval-augmented generation in large language models by enhancing dense passage retrieval with control tokens, significantly reducing hallucinations and increasing retrieval accuracy.

Contribution

The novel integration of control tokens into the DPR model enhances retrieval accuracy, addressing limitations in existing retrieval-augmented generation methods.

Findings

01

13% improvement in Top-1 accuracy

02

4% improvement in Top-20 accuracy

03

Enhanced retrieval performance over standard DPR

Abstract

This study addresses the hallucination problem in large language models (LLMs). We adopted Retrieval-Augmented Generation(RAG) (Lewis et al., 2020), a technique that involves embedding relevant information in the prompt to obtain accurate answers. However, RAG also faced inherent issues in retrieving correct information. To address this, we employed the Dense Passage Retrieval(DPR) (Karpukhin et al., 2020) model for fetching domain-specific documents related to user queries. Despite this, the DPR model still lacked accuracy in document retrieval. We enhanced the DPR model by incorporating control tokens, achieving significantly superior performance over the standard DPR model, with a 13% improvement in Top-1 accuracy and a 4% improvement in Top-20 accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Software Testing and Debugging Techniques · Algorithms and Data Compression

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Linear Warmup With Linear Decay · Weight Decay · Attention Dropout · Linear Layer · Byte Pair Encoding · Adam · Residual Connection