Grounding Language Model with Chunking-Free In-Context Retrieval

Hongjin Qian; Zheng Liu; Kelong Mao; Yujia Zhou; Zhicheng Dou

arXiv:2402.09760·cs.CL·February 16, 2024·1 cites

Grounding Language Model with Chunking-Free In-Context Retrieval

Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou

PDF

Open Access 1 Video

TL;DR

This paper introduces a Chunking-Free In-Context retrieval method for RAG systems that improves evidence grounding by using encoded document states and specialized decoding strategies, surpassing traditional chunking approaches.

Contribution

The paper proposes a novel CFIC retrieval approach that eliminates document chunking, enhancing efficiency and accuracy in evidence retrieval for RAG systems.

Findings

01

CFIC outperforms traditional chunking methods in evidence retrieval accuracy.

02

The approach maintains semantic coherence without disrupting document context.

03

Enhanced decoding strategies improve retrieval efficiency and fidelity.

Abstract

This paper presents a novel Chunking-Free In-Context (CFIC) retrieval approach, specifically tailored for Retrieval-Augmented Generation (RAG) systems. Traditional RAG systems often struggle with grounding responses using precise evidence text due to the challenges of processing lengthy documents and filtering out irrelevant content. Commonly employed solutions, such as document chunking and adapting language models to handle longer contexts, have their limitations. These methods either disrupt the semantic coherence of the text or fail to effectively address the issues of noise and inaccuracy in evidence retrieval. CFIC addresses these challenges by circumventing the conventional chunking process. It utilizes the encoded hidden states of documents for in-context retrieval, employing auto-aggressive decoding to accurately identify the specific evidence text required for user queries,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Grounding Language Model with Chunking-Free In-Context Retrieval· underline

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Linear Warmup With Linear Decay · Linear Layer · Attention Dropout · Dense Connections · Softmax · Weight Decay · Byte Pair Encoding · Adam