Grounding Language Model with Chunking-Free In-Context Retrieval
Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou

TL;DR
This paper introduces a Chunking-Free In-Context retrieval method for RAG systems that improves evidence grounding by using encoded document states and specialized decoding strategies, surpassing traditional chunking approaches.
Contribution
The paper proposes a novel CFIC retrieval approach that eliminates document chunking, enhancing efficiency and accuracy in evidence retrieval for RAG systems.
Findings
CFIC outperforms traditional chunking methods in evidence retrieval accuracy.
The approach maintains semantic coherence without disrupting document context.
Enhanced decoding strategies improve retrieval efficiency and fidelity.
Abstract
This paper presents a novel Chunking-Free In-Context (CFIC) retrieval approach, specifically tailored for Retrieval-Augmented Generation (RAG) systems. Traditional RAG systems often struggle with grounding responses using precise evidence text due to the challenges of processing lengthy documents and filtering out irrelevant content. Commonly employed solutions, such as document chunking and adapting language models to handle longer contexts, have their limitations. These methods either disrupt the semantic coherence of the text or fail to effectively address the issues of noise and inaccuracy in evidence retrieval. CFIC addresses these challenges by circumventing the conventional chunking process. It utilizes the encoded hidden states of documents for in-context retrieval, employing auto-aggressive decoding to accurately identify the specific evidence text required for user queries,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Linear Warmup With Linear Decay · Linear Layer · Attention Dropout · Dense Connections · Softmax · Weight Decay · Byte Pair Encoding · Adam
