Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

Paulo Roberto de Moura J\'unior; Jean Lelong; Annabelle Blangero

arXiv:2603.25333·cs.CL·March 27, 2026

Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

Paulo Roberto de Moura J\'unior, Jean Lelong, Annabelle Blangero

PDF

Open Access

TL;DR

This paper introduces Adaptive Chunking, a novel framework that selects optimal document chunking strategies for RAG based on intrinsic metrics, significantly improving answer correctness and retrieval success across diverse domains.

Contribution

The paper proposes a new adaptive chunking framework with five intrinsic metrics and two novel chunkers, enhancing RAG performance without changing models or prompts.

Findings

01

Increases RAG answer correctness to 72% from 62-64%.

02

Boosts successfully answered questions by over 30%.

03

Demonstrates effectiveness across legal, technical, and social science texts.

Abstract

The effectiveness of Retrieval-Augmented Generation (RAG) is highly dependent on how documents are chunked, that is, segmented into smaller units for indexing and retrieval. Yet, commonly used "one-size-fits-all" approaches often fail to capture the nuanced structure and semantics of diverse texts. Despite its central role, chunking lacks a dedicated evaluation framework, making it difficult to assess and compare strategies independently of downstream performance. We challenge this paradigm by introducing Adaptive Chunking, a framework that selects the most suitable chunking strategy for each document based on a set of five novel intrinsic, document-based metrics: References Completeness (RC), Intrachunk Cohesion (ICC), Document Contextual Coherence (DCC), Block Integrity (BI), and Size Compliance (SC), which directly assess chunking quality across key dimensions. To support this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Information Retrieval and Search Behavior