Scaling DPPs for RAG: Density Meets Diversity

Xun Sun; Baiheng Xie; Li Huang; Qiang Gao

arXiv:2604.03240·cs.LG·April 7, 2026

Scaling DPPs for RAG: Density Meets Diversity

Xun Sun, Baiheng Xie, Li Huang, Qiang Gao

PDF

TL;DR

This paper introduces ScalDPP, a scalable DPP-based retrieval method for RAG that jointly optimizes for density and diversity, improving evidence relevance and coverage.

Contribution

It proposes a novel DPP-based retrieval mechanism with a lightweight adapter and a new set-level loss to enhance RAG performance.

Findings

01

ScalDPP outperforms standard relevance ranking in experiments.

02

The method effectively balances density and diversity in retrieved evidence.

03

Experimental results validate the superiority of ScalDPP.

Abstract

Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding generation in external knowledge, yielding relevance responses that are aligned with factual evidence and evolving corpora. Standard RAG pipelines construct context through relevance ranking, performing point-wise scoring between the user query and each corpora chunk. This formulation, however, ignores interactions among retrieved candidates, leading to redundant contexts that dilute density and fail to surface complementary evidence. We argue that effective retrieval should optimize jointly for both density and diversity, ensuring the grounding evidence that is dense in information yet diverse in coverage. In this study, we propose ScalDPP, a diversity-aware retrieval mechanism for RAG that incorporates Determinantal Point Processes (DPPs) through a lightweight P-Adapter, enabling scalable modeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.