Document Optimization for Black-Box Retrieval via Reinforcement Learning
Omri Uzan, Ron Polonsky, Douwe Kiela, Christopher Potts

TL;DR
This paper introduces a reinforcement learning-based method for optimizing documents to improve black-box retrieval performance across various retriever types, leading to significant accuracy gains and efficiency improvements.
Contribution
It recasts document expansion as a document optimization problem using RL, enabling black-box access and improving retrieval results without retriever fine-tuning.
Findings
Optimized documents improve retrieval metrics like nDCG5 significantly.
Smaller, efficient retrievers outperform larger models after optimization.
Combining document optimization with retriever fine-tuning yields the best results.
Abstract
Document expansion is a classical technique for improving retrieval quality, and is attractive since it shifts computation offline, avoiding additional query-time processing. However, when applied to modern retrievers, it has been shown to degrade performance, often introducing noise that obfuscates the discriminative signal. We recast document expansion as a document optimization problem: a language model or a vision language model is fine-tuned to transform documents into representations that better align with the expected query distribution under a target retriever, using GRPO with the retriever's ranking improvements as rewards. This approach requires only black-box access to retrieval ranks, and is applicable across single-vector, multi-vector and lexical retrievers. We evaluate our approach on code retrieval and visual document retrieval (VDR) tasks. We find that learned document…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
