Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization
Alireza Salemi, Hamed Zamani

TL;DR
This paper presents an iterative utility maximization approach to optimize a unified search engine for multiple retrieval-augmented generation models, improving retrieval personalization and performance across diverse tasks.
Contribution
It introduces a novel iterative optimization framework that personalizes retrieval for multiple RAG models using offline and online feedback mechanisms.
Findings
Significant performance improvements over baselines on KILT datasets.
Effective personalization of retrieval results for different RAG agents.
Robustness demonstrated through comprehensive ablation studies.
Abstract
This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents, each with a distinct task, backbone large language model (LLM), and RAG strategy. We introduce an iterative approach where the search engine generates retrieval results for the RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase. This feedback is then used to iteratively optimize the search engine using an expectation-maximization algorithm, with the goal of maximizing each agent's utility function. Additionally, we adapt this to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback to better serve the results for each of them. Experiments on datasets from the Knowledge-Intensive Language Tasks (KILT) benchmark demonstrates that our approach significantly on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · WordPiece · Residual Connection · Linear Warmup With Linear Decay · Dropout · Layer Normalization
