ReSLLM: Large Language Models are Strong Resource Selectors for Federated Search
Shuai Wang, Shengyao Zhuang, Bevan Koopman, Guido Zuccon

TL;DR
ReSLLM leverages large language models for effective zero-shot resource selection in federated search, enhanced further through an unsupervised fine-tuning protocol called SLAT, improving efficiency and relevance without extensive labeled data.
Contribution
The paper introduces ReSLLM, a novel zero-shot resource selection method using LLMs, and proposes SLAT, an unsupervised fine-tuning protocol that boosts performance without labeled training data.
Findings
ReSLLM achieves competitive zero-shot resource selection performance.
SLAT fine-tuning significantly improves ReSLLM effectiveness.
ReSLLM reduces reliance on labor-intensive labeled datasets.
Abstract
Federated search, which involves integrating results from multiple independent search engines, will become increasingly pivotal in the context of Retrieval-Augmented Generation pipelines empowering LLM-based applications such as chatbots. These systems often distribute queries among various search engines, ranging from specialized (e.g., PubMed) to general (e.g., Google), based on the nature of user utterances. A critical aspect of federated search is resource selection - the selection of appropriate resources prior to issuing the query to ensure high-quality and rapid responses, and contain costs associated with calling the external search engines. However, current SOTA resource selection methodologies primarily rely on feature-based learning approaches. These methods often involve the labour intensive and expensive creation of training labels for each resource. In contrast, LLMs have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Topic Modeling
