SEARA: An Automated Approach for Obtaining Optimal Retrievers

Zou Yuheng; Wang Yiran; Tian Yuzhu; Zhu Min; Huang Yanhua

arXiv:2507.06554·cs.IR·September 30, 2025

SEARA: An Automated Approach for Obtaining Optimal Retrievers

Zou Yuheng, Wang Yiran, Tian Yuzhu, Zhu Min, Huang Yanhua

PDF

Open Access 1 Video

TL;DR

SEARA introduces an efficient, automated method for evaluating and optimizing retrievers in Retrieval-Augmented Generation systems, reducing costs and improving domain-specific performance assessment.

Contribution

The paper presents SEARA, a novel subset sampling evaluation technique that enables low-cost, automated retrieval system assessment tailored to specific applications.

Findings

01

Successfully applied to knowledge-based Q&A and travel assistant scenarios.

02

Achieved robust retrieval evaluation with minimal data and comprehensive metrics.

03

Enabled automatic selection of optimal retrievers for different domains.

Abstract

Retrieval-Augmented Generation (RAG) is a core approach for enhancing Large Language Models (LLMs), where the effectiveness of the retriever largely determines the overall response quality of RAG systems. Retrievers encompass a multitude of hyperparameters that significantly impact performance outcomes and demonstrate sensitivity to specific applications. Nevertheless, hyperparameter optimization entails prohibitively high computational expenses. Existing evaluation methods suffer from either prohibitive costs or disconnection from domain-specific scenarios. This paper proposes SEARA (Subset sampling Evaluation for Automatic Retriever Assessment), which addresses evaluation data challenges through subset sampling techniques and achieves robust automated retriever evaluation by minimal retrieval facts extraction and comprehensive retrieval metrics. Based on real user queries, this method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SEARA: An Automated Approach for Obtaining Optimal Retrievers· underline

Taxonomy

TopicsNatural Language Processing Techniques · AI-based Problem Solving and Planning · Topic Modeling