RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation
William Fleshman, Benjamin Van Durme

TL;DR
This paper introduces RE-AdaptIR, a method that enhances large language model-based information retrieval systems using only unlabeled data, achieving better results across multiple domains without requiring labeled examples.
Contribution
The paper presents a novel approach, RE-AdaptIR, for improving IR models through reverse engineered adaptation using unlabeled data, reducing reliance on costly labeled datasets.
Findings
Improved IR performance in both training and unseen domains.
Effective zero-shot adaptation without labeled data.
Insights into fine-tuning scenarios for IR models.
Abstract
Large language models (LLMs) fine-tuned for text-retrieval have demonstrated state-of-the-art results across several information retrieval (IR) benchmarks. However, supervised training for improving these models requires numerous labeled examples, which are generally unavailable or expensive to acquire. In this work, we explore the effectiveness of extending reverse engineered adaptation to the context of information retrieval (RE-AdaptIR). We use RE-AdaptIR to improve LLM-based IR models using only unlabeled data. We demonstrate improved performance both in training domains as well as zero-shot in domains where the models have seen no queries. We analyze performance changes in various fine-tuning scenarios and offer findings of immediate use to practitioners.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Semantic Web and Ontologies · Recommender Systems and Techniques
