FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan,, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

TL;DR
This paper introduces FollowIR, a dataset and benchmark for evaluating and improving how information retrieval models follow detailed instructions, demonstrating that models can learn to better understand and utilize complex instructions through fine-tuning.
Contribution
The paper presents FollowIR, a new dataset and evaluation framework for instruction-following in IR models, and shows that fine-tuning enhances models' ability to follow complex instructions.
Findings
Existing IR models struggle with complex instructions.
Fine-tuning improves models' instruction-following capabilities.
FollowIR-7B outperforms baseline models after training.
Abstract
Modern Language Models (LMs) are capable of following long and complex instructions that enable a large and diverse set of user requests. While Information Retrieval (IR) models use these LMs as the backbone of their architectures, virtually none of them allow users to provide detailed instructions alongside queries, thus limiting their ability to satisfy complex information needs. In this work, we study the use of instructions in IR systems. First, we introduce our dataset FollowIR, which contains a rigorous instruction evaluation benchmark as well as a training set for helping IR models learn to better follow real-world instructions. FollowIR repurposes detailed instructions -- also known as narratives -- developed for professional assessors to evaluate retrieval systems. In particular, we build our benchmark from three collections curated for shared tasks at the Text REtrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗jhu-clsp/FollowIR-7Bmodel· 168 dl· ♡ 15168 dl♡ 15
- 🤗sunatte/txt2sqlmodel
- 🤗RichardErkhov/jhu-clsp_-_FollowIR-7B-4bitsmodel· 5 dl5 dl
- 🤗RichardErkhov/jhu-clsp_-_FollowIR-7B-8bitsmodel
- 🤗MachoMaheen/devdock4bitmodel
- 🤗RichardErkhov/jhu-clsp_-_FollowIR-7B-ggufmodel· 442 dl· ♡ 1442 dl♡ 1
- 🤗sicer/arc-agi-legacymodel
- 🤗JilinHu/llemma_7b_3epoch_r32_e5_RQ1model· 1 dl1 dl
- 🤗Xin-Rui/LLAMA-Fac-NEW-A800model· ♡ 1♡ 1
- 🤗Linksome/lmfmodel
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics
MethodsSparse Evolutionary Training
