FollowIR: Evaluating and Teaching Information Retrieval Models to Follow   Instructions

Orion Weller; Benjamin Chang; Sean MacAvaney; Kyle Lo; Arman Cohan,; Benjamin Van Durme; Dawn Lawrie; Luca Soldaini

arXiv:2403.15246·cs.IR·May 8, 2024·1 cites

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan,, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

PDF

Open Access 1 Repo 10 Models 5 Datasets 1 Video

TL;DR

This paper introduces FollowIR, a dataset and benchmark for evaluating and improving how information retrieval models follow detailed instructions, demonstrating that models can learn to better understand and utilize complex instructions through fine-tuning.

Contribution

The paper presents FollowIR, a new dataset and evaluation framework for instruction-following in IR models, and shows that fine-tuning enhances models' ability to follow complex instructions.

Findings

01

Existing IR models struggle with complex instructions.

02

Fine-tuning improves models' instruction-following capabilities.

03

FollowIR-7B outperforms baseline models after training.

Abstract

Modern Language Models (LMs) are capable of following long and complex instructions that enable a large and diverse set of user requests. While Information Retrieval (IR) models use these LMs as the backbone of their architectures, virtually none of them allow users to provide detailed instructions alongside queries, thus limiting their ability to satisfy complex information needs. In this work, we study the use of instructions in IR systems. First, we introduce our dataset FollowIR, which contains a rigorous instruction evaluation benchmark as well as a training set for helping IR models learn to better follow real-world instructions. FollowIR repurposes detailed instructions -- also known as narratives -- developed for professional assessors to evaluate retrieval systems. In particular, we build our benchmark from three collections curated for shared tasks at the Text REtrieval…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

orionw/followir
pytorchOfficial

Models

Datasets

Videos

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions· underline

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics

MethodsSparse Evolutionary Training