ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang; Mengqi Zhang; Shiguang Wu; Jiahuan Pei; Zhaochun Ren; Maarten de Rijke; Zhumin Chen; Pengjie Ren

arXiv:2404.17288·cs.IR·April 29, 2024

TL;DR

This paper introduces ExcluIR, a benchmark and training set for exclusionary retrieval. It shows that existing retrieval models struggle with exclusionary queries, that training on the new data helps but leaves gaps, and that generative models handle such queries better.

Contribution

The paper presents the first dedicated benchmark and training data for exclusionary retrieval, advancing research in understanding and improving models' handling of exclusionary queries.

Findings

01. Existing models struggle with exclusionary queries

02. Training data improves model performance but gaps remain

03. Generative models handle exclusionary queries better

Abstract

Exclusion is an important and universal linguistic skill that humans use to express what they do not want. However, in the information retrieval community, there is little research on exclusionary retrieval, where users express what they do not want in their queries. In this work, we investigate the scenario of exclusionary retrieval in document retrieval for the first time. We present ExcluIR, a set of resources for exclusionary retrieval, consisting of an evaluation benchmark and a training set that helps retrieval models comprehend exclusionary queries. The evaluation benchmark includes 3,452 high-quality exclusionary queries, each of which has been manually annotated. The training set contains 70,293 exclusionary queries, each paired with a positive document and a negative document. We conduct detailed experiments and analyses, obtaining three main observations: (1) Existing…

Code & Models

Repositories

zwh-sdu/excluir (PyTorch, official)

Videos

ExcluIR: Exclusionary Neural Information Retrieval · underline

Taxonomy

Topics: Topic Modeling · Machine Learning in Healthcare · Neural Networks and Applications

Methods: Sparse Evolutionary Training