Discovering Biases in Information Retrieval Models Using Relevance   Thesaurus as Global Explanation

Youngwoo Kim; Razieh Rahimi; James Allan

arXiv:2410.03584·cs.IR·October 7, 2024

Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation

Youngwoo Kim, Razieh Rahimi, James Allan

PDF

Open Access 1 Video

TL;DR

This paper introduces a global explanation method for neural relevance models by creating a relevance thesaurus that uncovers biases like brand name favoritism, enhancing interpretability and understanding of model behavior.

Contribution

The paper proposes a novel global explanation technique using a relevance thesaurus to interpret neural relevance models and identify biases such as brand name bias.

Findings

01

Relevance thesaurus improves ranking effectiveness.

02

Thesaurus reveals brand name bias in models.

03

Method enhances understanding of neural relevance models.

Abstract

Most efforts in interpreting neural relevance models have focused on local explanations, which explain the relevance of a document to a query but are not useful in predicting the model's behavior on unseen query-document pairs. We propose a novel method to globally explain neural relevance models by constructing a "relevance thesaurus" containing semantically relevant query and document term pairs. This thesaurus is used to augment lexical matching models such as BM25 to approximate the neural model's predictions. Our method involves training a neural relevance model to score the relevance of partial query and document segments, which is then used to identify relevant terms across the vocabulary space. We evaluate the obtained thesaurus explanation based on ranking effectiveness and fidelity to the target neural ranking model. Notably, our thesaurus reveals the existence of brand name…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation· underline

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques