Entity-Conditioned Question Generation for Robust Attention Distribution   in Neural Information Retrieval

Revanth Gangi Reddy; Md Arafat Sultan; Martin Franz; Avirup Sil; Heng; Ji

arXiv:2204.11373·cs.CL·April 26, 2022

Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval

Revanth Gangi Reddy, Md Arafat Sultan, Martin Franz, Avirup Sil, Heng, Ji

PDF

1 Repo

TL;DR

This paper introduces a method to improve neural information retrieval models by training them to distribute attention more evenly across passage entities, enhancing robustness and performance, especially in zero-shot scenarios.

Contribution

The paper proposes a novel synthetic data generation approach that conditions training on poorly attended entities to promote uniform attention in neural IR models.

Findings

01

Improved attention distribution over entities in IR models.

02

Enhanced retrieval performance on benchmark datasets.

03

Robustness gains observed in zero-shot settings.

Abstract

We show that supervised neural information retrieval (IR) models are prone to learning sparse attention patterns over passage tokens, which can result in key phrases including named entities receiving low attention weights, eventually leading to model under-performance. Using a novel targeted synthetic data generation method that identifies poorly attended entities and conditions the generation episodes on those, we teach neural IR to attend more uniformly and robustly to all entities in a given passage. On two public IR benchmarks, we empirically show that the proposed method helps improve both the model's attention patterns and retrieval performance, including in zero-shot settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blender-nlp/entityconditionedqgen
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.